GoogleBot Slams My Site

Discussion in 'Site & Server Administration' started by TwisterMc, Sep 9, 2004.

  1. #1
    Googlebot came through my site eating up 3.12 Gigs worth of badwith. OMG I better do some directory blocking before they come back and eat up another three gigs, because then my site will be shut down cuz I only get 5 Gigs :eek:
     
    TwisterMc, Sep 9, 2004 IP
  2. stephfoster

    stephfoster Well-Known Member

    #2
    Ouch... time for some quick decisions.

    Badwith... good typo for when that much bandwidth is eaten so fast. Almost appropriate.
     
    stephfoster, Sep 9, 2004 IP
  3. Jackobo007

    Jackobo007 Peon

    #3
    Go to securemate.com; they give unlimited bandwidth for a very reasonable price.
     
    Jackobo007, Sep 9, 2004 IP
  4. disgust

    disgust Guest

    #4
    no hosting provider can really offer "unlimited bandwidth." most "unlimited" bandwidth providers will force you to leave once you're using too much, usually around 100-200 gigs.

    quality unmetered servers are the closest you'll get to unlimited, but even they are limited by the speed of the line they're connected to
     
    disgust, Sep 9, 2004 IP
  5. mxlabs

    mxlabs Peon

    #5
    3 GB of transfer in 9 days caused by googlebot alone seems to be impossible. make sure you have a look at the incoming IPs carrying the "googlebot" user agent.
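
A rough sketch of that check, assuming the server writes Apache "combined" format access logs (an assumption; the thread never says which server or log format is in use). It adds up the bytes served per IP for requests whose user agent contains "googlebot". The sample lines and IP addresses are made up for illustration.

```python
import re
from collections import defaultdict

# Apache "combined" log format (assumed; adjust the pattern for your server).
LOG_PATTERN = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[[^\]]+\] "[^"]*" \d{3} (?P<bytes>\d+|-) '
    r'"[^"]*" "(?P<agent>[^"]*)"'
)


def bytes_by_ip(lines, agent_substring="googlebot"):
    """Sum response bytes per client IP for requests whose user agent matches."""
    totals = defaultdict(int)
    for line in lines:
        match = LOG_PATTERN.match(line)
        if not match:
            continue  # skip lines that are not in the expected format
        if agent_substring.lower() not in match.group("agent").lower():
            continue  # a different client
        size = match.group("bytes")
        totals[match.group("ip")] += 0 if size == "-" else int(size)
    return dict(totals)


# Made-up sample lines using documentation-only IP ranges.
sample = [
    '192.0.2.10 - - [09/Sep/2004:10:00:00 -0500] "GET /calendar/?m=1 HTTP/1.1" 200 51200 "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"',
    '192.0.2.10 - - [09/Sep/2004:10:00:02 -0500] "GET /calendar/?m=2 HTTP/1.1" 200 25600 "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"',
    '198.51.100.7 - - [09/Sep/2004:10:00:05 -0500] "GET /index.html HTTP/1.1" 200 102400 "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"',
    '203.0.113.5 - - [09/Sep/2004:10:00:09 -0500] "GET /index.html HTTP/1.1" 200 2048 "-" "Mozilla/5.0"',
]

for ip, total in sorted(bytes_by_ip(sample).items(), key=lambda item: -item[1]):
    print(ip, total)
```

Any IP that accounts for a big share of the total can then be checked with a reverse DNS lookup, since the user agent string on its own is easy to fake.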
     
    mxlabs, Sep 9, 2004 IP
  6. disgust

    disgust Guest

    #6
    assuming all of your pages were 100KB, it'd need to load about 30,000 pages to use that much. it's possible, depending on how many pages you have... but if you have THAT many pages, you're probably on a more serious hosting plan

    if the average page was 50KB, it'd be about 60K pageviews.

    if it was 25KB, then about 120K pageviews
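
The arithmetic above, as a small sketch. It uses the same round numbers (3 GB total, with 1 GB taken as 1,000,000 KB, which is a simplifying assumption); the function name is just for illustration.

```python
KB_PER_GB = 1_000_000  # decimal units, matching the rounded figures in the post


def pages_needed(total_gb, page_kb):
    """Return how many page loads of size page_kb add up to total_gb."""
    total_kb = round(total_gb * KB_PER_GB)
    return total_kb // page_kb


for page_kb in (100, 50, 25):
    print(f"{page_kb} KB pages: about {pages_needed(3, page_kb):,} page loads")
```

Plugging in the reported 3.12 GB instead of 3 GB raises each figure by about 4 percent, so the estimates hold either way.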
     
    disgust, Sep 9, 2004 IP
  7. TwisterMc

    TwisterMc Mac Guru

    #7
    I think it was due to a dynamic calendar which could technically go on forever, making hundreds of dynamic pages ;)
     
    TwisterMc, Sep 9, 2004 IP
  8. arestia

    arestia Peon

    #8
    i wonder if google has any sort of procedures in the bot to figure out that the site is generating infinite pages so it doesn't get into a loop forever.

    -dan
     
    arestia, Sep 9, 2004 IP
  9. TwisterMc

    TwisterMc Mac Guru

    #9
    I wanna know how to control that. So, say, Google can only use up 2 gigs of bandwidth at a time. That'd be nice.
     
    TwisterMc, Sep 10, 2004 IP
  10. bobafind

    bobafind Peon

    #10
    Or you could just robots.txt that particular script (unless you want the actual pages with calendar content to get indexed).
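
A minimal sketch of what that robots.txt rule might look like, checked with Python's standard `urllib.robotparser`. The `/calendar/` path and the example.com URLs are hypothetical; the real path depends on where the calendar script is installed.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: keep every crawler out of the calendar directory.
ROBOTS_TXT = """\
User-agent: *
Disallow: /calendar/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(url, agent="Googlebot"):
    """Return True if the rules above let this user agent fetch the URL."""
    return parser.can_fetch(agent, url)


print(allowed("http://example.com/index.html"))
print(allowed("http://example.com/calendar/month.php?getdate=20040901"))
```

Keep in mind that robots.txt is only a request. Well-behaved crawlers honor it, but nothing forces a spider to obey.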

    Hmmm..

    If you wrote the calendar script and know the inner workings, you could make a calendar archive page (like forums do) with "static" HTML links using mod_rewrite and make it all SEO'd with keyword-rich event titles and such. Then block Google from looking at the actual, dynamic calendar.

    Hey that's not a bad idea...
     
    bobafind, Sep 13, 2004 IP
  11. TwisterMc

    TwisterMc Mac Guru

    #11
    I use PHPiCalendar; that way it syncs my home calendar with the internet. :D I'm not writing my own script. Ohh no. ;)

    Which reminds me, I should upgrade my calendar script.
     
    TwisterMc, Sep 13, 2004 IP
  12. kuhleen

    kuhleen Peon

    #12
    how can you know how much each crawler consumes in terms of bandwidth?
     
    kuhleen, Sep 14, 2004 IP
  13. TwisterMc

    TwisterMc Mac Guru

    #13
    TwisterMc, Sep 14, 2004 IP
  14. TwisterMc

    TwisterMc Mac Guru

    #14
    Now a spider called Pompos slammed my site, using 2.5 gigs worth of bandwidth. Good thing I bought more, because I'm using more bandwidth this month than ever before. However, I don't understand how it could eat up that much since I blocked my dynamic calendars. I just hope I can make it through the end of the month, as bandwidth is expensive from this host. :rolleyes:
     
    TwisterMc, Sep 21, 2004 IP