Google Bot using HUGE amounts of bandwidth

Discussion in 'SEO' started by Josh.uk, Mar 1, 2007.

  1. #1
    Hey Guys

    My new site www.tubefetch.com is only 3 weeks old, and i have noticed that the site is being crawled huge amounts...especially by google bot.

    In the first 3 weeks we had 26 different robots crawl the site, and google bot ALONE used over 12GB of bandwidth. Today (first day of the month) we have had 6 different crawlers including google bot, adsense, MSN and Yahoo and Google Bot has used 1.1GB of bandwidth (today!).

    The re-visit meta tag is set to 20 days, but the google, msn and yahoo bot visited both today and yesterday.

    Does anybody know why this is? It just doesn't seem to make any sense to me (not that i am complaining!).

    Thanks
    Josh
     
    Josh.uk, Mar 1, 2007 IP
  2. ServerUnion

    ServerUnion Peon

    Messages:
    3,611
    Likes Received:
    296
    Best Answers:
    0
    Trophy Points:
    0
    #2
    Not sure, are they indexing any videos on the site? Just a thought.
     
    ServerUnion, Mar 1, 2007 IP
  3. exponent

    exponent Peon

    Messages:
    1,243
    Likes Received:
    60
    Best Answers:
    0
    Trophy Points:
    0
    #3
    You could set your .HTACCESS to keep google from caching all your images. I would set up some web exclusions as well.
     
    exponent, Mar 1, 2007 IP
  4. Josh.uk

    Josh.uk Peon

    Messages:
    178
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #4
    Thanks for the replies.....I dont know if google is indexing the videos, but even if it did, there would not be that amount of videos. Also there is nowhere near enough images on the site to cause that amount of usage.
     
    Josh.uk, Mar 1, 2007 IP
  5. exponent

    exponent Peon

    Messages:
    1,243
    Likes Received:
    60
    Best Answers:
    0
    Trophy Points:
    0
    #5
    You can find a robot exclusion .htaccess that will prevent offline browsers and non-name / malicious robots from accessing your page. Who gives a damn if johnnysearchbucket.org's home-made search robot caches your page? Just worry about Google, Yahoo, and MSN, mainly.
     
    exponent, Mar 1, 2007 IP
  6. mhdoc

    mhdoc Tauren

    Messages:
    840
    Likes Received:
    33
    Best Answers:
    0
    Trophy Points:
    0
    #6
    I have had that happen in the past. A site with less than 25 megs of total content (all static pages) suddenly started using multiple gigs of bandwidth to googlebot. It lasted for a day or two, stopped for a couple of days, did it again and then stopped. Fortunately it stopped before my site got shut down for bandwidth overages :)

    I could never tell what caused it and saw no corresponding jumps in SERP's as a result of it.
     
    mhdoc, Mar 1, 2007 IP
  7. Josh.uk

    Josh.uk Peon

    Messages:
    178
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #7
    I'll have to see how it goes...but yes i am only really worries about the main search engines, of which all are visiting daily :D
     
    Josh.uk, Mar 1, 2007 IP
  8. wmghori

    wmghori Well-Known Member

    Messages:
    1,061
    Likes Received:
    14
    Best Answers:
    0
    Trophy Points:
    160
    #8
    why not contact google about this?
     
    wmghori, Mar 1, 2007 IP
  9. kentuckyslone

    kentuckyslone Notable Member

    Messages:
    4,371
    Likes Received:
    367
    Best Answers:
    0
    Trophy Points:
    205
    #9
    I looked at your site and I can't say just what is causing this. I had a similar thing happen with one of my sites last summer. Google was chewing up over 1 gig of bandwidth in less than a weeks time. It slowed down after about a month and finally dropped back down to normal. I never did really figure out why it was happening. The site is question was straight html with no scripts at all.
     
    kentuckyslone, Mar 2, 2007 IP
  10. it career

    it career Notable Member

    Messages:
    3,562
    Likes Received:
    155
    Best Answers:
    0
    Trophy Points:
    270
    #10
    Ask google to pay for your badwidth
     
    it career, Mar 2, 2007 IP
  11. ajsa52

    ajsa52 Well-Known Member

    Messages:
    3,426
    Likes Received:
    125
    Best Answers:
    0
    Trophy Points:
    160
    #11
    Maybe are malicious bots using Google's "user agent".
    You should check their IP range (66.249.64.0 - 66.249.95.255)
    I'm verifying this on my site with this regular expression:
    /^66\.249\.(6[4-9]|[7-8][0-9]|9[0-5])\./

    Sometimes Google is using others IP and others "user agents" to ensure you're not fooling Googlebot. You can be banned from Google index if you provide different content for Googlebot and for others.
     
    ajsa52, Mar 2, 2007 IP