Indian television shows news - Justin Gatlin Doping - Free Ecards - Debt Consolidation - Find jobs

PDA

View Full Version : GoogleBot Slams My Site


TwisterMc
Sep 9th 2004, 10:13 am
Googlebot came though my site eating up 3.12 Gigs worth of badwith. OMG I better do some directory blocking before they come back and eat up another three gig because then my site will be shut down cuz I only get 5 Gigs :eek:

stephfoster
Sep 9th 2004, 10:21 am
Ouch... time for some quick decisions.

Badwith... good typo for when that much bandwidth is eaten so fast. Almost appropriate.

Jackobo007
Sep 9th 2004, 10:26 am
Googlebot came though my site eating up 3.12 Gigs worth of badwith. OMG I better do some directory blocking before they come back and eat up another three gig because then my site will be shut down cuz I only get 5 Gigs :eek:

Go on securemate.com they give unlimited bandwith for very reasonable price.

disgust
Sep 9th 2004, 10:31 am
no hosting providor can really offer "unlimited bandwidth." most "unlimited" bandwidth providors will force you to leave once you're using too much- usually around 100-200 gigs.

quality unmetered servers are the closest you'll get to unlimited, but even they are limited by the speed of the line they're connected to

mxlabs
Sep 9th 2004, 11:04 am
3 GB of transfer in 9 days caused by googlebot alone seems to be impossible. make sure you have a look at the incoming IPs carrying the "googlebot" user agent.

disgust
Sep 9th 2004, 11:24 am
assuming all of your pages were 100KB, it'd need to load about 30,000 pages to use that much. it's possible, depending on how many pages you have... but if you have THAT many pages, you're probably on a more serious hosting plan

if the average page was 50KB, it'd about 60K pageviews.

if it was 25KB, then about 120K pageviews

TwisterMc
Sep 9th 2004, 11:41 am
I think it was due to a dynamic calendar which could technically go on forever. Making hundreds of dynamic pages ;)

arestia
Sep 9th 2004, 6:29 pm
i wonder if google has any sort of procedures in the bot to figure out that the site is generating infinate pages so it doesnt get into a loop forever.

-dan

TwisterMc
Sep 10th 2004, 8:01 am
I wanna know how to control that. So, say, Google can only use up 2 gigs of bandwidth at a time. That'd be nice.

bobafind
Sep 13th 2004, 9:40 am
Or you could just robots.txt that particular script (unless you want the actual pages with calendar content to get indexed).

Hmmm..

If you wrote the calendar script and know the inner workings, you could make a calendar archive page (like forums do) with "static" html links using mod rewrite and make it all SEO'd with keyword rich event titles and such. Then block google from looking at the actual, dynamic calendar.

Hey that's not a bad idea...

TwisterMc
Sep 13th 2004, 10:34 am
I use PHPiCalendar that way it syncs my home calendar with the internet. :D I'm not writing my own script. Ohh no. ;)

Which reminds me, I should upgrade my calendar script.

kuhleen
Sep 14th 2004, 1:04 pm
how can you know how much each crawler consumes in terms of bandwidth?

TwisterMc
Sep 14th 2004, 1:18 pm
I have awstats (http://awstats.sourceforge.net/) installed

TwisterMc
Sep 21st 2004, 7:23 am
Now a spider called Pompos slammed my site using 2.5 gigs worth of bandwidth. Good thing I bought more because I'm using more bandwidth this month than ever before. However, I don't understand how it could eat up that much since blocked my dynamic calendars. I just hope I can make it through the end of the month as bandwidth is expensive from this host. :rolleyes: