1. Advertising
    y u no do it?

    Advertising (learn more)

    Advertise virtually anything here, with CPM banner ads, CPM email ads and CPC contextual links. You can target relevant areas of the site and show ads based on geographical location of the user if you wish.

    Starts at just $1 per CPM or $0.10 per CPC.

Too many googlebot hits

Discussion in 'robots.txt' started by weathor, Apr 6, 2010.

  1. #1
    My site has 42500 hits from googlebot these days


    [​IMG]


    A few days ago it had only some hundreds.. Why is that?
     
    weathor, Apr 6, 2010 IP
  2. syuxx

    syuxx Peon

    Messages:
    278
    Likes Received:
    4
    Best Answers:
    0
    Trophy Points:
    0
    #2
    That is so weird. And it eats a lot of your bandwiths. Never encounter this such of problems before.
     
    syuxx, Apr 6, 2010 IP
  3. weathor

    weathor Peon

    Messages:
    148
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #3
    any ideas??

    keeps raising

    [​IMG]
     
    weathor, Apr 8, 2010 IP
  4. bimbimopss

    bimbimopss Peon

    Messages:
    19
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #4
    keep raising ...
     
    bimbimopss, Apr 9, 2010 IP
  5. Madax

    Madax Peon

    Messages:
    50
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    0
    #5
    Whats is your site and does it have alot of internal and external links ?
     
    Madax, Apr 10, 2010 IP
  6. weathor

    weathor Peon

    Messages:
    148
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #6
    I believe it has started indexing each city from my base.. If not I cant explain it! When you search in England for cities starting with "a" you take these links
     
    weathor, Apr 10, 2010 IP
  7. Akhlis

    Akhlis Active Member

    Messages:
    185
    Likes Received:
    2
    Best Answers:
    0
    Trophy Points:
    75
    #7
    O God :)
    Much spamm from google.
    Just make your Robots.txt to dont allow robots to visit your site only 1 time per day.
    (2.2gb bandwidth lol)
     
    Akhlis, Apr 11, 2010 IP
  8. soundingloud

    soundingloud Peon

    Messages:
    302
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #8
    is this google bot spamming?
     
    soundingloud, Apr 19, 2010 IP
  9. Aaron111

    Aaron111 Well-Known Member

    Messages:
    4,301
    Likes Received:
    29
    Best Answers:
    0
    Trophy Points:
    185
    #9
    sorry to here this ..... may be use Yahoo and msn b MARKS .....
     
    Aaron111, Apr 20, 2010 IP
  10. Dan Morgan

    Dan Morgan Peon

    Messages:
    31
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #10
    The last time Googlebot started hammering my site it turned out to be from a hack where the pages were bloated but only for the search engines UA.

    Is your site new in the last year?
     
    Dan Morgan, Apr 20, 2010 IP
  11. weathor

    weathor Peon

    Messages:
    148
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #11
    it is a few months old..


    Googlebot 153978 5.45 GB 21 Apr 2010 - 01:48
    Unknown robot (identified by 'robot') 972 27.49 MB 21 Apr 2010 - 01:08
    Unknown robot (identified by 'spider') 88 2.11 MB 20 Apr 2010 - 08:39
    Yahoo Slurp 81 1.36 MB 20 Apr 2010 - 16:06
    MSNBot-media 81 389.56 KB 19 Apr 2010 - 13:04
    Alexa (IA Archiver) 79 8.05 MB 20 Apr 2010 - 07:30
    Unknown robot (identified by 'bot*') 65 1.58 MB 20 Apr 2010 - 23:15
    MSNBot 49 780.87 KB 20 Apr 2010 - 23:35
    Unknown robot (identified by empty user agent string) 47 1.43 MB 20 Apr 2010 - 05:07
    Ask 15 1.96 MB 20 Apr 2010 - 12:12
    Unknown robot (identified by 'crawl') 5 127.64 KB 20 Apr 2010 - 02:54
    Unknown robot (identified by '*bot') 2 50.94 KB 19 Apr 2010 - 09:53
    Netcraft 2 24.76 KB 09 Apr 2010 - 03:08
     
    weathor, Apr 20, 2010 IP
  12. chauhanmanish

    chauhanmanish Peon

    Messages:
    36
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #12
    You can also use this in robots.txt, which should help:

    User-agent: * #put spider name in here, or leave it as wildcard
    Crawl-delay: 10
     
    chauhanmanish, Apr 21, 2010 IP
  13. vagrant

    vagrant Peon

    Messages:
    2,284
    Likes Received:
    181
    Best Answers:
    0
    Trophy Points:
    0
    #13
    a) you have not got a robots.txt so it's looking at every possible thing you let it.

    b) a look at google shows what it's indexing (many with the same page title :( )
    site:www.theweatherland.com
     
    vagrant, Apr 24, 2010 IP
  14. amazingserviceprovider

    amazingserviceprovider Peon

    Messages:
    395
    Likes Received:
    3
    Best Answers:
    0
    Trophy Points:
    0
    #14
    High Bandwidth Mate. Go to your webmaster tools, decrease the speed of the crawling. So that your bandwidth will be saved.
     
    amazingserviceprovider, Apr 25, 2010 IP
  15. joshvelco

    joshvelco Peon

    Messages:
    819
    Likes Received:
    8
    Best Answers:
    0
    Trophy Points:
    0
    #15
    Edit your robots.txt to increase the lapse between crawls. 2.2GB is rediculous, unlucky!
     
    joshvelco, Apr 25, 2010 IP
  16. Aaron111

    Aaron111 Well-Known Member

    Messages:
    4,301
    Likes Received:
    29
    Best Answers:
    0
    Trophy Points:
    185
    #16
    what would be wrong with to much Google crawling??? more the maryer
     
    Aaron111, Apr 27, 2010 IP
  17. weathor

    weathor Peon

    Messages:
    148
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #17
    April 2010...

    [​IMG]
     
    weathor, May 2, 2010 IP
  18. sophieharris91

    sophieharris91 Peon

    Messages:
    58
    Likes Received:
    2
    Best Answers:
    0
    Trophy Points:
    0
    #18
    Hey, this 7.62 GB is tooooo much. There must be some issue on your server or you have selected high crawl rate by setting it up within Webmasters. Check it out, otherwise you'd be overlapping your monthly bandwidth at hosting.
     
    sophieharris91, May 3, 2010 IP
  19. Adraco

    Adraco Active Member

    Messages:
    479
    Likes Received:
    14
    Best Answers:
    0
    Trophy Points:
    60
    #19
    Have you created any kind of coding loop which will get the Google robot stuck?

    Else I would suspect someone is using your bandwidth/some images on your site by identifying themselves as the Google Robot. Go in to webmaster.google.com and check your site there, see the crawl rate and the average number of pages crawled per day. If that number doesn't nearly add up towards the mentioned bandwidth, ask your host and see if they can help you out by identifying different IP numbers accessing the site.
     
    Adraco, May 3, 2010 IP
  20. Aaron111

    Aaron111 Well-Known Member

    Messages:
    4,301
    Likes Received:
    29
    Best Answers:
    0
    Trophy Points:
    185
    #20
    I agree -- why worry though he is stuck in G search... ide be happy no worries... :) no stress ...
     
    Aaron111, May 4, 2010 IP