How to get robots.txt re-crawled by google/yahoo, etc?

Discussion in 'robots.txt' started by artboy, Mar 10, 2008.

  1. #1
    How often do search engines hit up this file? I know .htaccess is hit for every single website request, but with robots.txt, it seems like google/etc only do it every once in a while.

    Any idea of how often they do it, and/or how to force them to realize it's been updated?
     
    artboy, Mar 10, 2008 IP
  2. nessie

    nessie Active Member

    Messages:
    284
    Likes Received:
    9
    Best Answers:
    0
    Trophy Points:
    80
    #2
    You can get an idea of how often they [Google] do that by Google webmaster tools.
     
    nessie, Mar 14, 2008 IP
  3. shivam

    shivam Peon

    Messages:
    679
    Likes Received:
    10
    Best Answers:
    0
    Trophy Points:
    0
    #3
    Hi

    1) you need to create sitemap.xml and sitemap.txt
    2) add/edit yoursite.com/robots.txt url in those sitemaps (both)
    3) have metatag commands like <meta name=robots content="index, follow, all">
    4) join google webmaster tools and submit your robots.txt, sitemap.xml, verify your site by add google give codes in your site. (if you don't verify it is not worth it so you must have to verify, now if you don't know we can help you so please pm)
    5) submit your sitemap URL to yahoo as you submiting your site URL normaly to yahoo.

    please PM me know if you not sure of anything
     
    shivam, Mar 14, 2008 IP
  4. SearchMarketing

    SearchMarketing Peon

    Messages:
    158
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #4
    By the way does anyone have a sitemap generator I can use? I really need one.

    Is there a way to create sitemap on blogspot?
     
    SearchMarketing, Mar 16, 2008 IP
  5. nessie

    nessie Active Member

    Messages:
    284
    Likes Received:
    9
    Best Answers:
    0
    Trophy Points:
    80
    #5
    nessie, Mar 16, 2008 IP
  6. SearchMarketing

    SearchMarketing Peon

    Messages:
    158
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #6
    thank you. sorry for being a bit off topic from the thread. Please continue :)
     
    SearchMarketing, Mar 16, 2008 IP
  7. lhughes33309

    lhughes33309 Peon

    Messages:
    3
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #7
    Hi,

    Here is the quick & dirty way to get google to load your latest robots.txt file:
    Login to webmaster tools, select Url Tools at the bottom of the list, next select the url removal tool
    and select a page on your site for removal.

    It will now say pending, now watch your httpd logs, in less than 1 hour(most times about 10 - 15 mins.) googlebot will grab your robots.txt file.

    Now go back to Webmaster tools url removal tool and select cancel for the page you selected earlier.

    Now check the robots.txt file in Webmaster Tools and you will see it is only a few minutes old.

    That is how you do it (After google reads this, it might change)......

    lhughes33309
     
    lhughes33309, Mar 21, 2008 IP
  8. manish.chauhan

    manish.chauhan Well-Known Member

    Messages:
    1,682
    Likes Received:
    35
    Best Answers:
    0
    Trophy Points:
    110
    #8
    Just use yoursite.blogspot.com/atom.xml as a sitemap...
    and believe me this will work..:)
     
    manish.chauhan, Apr 4, 2008 IP