What is robots.txt and how is it important for SEO?

Discussion in 'robots.txt' started by milan.mysterio, Jun 8, 2010.

  1. #1
    What is robots.txt and how is it important for SEO?
    What happens if this file is missing from a website?
     
    milan.mysterio, Jun 8, 2010 IP
  2. harry009

    harry009 Active Member

    #2
    robots.txt is nothing but a text file. It contains directives that instruct robots when they visit our site. We can tell robots not to crawl particular pages.
     
    harry009, Jun 10, 2010 IP
  3. manish.chauhan

    manish.chauhan Well-Known Member

    #3
    The robots.txt file is an important part of a website: it tells crawlers which parts of the site they should visit and which they should ignore.
     
    manish.chauhan, Jun 10, 2010 IP
  4. stephan2307

    stephan2307 Well-Known Member

    #4
    Most people think robots.txt is only for telling spiders what they may and may not access.

    However there are many more things you can do.

    1. Point spiders to your sitemaps (Sitemap)
    2. Give them a request rate (Request-rate, e.g. 1/20 means 1 page every 20 seconds)
    3. Tell them the hours during which you want them to visit (Visit-time)
    4. Tell them how long to wait before requesting the next page (Crawl-delay)

    If you have a popular website, using these settings in the robots.txt file can help you save resources and keep the site running faster. You can also schedule the spiders to crawl your website at a time when it usually has few visitors (night time).

    In the end, though, there are more spiders that ignore robots.txt instructions than there are that obey them.
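To make the four directives listed above concrete, here is a minimal sketch that parses a sample robots.txt with Python's standard `urllib.robotparser` (the `site_maps()` call needs Python 3.8 or later). The file contents, the `example.com` URLs, and the values are made up for illustration. `Request-rate`, `Visit-time`, and `Crawl-delay` are non-standard extensions, so whether a given crawler honours them is an assumption to check per crawler. The standard-library parser does not read `Visit-time` at all; it simply skips that line.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt contents; paths, URL, and values are illustrative only.
ROBOTS_TXT = """\
User-agent: *
Crawl-delay: 10
Request-rate: 1/20
Visit-time: 0100-0500
Disallow: /private/

Sitemap: https://example.com/sitemap.xml
"""


def parse_robots(text: str) -> RobotFileParser:
    """Parse robots.txt text without fetching anything over the network."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = parse_robots(ROBOTS_TXT)
rate = parser.request_rate("*")  # named tuple: (requests, seconds)

print(parser.crawl_delay("*"))          # 10
print(rate.requests, rate.seconds)      # 1 20
print(parser.site_maps())               # ['https://example.com/sitemap.xml']
print(parser.can_fetch("*", "https://example.com/private/page.html"))  # False
print(parser.can_fetch("*", "https://example.com/index.html"))         # True
```

A well-behaved crawler written this way would sleep for `crawl_delay` seconds between requests and skip any URL for which `can_fetch` returns False.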
     
    stephan2307, Jun 16, 2010 IP
  5. stephan2307

    stephan2307 Well-Known Member

    #5
    I also forgot to mention that, depending on your server settings, having a robots.txt can actually save you bandwidth.

    Most crawlers will request the robots.txt (whether they are good or evil ones). If you don't have a robots.txt file, they will get the 404 page delivered instead. They will then know that there is no robots.txt and start crawling your site. However, 404 pages are usually bigger than a robots.txt, so every time a search engine requests a robots.txt that does not exist, you are losing bandwidth. Even if you simply create an empty file, or one that allows all spiders to crawl everything, you will save bandwidth.
    
    User-agent: *
    Disallow:
    
    The code above allows all crawlers to crawl everything.
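As a quick way to confirm that reading, the sketch below feeds the two lines to Python's standard `urllib.robotparser` and compares them with `Disallow: /`, which blocks everything. The crawler name and the `example.com` URL are placeholders, not taken from the thread.

```python
from urllib.robotparser import RobotFileParser


def can_crawl(robots_lines, agent, url):
    """Return True if `agent` may fetch `url` under the given robots.txt lines."""
    parser = RobotFileParser()
    parser.parse(robots_lines)
    return parser.can_fetch(agent, url)


# An empty Disallow value means "nothing is disallowed".
ALLOW_ALL = ["User-agent: *", "Disallow:"]
# A single slash disallows every path on the site.
BLOCK_ALL = ["User-agent: *", "Disallow: /"]

# Placeholder URL and crawler name, for illustration only.
URL = "https://example.com/any/page.html"

print(can_crawl(ALLOW_ALL, "ExampleBot", URL))  # True
print(can_crawl(BLOCK_ALL, "ExampleBot", URL))  # False
```

The one-character difference between the two files is easy to miss, so it is worth testing a robots.txt like this before uploading it.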
     
    stephan2307, Jun 16, 2010 IP
  6. stephan2307

    stephan2307 Well-Known Member

    #6
    I just ran a little test of the above.

    my 404 page: 3.27 KB
    my robots.txt: 0.02 KB

    So you can see the savings in terms of bandwidth.
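The arithmetic behind those two numbers can be sketched as follows. The sizes are the ones measured above; the daily request count is a hypothetical figure chosen only to show the scale, and the calculation looks at response bodies only, ignoring HTTP headers.

```python
# Sizes reported in the test above, in kilobytes.
PAGE_404_KB = 3.27
ROBOTS_TXT_KB = 0.02


def bandwidth_saved_kb(requests: int) -> float:
    """Kilobytes saved when `requests` robots.txt fetches return the small
    file instead of the 404 page (response bodies only)."""
    return requests * (PAGE_404_KB - ROBOTS_TXT_KB)


# Hypothetical traffic figure, not from the thread.
ASSUMED_REQUESTS_PER_DAY = 1000

print(f"{bandwidth_saved_kb(1):.2f} KB saved per request")
print(f"{bandwidth_saved_kb(ASSUMED_REQUESTS_PER_DAY):.0f} KB saved per day")
```

At roughly 3.25 KB per request the saving is small in absolute terms, so it only adds up on sites that crawlers hit very often.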
     
    stephan2307, Jun 16, 2010 IP
  7. manish.chauhan

    manish.chauhan Well-Known Member

    #7
    Can you please explain the motive of your test and its outcome?
     
    manish.chauhan, Jun 23, 2010 IP
  8. fharez89

    fharez89 Peon

    #8
    keyword your seo.you own the input this seo
     
    fharez89, Jun 29, 2010 IP
  9. edpatton

    edpatton Active Member

    #9
    Yeah, robots.txt is a way of telling Google how to crawl your site for SEO purposes.
     
    edpatton, Jun 29, 2010 IP