Confusion in Crawling and Indexing

Discussion in 'Search Engine Optimization' started by Ridhdhi, Jul 2, 2008.

  1. #1
    Hi All,

    Can any one tell me the difference between crawling and indexing???

    And the code for stopping crawling for particular page??

    Should i place robot.txt or "noindex,nofollow" in meta ???
     
    Ridhdhi, Jul 2, 2008 IP
  2. RULE

    RULE Well-Known Member

    Messages:
    476
    Likes Received:
    7
    Best Answers:
    1
    Trophy Points:
    180
    #2
    Buddy,,,,

    Robots.txt is used for blocking sites or inner pages so that googlbot can't index the pages that u don't want to show up in search index ex: images, private, cgi-bin etc,,,,,


    Robots.txt is the best one example as show in below

    For a site : www.xyz.com/robots.txt



    User-agent: *
    Disallow: /cgi-bin
    Disallow: /cp
    Disallow: /downloads
    Disallow: /images1
    Disallow: /widgets

    By applying this rules u can disallow a folder or directory which u don't want to crawl as mentioned above.

    Thanx

    Google Lover
     
    RULE, Jul 2, 2008 IP
  3. thecatalyst

    thecatalyst Active Member

    Messages:
    156
    Likes Received:
    3
    Best Answers:
    0
    Trophy Points:
    70
    #3
    Crawling is what bots to do to get all the info from your site. indexing is that info displayed in search results. I would use the robots text to restrict bots from searching certain files or directories.
     
    thecatalyst, Jul 2, 2008 IP
  4. webcosmo

    webcosmo Notable Member

    Messages:
    5,840
    Likes Received:
    153
    Best Answers:
    2
    Trophy Points:
    255
    #4
    Crawling is the process of reading your website pages by search engine bots.

    Indexing is the process of listing your site pages ranked for different keywords.
     
    webcosmo, Jul 2, 2008 IP
  5. seoperson

    seoperson Peon

    Messages:
    501
    Likes Received:
    9
    Best Answers:
    0
    Trophy Points:
    0
    #5
    Crawling: The act of a spider following links and gathering the data from the pages it visits.
    Indexing: The process whereby a Search Engine adds web pages to its database.


    See the link below:
    http://www.google.com/support/webmasters/bin/answer.py?hl=en&answer=35769

    Read the last point in Technical guidelines.

    Also, for the robots.txt:
    http://googlewebmastercentral.blogspot.com/2008/06/improving-on-robots-exclusion-protocol.html
    http://www.google.com/support/webmasters/bin/answer.py?answer=40362
     
    seoperson, Jul 2, 2008 IP
  6. Australianfranchises

    Australianfranchises Peon

    Messages:
    1,230
    Likes Received:
    7
    Best Answers:
    0
    Trophy Points:
    0
    #6
    yes i agree with u...
     
    Australianfranchises, Jul 2, 2008 IP
  7. Aryans

    Aryans Well-Known Member

    Messages:
    1,854
    Likes Received:
    31
    Best Answers:
    1
    Trophy Points:
    178
    #7
    Exactly that i want to say Agreed
     
    Aryans, Jul 2, 2008 IP
  8. Sumeshnarmath

    Sumeshnarmath Banned

    Messages:
    98
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    0
    #8
    Sumeshnarmath, Jul 3, 2008 IP
  9. Ajay019

    Ajay019 Banned

    Messages:
    540
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #9
    Crawling is the process of reading the data on the page, indexing is putting this data (and other meta data) in the search engines database (index).
     
    Ajay019, Jul 4, 2008 IP
  10. nairbuoyevoli

    nairbuoyevoli Active Member

    Messages:
    579
    Likes Received:
    3
    Best Answers:
    0
    Trophy Points:
    58
    #10
    crawling... find
    index.. pick, stored

    :) thats my own definition.. lol
     
    nairbuoyevoli, Jul 4, 2008 IP
  11. lightlysalted

    lightlysalted Active Member

    Messages:
    2,067
    Likes Received:
    32
    Best Answers:
    0
    Trophy Points:
    90
    #11
    Rule is absolutely on the button on this one, good piece of code as well
     
    lightlysalted, Jul 4, 2008 IP