What exactly does Google index?

Discussion in 'Google' started by Global777, May 27, 2008.

  1. #1
    When a GoogleBot visits a site, what code is actually being viewed and indexed? Is it the raw html, for example, or the html that is presented through a browser?

    More to the point, are the bots indexing the css code, if the stylesheet is external, or what I would see if I did a View Source?

    Thanks in advance...
     
    Global777, May 27, 2008 IP
  2. ckgni

    ckgni Active Member

    Messages:
    208
    Likes Received:
    11
    Best Answers:
    0
    Trophy Points:
    58
    #2
    raw HTML output without CSS.

    It is still unknown if it renders Javascript, some people say it does, some others say it doesn't.

    It can also read PDF, Powerpoint and Word documents. Lately there have been several discussions that it may start indexing Flash content too but it is not sure yet.
     
    ckgni, May 27, 2008 IP
  3. IEmailer.com

    IEmailer.com Well-Known Member

    Messages:
    1,864
    Likes Received:
    27
    Best Answers:
    0
    Trophy Points:
    110
    #3
    One correction... Google also started to index SWF flash files and showing the text content on their index as yahoo have been doing for a while :)
     
    IEmailer.com, May 27, 2008 IP
  4. MindBlogger

    MindBlogger Peon

    Messages:
    109
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #4
    Google doesn't index CSS. It just indexes the HTML output, PDF Docs, Dynamic Text in Flash files - that sort of stuff.

    If you're interested in finding how Google sees a specific web site, you can read this article here which features a tool to see what exactly Google sees in a page. (Disclaimer: Yeah, it's from my blog BTW :D)

    Cheers!
     
    MindBlogger, May 27, 2008 IP
  5. onlineshopping

    onlineshopping Banned

    Messages:
    160
    Likes Received:
    4
    Best Answers:
    0
    Trophy Points:
    0
    #5
    Exactly right, i agree with you.

    Well mate Global777 welcome to the dp.
     
    onlineshopping, May 27, 2008 IP
  6. Chrissy17

    Chrissy17 Peon

    Messages:
    561
    Likes Received:
    3
    Best Answers:
    0
    Trophy Points:
    0
    #6
    Interesting...the content of PDF docs? What if those docs are protected?
     
    Chrissy17, May 27, 2008 IP
  7. WishBone

    WishBone Peon

    Messages:
    2,566
    Likes Received:
    15
    Best Answers:
    0
    Trophy Points:
    0
    #7
    That will be another case, I think they will not be indexed (mostly) because of confidentiality.
     
    WishBone, May 27, 2008 IP
  8. Global777

    Global777 Active Member

    Messages:
    2
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    66
    #8
    I want to thank everyone for the input. The answer(s) to my question were more than I expected and all very valuable. Thanks again!
     
    Global777, May 27, 2008 IP
  9. Hersheys

    Hersheys Peon

    Messages:
    2,591
    Likes Received:
    23
    Best Answers:
    0
    Trophy Points:
    0
    #9
    Googlebot doesn't read the code it only read text document within the site.
     
    Hersheys, May 27, 2008 IP
  10. Bloomtools

    Bloomtools Peon

    Messages:
    1,361
    Likes Received:
    12
    Best Answers:
    0
    Trophy Points:
    0
    #10
    You can look in google webmaster tools, through you can find it our what google bots see on your website.
     
    Bloomtools, May 27, 2008 IP
  11. johnenderson

    johnenderson Peon

    Messages:
    542
    Likes Received:
    17
    Best Answers:
    0
    Trophy Points:
    0
    #11
    Because you've selected right place to learn SEO Techniques.

    DP Members will always Eager to Help you but just take care of correct categories and your value of Questions in DP.

    Welcome to DP and enjoy to read some old threads so you will get better ideas about more techniques.

    Thanks
    John
     
    johnenderson, May 27, 2008 IP