Hello Everyone, I've done a bit of research on this question and I've come up empty-handed... Can you tell me whether the content of a PDF document is read by Google? If so, are links within the PDF indexed and do they pass any value? Thanks for your help.
I don't they they are because the pdf content is not indexed usually, unless a HTML option is presented.
My thoughts exactly, if it was indexed, I'm sure it would be, but I can't recall ever seeing an indexed PDF file.
Good point... I've had limited experience with scribd, so maybe that's true. What little I have, though, I can't remember seeing a page with a PDF being indexed. Now that I think about it... those files aren't text, and they're in a frame, so maybe the links wouldn't get indexed. Let me know if I'm wrong, I'm now very curious about this.
I say yes for various reasons. In attempts to qualify that statement I dug up the following which is actually a fairly interesting read. http://wiep.net/link-value-factors/LinkValueFactors.pdf note its in pdf but I am guessing its in the archive or a cached page on the site. This does not really take into consideration that there is a mime type html equivalent of the page(I would assume) hope that helps, Nigel
Yes, a PDF will be indexed and the links will appear in search results. I noticed one of mine a few days ago.
Thanks for your input. I suppose the answer is "we don't know." I tried an experiment which seems to indicate that links within a PDF are not indexed & therefore don't pass value. Here's what I did: #1 I picked a subject (Search Engine Optimization) and found 3 random PDFs using Google #2 Opened each document and found a hyperlinked URL (Note: not just text, but an active link) #3 Identified each link's destination URL #4 Ran a "linkdomain:" query on the destination URL using Yahoo Site Explorer #5 Found no reference to the page on which the PDF resides. Does anyone have any comments? Does this make sense? Thanks.