Since yesterday, doing a site:www.domain.xx search for my site returns only htm/html pages, not a single PDF. What does that mean? Are PDFs no longer considered valid by G! or what?
Minstrel, sure? I just checked what could be called the mother of all PDFs - the Adobe Reader help file - which is linked from several PR 10 pages, yet it's not in the index. See: http://www.adobe.com/products/acrobat/pdfs/acrruserguide.pdf Or do a search on adobe reader pdf and see how many PDFs you get...
Well Jan, you have a point here... http://www.google.com/search?as_q=i...s_occt=any&as_dt=i&as_sitesearch=&safe=images Search for 'instructions' and limit the file type to PDF. Results 1 - 3 of about 4,640,000 for instructions filetype:pdf. Not showing anything beyond those three :S
http://www.google.com/search?q=schi...=&newwindow=1&c2coff=1&safe=off&start=50&sa=N
http://www.google.com/search?as_q=a...=&as_occt=any&as_dt=i&as_sitesearch=&safe=off
http://www.google.com/search?as_q=a...=&as_occt=any&as_dt=i&as_sitesearch=&safe=off
http://www.google.com/search?hl=en&...2coff=1&as_qdr=all&q=psychopathy+filetype:pdf
Well Tops, you seem to be saying that Googlebot first visits the page just to get the links. Now, the only way I know of for the bot to get the links from a page is to fetch the whole page, return it to the repository, and then parse it for links. So what is the advantage in going back again, when you already have the page in your index? BTW, this is how Google say they do it: the URL server may decide to put this page or that one at a higher or lower priority, but there is really no need to visit a page twice just to get it indexed.
I never said it indexes it twice. It indexes it once, pulls out the URLs, and adds them to the URL queue. That queue gets sorted and crawled one by one. So the bot doesn't go back to the page where it found a URL; it starts from that URL in the list. It continues crawling from the URL queue, not from the page where it found the URL. That's all I am saying.
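Just to illustrate what we're both describing, here's a minimal sketch of that kind of queue-based crawl in plain Python. Nothing here is Google's actual code; all the names (crawl, LinkParser, MAX_PAGES, the example.com seed) are made up for the example. The point is simply that each page is fetched once, stored, its links are pushed onto a URL queue, and the crawler always continues from that queue rather than revisiting the page it found a link on.

```python
# Sketch of a queue-based (breadth-first) crawl: fetch once, store, extract
# links, keep crawling from the URL queue. Illustrative only.

from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin
from urllib.request import urlopen

MAX_PAGES = 50  # small cap for the illustration


class LinkParser(HTMLParser):
    """Collects href values from anchor tags."""
    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(seed_url):
    queue = deque([seed_url])   # the URL queue ("frontier")
    seen = {seed_url}
    repository = {}             # URL -> raw HTML, fetched exactly once

    while queue and len(repository) < MAX_PAGES:
        url = queue.popleft()
        try:
            html = urlopen(url, timeout=10).read().decode("utf-8", "replace")
        except Exception:
            continue                        # skip pages that fail to fetch
        repository[url] = html              # one fetch is enough to index it

        parser = LinkParser()
        parser.feed(html)
        for href in parser.links:
            absolute = urljoin(url, href)   # resolve relative links
            if absolute not in seen:
                seen.add(absolute)
                queue.append(absolute)      # crawl continues from the queue

    return repository


if __name__ == "__main__":
    pages = crawl("http://example.com/")
    print(f"Fetched {len(pages)} pages")
```

So the URL server / scheduler can reorder or prioritize that queue however it likes, but no page needs to be fetched a second time just because a link was discovered on it.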