Digital Point Forums
  #1  
Sep 15th 2004, 10:51 pm
digitalpoint
My cat is on Prozac... really. lol

Join Date: Mar 2004
Location: San Diego, California
Posts: 22,359
New Googlebot?

Has anyone noticed a new Googlebot lurking around?

I'm getting hit by two different kinds. The normal one:

66.249.64.47 - - [15/Sep/2004:18:59:12 -0700] "GET /robots.txt HTTP/1.0" 404 1227 "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"

and also this one:

66.249.66.129 - - [15/Sep/2004:18:12:51 -0700] "GET / HTTP/1.1" 200 38358 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"

Aside from the slightly different user agent, it's also HTTP 1.1. The IP address it uses is in an IP block that is normally used only for Mediapartners (the AdSense spider), but it's spidering a site without any AdSense.

Also, the spidering pattern is different. Instead of using multiple IPs and fetching pages in groups, this one seems to do a slower, steady crawl, going multiple levels deep in a single pass.
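
For anyone who wants to check their own logs, here is a minimal sketch (not part of the original post) that tells the two variants apart. It assumes Python and the Apache "combined" log format used in the two sample lines above; the function name `classify_googlebot` and the "classic"/"new" labels are made up for illustration.

```python
import re

# Apache "combined" log format, matching the two sample lines quoted above.
LOG_LINE = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) (?P<protocol>[^"]+)" '
    r'(?P<status>\d{3}) (?P<size>\S+) "(?P<referer>[^"]*)" "(?P<agent>[^"]*)"'
)


def classify_googlebot(line):
    """Return (variant, ip, protocol) for a Googlebot hit, or None otherwise."""
    match = LOG_LINE.match(line)
    if match is None or "Googlebot" not in match["agent"]:
        return None
    # The newer crawler's user agent starts with "Mozilla/5.0 (compatible; ...".
    variant = "new" if match["agent"].startswith("Mozilla/5.0") else "classic"
    return variant, match["ip"], match["protocol"]


samples = [
    '66.249.64.47 - - [15/Sep/2004:18:59:12 -0700] "GET /robots.txt HTTP/1.0" '
    '404 1227 "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"',
    '66.249.66.129 - - [15/Sep/2004:18:12:51 -0700] "GET / HTTP/1.1" '
    '200 38358 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"',
]
results = [classify_googlebot(line) for line in samples]
for result in results:
    print(result)
```

Note that a user agent string can be faked, so this only sorts log lines; it does not prove a hit really came from Google.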
__________________
- Shawn
Keyword Tracker now supports Google (once again) as well as Bing (new) and Yahoo
Please do not PM, IM or email me for product or tool support (they will go unread/ignored), and don't "friend" me unless we are really friends.
  #2  
Sep 16th 2004, 12:14 am
Old Welsh Guy
Starcaller

Join Date: Mar 2004
Location: Wales UK
Posts: 2,696
This is the spider that G has developed that will read JavaScript and pull URLs, and can also kind of read Flash content. It is also logging as googlebot/new.

So all you JavaScript spammers beware.
  #3  
Sep 16th 2004, 12:25 am
fluke
Champion of the Naaru

Join Date: Jun 2004
Location: Chesterfield
Posts: 209
How on earth does it read flash? (or "kind of" read flash?)

Just looked at my log files and I see it. I didn't look too far back, but it came this morning about 40 minutes after the normal Gbot.
__________________
Sanibel View Steam Railway Photos
  #4  
Sep 16th 2004, 12:37 am
Arnica
Hand of A'dal

Join Date: Apr 2004
Location: UK
Posts: 320
I had the new one crawl around a week or so ago, but not since.

Mick
__________________
Bitingedge
Website Design UK | Hot Air Balloon Flights
  #5  
Sep 16th 2004, 1:11 am
SEbasic
Astral Walker

Join Date: May 2004
Location: Souf Eyst Lundun
Posts: 6,309
Thanks for the heads up Shawn...
__________________
[Ol]
Weirfire does freelance work
  #6  
Sep 16th 2004, 2:49 am
xml
Hand of A'dal

Join Date: May 2004
Location: Manchester
Posts: 254
I was gonna post a similar thread.

Initially I thought "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)" was just someone who switched their user-agent.

That was until it grabbed 6000 pages. I got suspicious and checked the IP, and oddly enough it's in Google's IP range.
__________________
Song Lyrics, Latest News
  #7  
Sep 16th 2004, 2:57 am
Redleg
Raider

Join Date: Jul 2004
Location: Norway
Posts: 360
I had several visits from this new Googlebot a couple of days ago.
I don't remember the exact IP addresses (about 15-20 of them), but here are the IP ranges (I did write them down on a piece of paper):
66.249.78.* 66.249.64.* 66.249.79.*
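
A quick way to test an address from a log against these ranges is sketched below (not part of the original post). It assumes Python and treats each `*` wildcard as a /24 network; the names `OBSERVED_RANGES` and `in_observed_ranges` are made up for illustration, and the list is only what was noted in this post, not an official list of Google addresses.

```python
import ipaddress

# The three wildcard ranges listed above, written as /24 networks.
# Assumption: "66.249.78.*" means every address from 66.249.78.0 to 66.249.78.255.
OBSERVED_RANGES = [
    ipaddress.ip_network("66.249.78.0/24"),
    ipaddress.ip_network("66.249.64.0/24"),
    ipaddress.ip_network("66.249.79.0/24"),
]


def in_observed_ranges(ip):
    """Return True if the address falls inside one of the ranges noted above."""
    address = ipaddress.ip_address(ip)
    return any(address in network for network in OBSERVED_RANGES)


# Addresses quoted elsewhere in this thread.
for ip in ("66.249.64.47", "66.249.66.129", "66.249.65.212"):
    print(ip, in_observed_ranges(ip))
```

Only the first of those three addresses falls inside the noted ranges, which fits with other posters seeing the bot on further 66.249.x.x blocks.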
__________________
| VoIP USA | Analyze and Geolocate your IP Address | |
  #8  
Sep 16th 2004, 4:34 am
a389951l
Must Create More Content

Join Date: Mar 2004
Location: New England
Posts: 1,884
Yeah just checked my log files and noticed it too.

Old Welsh Guy, how do we know that it can read JavaScript?
__________________
Free SEO Tips || Business Directory
  #9  
Sep 16th 2004, 4:34 am
nadlay
Hand of A'dal

Join Date: Jun 2004
Posts: 306
One of my sites normally gets hit by Googlebot at the same time each day, but for the last 3 days, I've been getting two hits, with the second coming about 15 minutes after the first.

I thought it strange, but hadn't had time to investigate, but now I look in my stats, and I'm also getting both GoogleBots, as Shawn described.
  #10  
Sep 16th 2004, 5:52 am
flawebworks
Tech Services

Join Date: Apr 2004
Location: here
Posts: 991
I've been getting this one all night: 66.249.65.212
  #11  
Sep 16th 2004, 7:07 am
digitalpoint
My cat is on Prozac... really. lol

Join Date: Mar 2004
Location: San Diego, California
Posts: 22,359
This one hasn't grabbed any JavaScript as the Googlebot/Test bot did, but it is HTTP 1.1 like Googlebot/Test is/was. Just wish they would grab files compressed when available now (since 1.1 supports it).
  #12  
Sep 16th 2004, 7:29 am
SEbasic
Astral Walker

Join Date: May 2004
Location: Souf Eyst Lundun
Posts: 6,309
Quote:
Just wish they would grab files compressed when available now (since 1.1 supports it).
Could you clarify that, please? I'm not too sure what you mean.
  #13  
Sep 16th 2004, 7:41 am
digitalpoint
My cat is on Prozac... really. lol

Join Date: Mar 2004
Location: San Diego, California
Posts: 22,359
You can set up your servers to compress (basically gzip) your HTML documents before sending them to a browser (if the browser supports HTTP 1.1, it's an option... it's not an option for 1.0). For example, this forum compresses the HTML sent to you. The bandwidth savings on this are pretty big. For example, this forum's main index page (when I just tested it) is 44,007 bytes, but since it's sent out compressed (which the client side decompresses), the bandwidth used is 9,099 bytes.
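
To put a number on those savings, here is a small sketch (not part of the original post), assuming Python. The helper `savings_percent` and the sample page are made up for illustration; only the 44,007 and 9,099 byte figures come from the post.

```python
import gzip


def savings_percent(original_bytes, compressed_bytes):
    """Share of bandwidth saved by sending the compressed copy instead."""
    return 100.0 * (original_bytes - compressed_bytes) / original_bytes


# Figures quoted in the post for the forum's main index page.
print(round(savings_percent(44007, 9099), 1))  # 79.3

# Hypothetical repetitive HTML page, just to show the gzip round trip.
html = (
    "<html><body><table>"
    + "<tr><td class='post'>Hello forum</td></tr>" * 500
    + "</table></body></html>"
).encode("utf-8")
compressed = gzip.compress(html)

# The client gets the original bytes back after decompressing.
assert gzip.decompress(compressed) == html
print(len(html), len(compressed))
```

The exact ratio depends on the page. Markup-heavy HTML with lots of repeated tags tends to compress well, which is why the forum index shrinks so much.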
  #14  
Sep 16th 2004, 7:44 am
SEbasic
Astral Walker

Join Date: May 2004
Location: Souf Eyst Lundun
Posts: 6,309
WOW, that's a pretty big difference.

And the new GoogleBot doesn't take advantage of that then?
  #15  
Sep 16th 2004, 8:08 am
digitalpoint
My cat is on Prozac... really. lol

Join Date: Mar 2004
Location: San Diego, California
Posts: 22,359
I didn't think so, but I just remembered that the server of mine it's spidering right now didn't have compression turned on. So I just turned it on and waited for the bot, and lo and behold, it *is* using compression now!

That is bad ASS, and something I was wishing for.
  #16  
Sep 16th 2004, 8:16 am
SEbasic
Astral Walker

Join Date: May 2004
Location: Souf Eyst Lundun
Posts: 6,309
I have a few questions about this if you don't mind - I really don't know anything about it.

1- So, are there duplicates of each file sitting on your server then, or does the server recognise the HTTP 1.1 request and then serve the file accordingly with the compression?

2- Does it put a lot more stress on servers if you are running it?

3- Does it increase loading times in the user's browser? Does it put more stress on the user's CPU (I guess the difference would be negligible if it does)?

I did think of more questions, but I'm sure I could find the answers if I looked hard enough.
  #17  
Sep 16th 2004, 8:23 am
digitalpoint
My cat is on Prozac... really. lol

Join Date: Mar 2004
Location: San Diego, California
Posts: 22,359
It does not replicate data... it compresses it on the fly. Whether it's worth turning on really depends on whether your server is more bandwidth limited or CPU limited. I run it at the lowest compression level so it doesn't stress the CPU (my servers get a lot of traffic). Loading time should actually be a little faster for the user because they have less data to download. It really just depends on how fast their computer can decompress the file, compared to downloading a larger one.

A simple way to turn it on for PHP files only would be to add this to your .htaccess file:

Code:
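# Turn on zlib output compression for PHP responses
php_value zlib.output_compression 1
# Level 1 is the fastest setting and costs the least CPU
php_value zlib.output_compression_level 1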
The higher the compression_level number, the better the compression (but more CPU overhead).
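
To illustrate what "on the fly" and the level setting mean, here is a rough sketch (not part of the original post, and not how PHP or Apache implement it). It assumes Python; the function `maybe_compress` and the sample page are made up for illustration. The server looks at the request's Accept-Encoding header and only compresses when the client offers gzip. The header parsing is deliberately simplified and ignores q-values.

```python
import gzip


def maybe_compress(body, accept_encoding, level=1):
    """Return (payload, content_encoding) for a single response.

    Compresses per request, so no second copy of the file is kept on disk.
    Simplified: any "gzip" token in Accept-Encoding counts, q-values are ignored.
    """
    offered = [token.split(";")[0].strip().lower()
               for token in accept_encoding.split(",")]
    if "gzip" in offered:
        return gzip.compress(body, compresslevel=level), "gzip"
    return body, None


# Hypothetical page body for the demonstration.
page = ("<div class='post'>Has anyone noticed a new Googlebot?</div>\n" * 400).encode("utf-8")

fast, fast_encoding = maybe_compress(page, "gzip, deflate", level=1)
best, best_encoding = maybe_compress(page, "gzip, deflate", level=9)
plain, plain_encoding = maybe_compress(page, "", level=1)

print(len(page), len(fast), len(best))
print(fast_encoding, plain_encoding)
```

A client that sends no Accept-Encoding header gets the page back untouched. Level 9 usually produces a somewhat smaller payload than level 1 at the cost of more CPU time per request, which is the trade-off described above.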
  #18  
Sep 16th 2004, 8:27 am
SEbasic
Astral Walker

Join Date: May 2004
Location: Souf Eyst Lundun
Posts: 6,309
Thanks for that, Shawn.

So if I wanted to find out a little more about it, what would be the correct terminology to use in a search?

How would that .htaccess file be used in reference to a .cfm extension?
  #19  
Sep 16th 2004, 8:29 am
digitalpoint
My cat is on Prozac... really. lol

Join Date: Mar 2004
Location: San Diego, California
Posts: 22,359
The .htaccess thing is just for PHP files. Look for mod_gzip for Apache for server-wide compression.

You can find the mod_gzip project at:

http://sourceforge.net/projects/mod-gzip/
  #20  
Sep 16th 2004, 8:30 am
SEbasic
Astral Walker

Join Date: May 2004
Location: Souf Eyst Lundun
Posts: 6,309
Thanks for that. I'll look into it. Could save some cash...
All times are GMT -8.