Digital Point Forums
Winn Law Group

Go Back   Digital Point Forums > Design & Development > Site & Server Administration > Traffic Analysis
Google Analytics
Log In to view
your analytics

Reply
 
Thread Tools
  #1  
Old Dec 18th 2005, 1:02 pm
ymgem ymgem is offline
Peon
 
Join Date: Sep 2005
Posts: 2
ymgem is on a distinguished road
Recognizing Search Engine Bots

Hi,

I am very, very new at site creation and have only just uploaded my very first brand new site, so please excuse me if my questions seem a bit naive.

How can I know, from my webstats, which bot has read my site?

The obvious ones, like google or msn are written, but what are:

1. BLA
2. ia_archiver-web.archive.org
3. MetaTagRobot

As I said, my site is very, very new, so what other ones should I expect over the coming weeks?

And the million dollar question, should anybody know the answer, is how long after they appear in my stats should I expect to receive visitors from search engines?
Reply With Quote
  #2  
Old Dec 18th 2005, 1:53 pm
sarahk's Avatar
sarahk sarahk is offline
iTamer
 
Join Date: Mar 2004
Location: itamer @ Fibs
Posts: 9,903
sarahk has a reputation beyond reputesarahk has a reputation beyond reputesarahk has a reputation beyond reputesarahk has a reputation beyond reputesarahk has a reputation beyond reputesarahk has a reputation beyond reputesarahk has a reputation beyond reputesarahk has a reputation beyond reputesarahk has a reputation beyond reputesarahk has a reputation beyond reputesarahk has a reputation beyond repute
Phone Verified
Quote:
Originally Posted by ymgem
1. BLA
No Idea
Quote:
Originally Posted by ymgem
2. ia_archiver-web.archive.org
from the way back machine at www.archive.org
Quote:
Originally Posted by ymgem
3. MetaTagrobot
no idea but something parsing based on your meta tags I guess

Rule #1: don't worry about the important bots visiting - submit once, get backlinks, don't stress

Rule #2: don't expect to be able to identify every bot that visits. There are literally thousands and it's just not worth the stress. Between the referral spammers, the spoofers (pretend to be googlebot when they're not) and the people verifying their backlinks, the subscription only search engines you'll be exhausted just trying to keep up.
Reply With Quote
  #3  
Old Dec 19th 2005, 2:42 pm
ahearn's Avatar
ahearn ahearn is offline
Hand of A'dal
 
Join Date: Dec 2005
Posts: 292
ahearn will become famous soon enough
MetaTagRobot is from this site. I don't know if the crawls are automatic or if they are manually initiated, and know little else about it.

Here are some bots that visit one of my sites:
Googlebot
MSNBot
Inktomi Slurp
WISENutbot
LinkWalker
Unknown robot (identified by hit on 'robots.txt')
Unknown robot (identified by 'crawl')
AskJeeves
Walhello appie
Alexa (IA Archiver)
Lycos
Reply With Quote
  #4  
Old Dec 21st 2005, 8:27 am
joaquin joaquin is offline
Hand of A'dal
 
Join Date: Oct 2005
Location: Qro, Mexico
Posts: 311
joaquin is on a distinguished road
yes use the meta bot thing.. I think it's pretty easy to spot minor both though
Reply With Quote
  #5  
Old Dec 21st 2005, 8:45 am
minstrel's Avatar
minstrel minstrel is offline
Celestial Defender
 
Join Date: Sep 2004
Location: Ottawa, Canada
Posts: 15,050
minstrel has a reputation beyond reputeminstrel has a reputation beyond reputeminstrel has a reputation beyond reputeminstrel has a reputation beyond reputeminstrel has a reputation beyond reputeminstrel has a reputation beyond reputeminstrel has a reputation beyond reputeminstrel has a reputation beyond reputeminstrel has a reputation beyond reputeminstrel has a reputation beyond reputeminstrel has a reputation beyond repute
Quote:
Originally Posted by sarahk
Rule #2: don't expect to be able to identify every bot that visits. There are literally thousands and it's just not worth the stress. Between the referral spammers, the spoofers (pretend to be googlebot when they're not) and the people verifying their backlinks, the subscription only search engines you'll be exhausted just trying to keep up.
Definitely. You'll drive yourself nuts worrying about them all and they'll just keep shifting each time you block one variation anyway...
Reply With Quote
  #6  
Old Dec 31st 2005, 3:46 am
MattBeard MattBeard is offline
Hand of A'dal
 
Join Date: Dec 2005
Location: North-East Scotland
Posts: 259
MattBeard is on a distinguished road
I get:

cache-xxx-yyyy.proxy.aol.com

and:

nnn-nnn-nnn-nnn.gen.twtelecom.net

Call by a lot and do very little. I guess that the first is a caching proxy at AOL (maybe it also does search crawling too) but the second one stumps me. I think that it just reads one thing from the root, either the root directory or the robots.txt file.

Any ideas?
Reply With Quote
  #7  
Old Dec 31st 2005, 12:08 pm
sarahk's Avatar
sarahk sarahk is offline
iTamer
 
Join Date: Mar 2004
Location: itamer @ Fibs
Posts: 9,903
sarahk has a reputation beyond reputesarahk has a reputation beyond reputesarahk has a reputation beyond reputesarahk has a reputation beyond reputesarahk has a reputation beyond reputesarahk has a reputation beyond reputesarahk has a reputation beyond reputesarahk has a reputation beyond reputesarahk has a reputation beyond reputesarahk has a reputation beyond reputesarahk has a reputation beyond repute
Phone Verified
A quick search on Google led me here: http://www.webmasterworld.com/forum39/3855.htm
Reply With Quote
  #8  
Old Dec 31st 2005, 1:29 pm
MattBeard MattBeard is offline
Hand of A'dal
 
Join Date: Dec 2005
Location: North-East Scotland
Posts: 259
MattBeard is on a distinguished road
OK, now I just need to decide if I care about WebSense

I should have thought to google it, but I always assumed it was some sort of search engine crawler
__________________
Information about MXF
Get the best deals on hotel bookings with HotelClub
Reply With Quote
Reply

Bookmarks

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
Search Engine Friendly URLs and the bots that break them clickbuild Search Engine Optimization 5 Oct 4th 2005 4:57 pm
Search Engine Bots Bandwidth Usage nickberry All Other Search Engines 20 May 2nd 2005 2:12 pm
search engine script - html in the search term bill All Other Tools 1 Mar 5th 2005 8:48 am
Search Engine.... Astrax All Other Tools 4 Apr 14th 2004 1:10 pm


All times are GMT -8. The time now is 11:25 am.