Bollywood Wallpapers - Mortgages - Busby SEO Test - Problem Mortgage - Hummer Dealer

PDA

View Full Version : Bias in Alexa Rankings


minstrel
Mar 5th 2007, 9:39 am
Alexa Toolbar and the Problem of Experiment Design (http://norvig.com/logs-alexa.html)
by Peter Norvig

Consider the problem of comparing traffic to internet sites. Most sites keep their traffic numbers secret, so you need to rely on third parties that monitor a sampling of traffic. One such third party is the alexa.com traffic rankings. If you download a toolbar from Alexa, your visits are tracked anonymously, and the aggregate statistics are available for all to see. As Alexa explains there are some biases inherent in this process: sites associated with Alexa, such as Amazon.com are overrepresented; sites that use https protocol are underrepresented, and so on. But one bias they don't really comment is the selection bias: the data would be good if it truly represented a random sample of internet users, but in fact it only represents those who have installed the Alexa toolbar, and that sample is not random. The samplees must be sophisticated enough to know how to install the toolbar, and they must have some reason to want it. It turns out that the toolbar tells you things about web sites, so it is useful to people in the SEO industry, so it overrperesents those people.

For example, let's look at the log stats for my site and for some of my friends who have recently published their stats for 2006. We list the actual number of visits and pageviews, and the Alexa numbers for reach and pageviews. The difference is quite profound. For example, I get about twice the pageviews of mattcutts.com, but his Alexa pageview ranking is about 25 times more than mine (I got this by looking at the 1 year, most highly smoothed graph, and then squinting to guess at the mean). What that means is that people with the Alexa toolbar installed are 25 times more likely to view a page on Matt's site versus mine, but overall, all users view twice as many pages on my site. That's a 50 to 1 difference introduced by the selection bias of Alexa. Presumably this is because Matt's site is really appealing to a core group of SEO enthusiasts, many of whom also like the Alexa toolbar.

Keep that in mind, next time you see a statistic on web usage (or any statistic): the results are only as good as the selection process that brings in the data.

...read more and view data (http://norvig.com/logs-alexa.html)

max pain
Mar 5th 2007, 10:12 am
Good to see the findings supported by statistics - though it will still be difficult to convince the ALEXA fans (especially those who make it a point of sale).

SFOD_D223
Mar 5th 2007, 1:13 pm
Nice write-up on some key issues with alot of emphasis on Alexa rankings. Thanks for the insight.

chuckd1356
Mar 5th 2007, 1:41 pm
Interesting article! Thanks!

Bezzen
Mar 9th 2007, 6:06 am
My motorsport sites gets about ten times more visitors than a directory I'm running. The directory is ranked at 25,000 something and my motorsport site is usually around 600,000. Now, why can't more webmaster be interested in motorsports? ;)

ronaldmarva
Mar 14th 2007, 9:21 pm
Alexa is good only for reference and given its biased ranking i dont think its data should only be given a token consideration.

kodut
Mar 14th 2007, 9:29 pm
alexa will maintain its dominance in providing web site trafiic details and people will also use it as a prime source for reference inspite of any such news

;d

minstrel
Mar 15th 2007, 7:09 am
alexa will maintain its dominance in providing web site trafiic details and people will also use it as a prime source for reference inspite of any such news

What people? I think even fewer people have heard of Alexa rankings than have heard about PageRank.

Tehy
Mar 15th 2007, 7:20 am
Thanks! Good information! :)

maldives
Mar 15th 2007, 7:48 am
My travel site gets triple more traffic than my directory but, Alexa ranking for the directory is way better than the travel site.

I can not beleive their stats. :mad: