Interesting news about the Wikipedia and Google relationship, read: http://news.bbc.co.uk/2/hi/technology/6335793.stm You can see this relationship in almost every keyword SERP in Google: Wikipedia is on the first page!
It will be interesting to see how Wikipedia attempts to take business and traffic from the search market.
Yeah right. Google isn't just a ranking technology or a search technology, but a crawling technology too. Unless you have the capability to crawl as much of the internet as fast as Google does (every domain on the internet, one page per domain, every 20 minutes, and that's per crawl server at least), you will never be able to compete with Google. Then there's storing that information: Google has its own distributed file system. Then a front-end system to deal with the volume of queries it gets (all 6 million a second). And lastly, a team of brains that can beat the team of brains at Google. Pierce
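To make that concrete, here's a toy, single-threaded sketch of the frontier-based crawl loop being described; Google's version is essentially this loop distributed across thousands of machines, with a shared frontier and a distributed file system behind the storage step. The seed URL is a placeholder, not anything Google actually does.

```python
# Toy single-threaded crawler: a frontier queue, a fetch step, and a
# politeness delay. Link extraction to grow the frontier is omitted
# for brevity; real crawlers parse each page and enqueue new URLs.
import time
import urllib.request
from collections import deque
from urllib.parse import urlparse

def crawl(seed, max_pages=10, delay=1.0):
    frontier = deque([seed])   # URLs waiting to be fetched
    seen_domains = set()       # enforces "one page per domain" per pass
    pages = {}                 # stands in for the storage layer

    while frontier and len(pages) < max_pages:
        url = frontier.popleft()
        domain = urlparse(url).netloc
        if domain in seen_domains:
            continue
        seen_domains.add(domain)
        try:
            with urllib.request.urlopen(url, timeout=10) as resp:
                pages[url] = resp.read()
        except OSError:
            continue           # dead link, timeout, etc.
        time.sleep(delay)      # politeness delay; at scale this waiting
                               # is spread across many target servers
    return pages

if __name__ == "__main__":
    print(len(crawl("http://example.com/")), "pages fetched")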
I would say Yahoo's crawlers are way faster than Google's. You can test it with a new website and see which engine crawls it faster. Not too long ago I posted about it here. I'm sorry, couldn't resist. Let's stay on topic.
Search marketing is getting more interesting day by day. I think the best thing is to wait and see what will happen.
Well, the thing is, Google is run on pure technical know-how. If any entity could provide search results as relevant as Google's when somebody types in a search phrase, they would get a piece of the market. Wikipedia is run on a business plan. Anybody can program a wiki with the right experience and technical staff; not anybody can write an efficient search engine like Google. Microsoft and Yahoo, the other big players in search, still have a very low share of the search market. Talk is all good, but when you get down to the B-trees, hashes, and heterogeneous data structures which fuel data mining, most analysts will be lost in their own spaghetti code.
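To give a taste of what those hashes actually do in a search engine: the core piece is an inverted index, which at its simplest is just a hash from term to the set of documents containing it. A toy sketch; real engines add disk-resident B-tree layouts, compression, and ranking on top:

```python
# Toy inverted index: the hash-based structure at the heart of search.
from collections import defaultdict

index = defaultdict(set)  # term -> set of document ids (a posting list)

def add_document(doc_id, text):
    for term in text.lower().split():
        index[term].add(doc_id)

def search(query):
    # AND query: intersect the posting lists of every query term.
    postings = [index.get(t, set()) for t in query.lower().split()]
    return set.intersection(*postings) if postings else set()

add_document(1, "wikipedia is an encyclopedia")
add_document(2, "google is a search engine")
print(search("search engine"))  # -> {2}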
No way! Only if you abuse backlinks might Yahoo come faster... but Google is always first to show your site in the SERPs, and it's able to index deeply, unlike Yahoo, which stops after 40-50 pages.
Wikipedia, you can see it in the first results in any field and any language. Maybe only when you search for your own domain name do you see your site as the first result.
I usually go to Wikipedia when I'm looking for real information anyway. In Google, I usually run into pages upon pages of Google AdSense sites. Go figure.
There's no reason for Wikipedia to be having financial troubles. If they do go under, it's because of the weight of their own stupidity. If you value idealism over financial sense, then that's your problem. The biggest flaw with Wikipedia is MediaWiki, which is an inefficient mess of code. They need a new front end on their database to cut back on wasted bandwidth and processing power. I can mirror Wikipedia on a PIII 900 no problem using my own front end; MediaWiki is unusable on that system with 2.7 million pages. I'm currently working on getting the updated 4.5 million pages imported into my system. Once that's done, it'll be copied onto a $7-a-month GoDaddy account. The live version on my GoDaddy account is still using the old 2.7-million-page Wikipedia dump. Google is having a field day, and I'm already getting quite a few hits for a wide range of search terms.
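For anyone wondering what importing a dump that size involves: the public dumps are one huge XML file, so you stream-parse it rather than load it into memory. A rough sketch, with store_page() as a hypothetical stand-in for whatever database insert you actually use:

```python
# Minimal sketch: stream a Wikipedia pages-articles XML dump into your
# own storage without loading the whole multi-GB file at once.
import xml.etree.ElementTree as ET

def store_page(title, text):
    # Hypothetical: replace with an INSERT into your own article table.
    pass

def import_dump(path):
    count = 0
    for event, elem in ET.iterparse(path, events=("end",)):
        # Dump tags carry a version-specific XML namespace, so match on
        # the tag suffix and reuse whatever namespace we find.
        if elem.tag.endswith("}page") or elem.tag == "page":
            ns = elem.tag.rsplit("}", 1)[0] + "}" if "}" in elem.tag else ""
            title = elem.findtext(f"{ns}title", default="")
            text = elem.findtext(f"{ns}revision/{ns}text", default="")
            store_page(title, text)
            count += 1
            elem.clear()  # free memory as we go; vital at millions of pages
    return count

if __name__ == "__main__":
    print(import_dump("pages-articles.xml"), "pages imported")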
You don't have to handle the edit rate and hit rate of Wikipedia. MediaWiki is battle-tested; at the moment nothing else in that area is. It doesn't help that the foundation can afford all of two coders.
Slow software is slow software. They could handle it better if the software weren't a lumbering behemoth that requires unnecessary processing power to run. Just initializing the MediaWiki class takes several seconds on a PIII 900. That's absurd. And then you have to get the data from the database and parse it with their convoluted text-parsing code; a minute or so later you finally get your article. Even simplifying the template that displays the articles would save a lot of bandwidth and processing power.

Companies would rather use more servers with more processing power and more bandwidth as a crutch to avoid dealing with a software problem. That's fine, but don't bitch about money then. Five employees should be able to take a step back from MediaWiki and develop a more efficient front end to the database. Why should they use the inefficient MediaWiki to display content to the masses, who don't need all that editing crap? It'd be trivial to default to a streamlined front end that only reads the wiki database until you click the edit button; you'd then be taken to the MediaWiki version on a separate set of servers that allows editing, etc. Block search engines from those servers and the problem is solved. A minimal sketch of what I mean is below.
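This assumes pre-rendered article HTML sitting in a hypothetical SQLite table, with edits redirected to a separate full MediaWiki host; the real schema and edit URL would differ. The point is that the read path never touches MediaWiki's parser, because rendering happened once at import time.

```python
# Minimal read-only front end: serve articles straight from the database
# and punt to the full wiki only for edits. Assumes a hypothetical table
# articles(title TEXT PRIMARY KEY, html TEXT) of pre-rendered pages.
import sqlite3
from http.server import BaseHTTPRequestHandler, HTTPServer
from urllib.parse import unquote

DB = sqlite3.connect("wiki.db", check_same_thread=False)
EDIT_HOST = "http://edit.example.com"  # the heavyweight MediaWiki servers

class ReadOnlyWiki(BaseHTTPRequestHandler):
    def do_GET(self):
        title = unquote(self.path.lstrip("/"))
        if title.startswith("edit/"):
            # Hand editing off to the servers running full MediaWiki.
            self.send_response(302)
            self.send_header("Location", f"{EDIT_HOST}/{title[5:]}")
            self.end_headers()
            return
        row = DB.execute(
            "SELECT html FROM articles WHERE title = ?", (title,)
        ).fetchone()
        body = (row[0] if row else "Not found").encode("utf-8")
        self.send_response(200 if row else 404)
        self.send_header("Content-Type", "text/html; charset=utf-8")
        self.end_headers()
        self.wfile.write(body)

if __name__ == "__main__":
    HTTPServer(("", 8080), ReadOnlyWiki).serve_forever()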
Most of the people employed by Wikimedia are not coders. Um: http://meta.wikimedia.org/wiki/Wikimedia_servers Search engine spiders do not put a significant load on the servers.
I was talking about crawling, not about indexing. Google crawls far more than Yahoo, regardless of how fast or how much it indexes. My site: 256MB this month to Google, 64MB this month to Yahoo. Major difference, and the same goes for every month. Daily, that's 2,500 pages crawled vs. 400. Pierce
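For what it's worth, those bandwidth and page-count figures hang together; a quick back-of-the-envelope check, assuming a 30-day month:

```python
# Sanity check on the crawl figures quoted above (30-day month assumed).
MB = 1024 * 1024

for engine, monthly_bytes, pages_per_day in [
    ("Google", 256 * MB, 2500),
    ("Yahoo", 64 * MB, 400),
]:
    per_page_kb = monthly_bytes / (pages_per_day * 30) / 1024
    print(f"{engine}: ~{per_page_kb:.1f} KB per crawled page")

# Google: ~3.5 KB/page, Yahoo: ~5.5 KB/page. So the 4x bandwidth gap
# comes from Google fetching roughly 6x the pages at a somewhat smaller
# average size, i.e. crawl breadth, not heavier pages.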