One would think that it doesn't take some poor schlep sitting around all day looking for these pages. It should be easy enough to automate the detection process and flag such sites for review and deindexing.
I agree. I also think the human touch would help, though. So why not do both? In the example before this one, a site went from 0 pages to, what was it, 750,000 pages in 2 weeks or something? How can that not be a red flag??? No one, not even the largest corporations, throws up sites that large that fast. ONLY SPAMMERS do that. So why can't Google figure that out? Again, we're back to the real question: does Google really want to end spam? My answer is still no. I see no motivation (+ $$$) for them to do so.
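To make the "red flag" idea concrete, here is a minimal sketch of a growth-rate check. It is not how Google does anything; the `Snapshot` type, the function name, and the 10,000 pages/day threshold are all made up for illustration, and a real system would need to tune the threshold against legitimate large sites.

```python
from dataclasses import dataclass


@dataclass
class Snapshot:
    day: int            # days since the site was first seen (hypothetical field)
    indexed_pages: int  # pages in the index on that day (hypothetical field)


def is_growth_red_flag(snapshots, max_pages_per_day=10_000):
    """Return True if the indexed page count grows faster than the
    threshold between any two consecutive snapshots.

    The default threshold is an arbitrary assumption, not a known value.
    """
    ordered = sorted(snapshots, key=lambda s: s.day)
    for prev, curr in zip(ordered, ordered[1:]):
        days = curr.day - prev.day
        if days <= 0:
            continue  # skip duplicate or out-of-order measurements
        rate = (curr.indexed_pages - prev.indexed_pages) / days
        if rate > max_pages_per_day:
            return True
    return False


# The case from this thread: 0 pages to 750,000 pages in 2 weeks.
spam_like = [Snapshot(day=0, indexed_pages=0), Snapshot(day=14, indexed_pages=750_000)]
normal = [Snapshot(day=0, indexed_pages=0), Snapshot(day=14, indexed_pages=500)]
```

With these numbers the first site averages over 53,000 new pages a day, so it trips the check, while the second one does not. A flag like this would only queue the site for human review, which is the "do both" approach.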
Google isn't strapped for cash...they are making a killing out of Adwords. In two of my niches the cost to be in the top 5 has doubled in the last month, making it break-even for me but hugely profitable for Google. In all the niches where I use Adwords the cost has risen, and I never use content, so it flows through the system at our expense. Today I turned all my Adwords programs off. This actually has relevance to this thread because Google is cashing in on all the clicks from those pages, as shown by Google's results from the last quarter. Coincidentally, that is the same quarter when this all started: Google (NASDAQ: GOOG) The Internet search firm, after Thursday’s closing bell, said its second-quarter profit more than doubled as the company expanded its leading share of the market for online search advertising.
Here's another interesting twist on the sub-domain spamming. Instead of raising red flags by generating millions of links pointing to one domain, it uses scores of domains that each have tens of thousands of sub-domains linking to each other. They are all set up to look like blogs, but way down in the comments section what appears to be a long comment is actually a large section of text scraped from other people's pages and smashed together as gibberish. Here is one example: http://orlimar-hip-steel-replacement-irons.tokyout.be The left-hand links are all paid CPC links and the right-hand menu has links to more sites of the exact same nature, just using different scraped text. All of these domains seem to be registered under the ccTLD .be. There could be hundreds of thousands if not millions of pages in this sub-domain spam ring. Again, I've disabled the links so as not to provide backlinks to the domain in question.
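For what it's worth, counting distinct sub-domains per parent domain is easy to sketch. The snippet below is only an illustration: the function names and the threshold of 1,000 are my own assumptions, and taking the last two labels as the "parent domain" is a shortcut that breaks on suffixes like .co.uk (a real version would use the Public Suffix List).

```python
from collections import defaultdict
from urllib.parse import urlsplit


def parent_domain(host):
    """Naive parent domain: the last two labels of the hostname.

    Assumption for illustration only; wrong for multi-label suffixes.
    """
    labels = host.lower().rstrip(".").split(".")
    return ".".join(labels[-2:])


def subdomain_counts(urls):
    """Map each parent domain to the number of distinct hostnames seen under it."""
    seen = defaultdict(set)
    for url in urls:
        host = urlsplit(url).hostname
        if not host:
            continue  # no scheme or unparsable URL
        seen[parent_domain(host)].add(host)
    return {domain: len(hosts) for domain, hosts in seen.items()}


def flag_domains(urls, threshold=1000):
    """Return parent domains with at least `threshold` distinct hostnames.

    The threshold is an arbitrary assumption.
    """
    return sorted(d for d, n in subdomain_counts(urls).items() if n >= threshold)


# Made-up crawl list: 1,500 generated sub-domains plus one ordinary site.
urls = [f"http://{i}.spam-ring.example/" for i in range(1500)]
urls.append("http://www.ordinary-site.example/")
flagged = flag_domains(urls)
```

This only catches one domain at a time. Spotting the ring itself would also mean looking at which flagged domains link to each other, which this sketch does not attempt.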
I see these 2 spammy sites in the top 3 positions on a lot of (totally unrelated) search results. Just take a look at the keywords they are stuffed with :O pharma.nigerianmaritimedirectory.com pharma.mvplover.org
Okay, here's today's subdomain spammer: http://www.google.com/search?q=inurl:kje-ob3.org (1,730,000 results) Sample subdomain: 12644.job.kje-ob3.org Surely if I'm able to find these listings in Google's index on a daily basis via services using Google's API, Google could find them much more easily.
This must be data center differences; I'm still seeing the 1.73 million results. --Edit-- http://whois.domaintools.com/kje-ob3.org The domain was registered on July 14, 2006 and is based in Argentina.
b.a, bullshit. If Google Sitemaps were so useful, I'd be the richest man in my country. KLB, you are right. Check http://www.linkadage.com/linkadage-info/metrics_checker.htm for this website.. 50%:50% 1 730 000 : 546
Deep down I know some of you guys are jealous of these spammers. If you had the ability to make millions of freshly indexed pages at the push of a button, would you not act? I know most of you would if given the chance. Cmon....you're lying if you say no.
I have no problems getting new pages indexed; it isn't like getting indexed is rocket science. So am I jealous? No, I am not. I simply want Google to clean up their act and get better at spotting and stopping this type of spam on their own. If search engines got better at detecting this stuff and eliminating it from their SERPs, they would dry up traffic and revenue to these sites, which would eliminate the incentives to do this stuff.
As long as search engines use automation to index and rank websites, these types of sites will keep popping up. Google and the other SEs can tighten things up, but that will only make innocent webmasters suffer as well, and we'll have more incidents such as June 27th.
Yes, I agree with you. This certainly is an interesting challenge for Google. They need to clean up this sort of mess and index pages with real content. Otherwise Google is just going to be an index full of spam.
ROFL...yea that was kinda pointless. Most of this would be fixed if Google hired just ONE person to comb over the SERPs for 4hushf5.45478yt3478ty3.org/info type spam. Google doesn't even make money off these sites since none of them even use Adsense, so it is in their best interest to remove them anyway. I could eliminate hundreds of the damn things in an hour if given the ability to. The report spam thingy is a joke.
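Even that combing could be partly scripted. Below is a rough sketch of a digit-ratio check for hostnames like the one above. The function names and the 0.3 cutoff are assumptions picked for illustration, and the check says nothing about keyword-stuffed hostnames made of real words, so it would only pre-sort candidates for a person to look at.

```python
def digit_ratio(host):
    """Fraction of digits among the letters and digits of a hostname,
    ignoring the final label (the TLD)."""
    labels = host.lower().rstrip(".").split(".")[:-1]  # drop the TLD
    chars = [c for label in labels for c in label if c.isalnum()]
    if not chars:
        return 0.0
    return sum(c.isdigit() for c in chars) / len(chars)


def looks_machine_generated(host, min_ratio=0.3):
    """Crude heuristic: flag hostnames with a high share of digits.

    The 0.3 cutoff is an arbitrary assumption, not a tested value.
    """
    return digit_ratio(host) >= min_ratio


hosts = [
    "4hushf5.45478yt3478ty3.org",  # pattern quoted in this thread
    "12644.job.kje-ob3.org",       # sample subdomain from this thread
    "www.google.com",
]
suspects = [h for h in hosts if looks_machine_generated(h)]
```

On these three hostnames the first two get flagged and www.google.com does not. Something like orlimar-hip-steel-replacement-irons.tokyout.be would pass straight through, which is why the human still matters.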