Hi all, I was reading some spun rubbish the other day and it got me thinking about how to detect this kind of text. I'm a programmer and I am always trying to stretch my skillz. Last year I started a dating site and I stuck in an algorithm to detect scammers. It's working pretty well. I think it would be similarly straightforward to detect spun text. But I wondered if there was any point to me spending the time writing the algorithm [and no, my solution would not use Copyscape]. Do people still spin stuff? And why doesn't Google penalise it. My theory is that it's very easy to detect it, and it's also not computationally intensive to do so [my dating site scammer detector works in real time]. In fact I would think that Google could use this exercise as a job interview for new programmers. So why don't Google de-index the spun stuff?
You are a programmer that's why you are thinking logically. They might know the solution but the business world is different . They want the mess created up and that's why people will be willing to spend money to get top rankings and so.
You'd think it couldn't be that hard, right. After all there are sites that can read text and tell if the writer was male or female... surely "bot" should be the third option? I'd like to think that @webdev007 is wrong but you do have to wonder if adwords spending isn't part of it.