Is there code available for generating a similarity score for checking duplicate content? e.g. two articles are 60% similar. Something that a site like copyscape would use.