Hi all, I'm working on a site for a company that sells widgets. I've got the products in a database, and I've now used mod_rewrite to make their catalog completely spiderable, with very friendly URLs in two languages:

site.com/widgets/catalog/en/
site.com/widgets/catalog/de/

From those pages, which have the basic list, you can view individual widget pages:

/widgets/catalog/en/widget1.html

They've also asked me to make a "widget selector" where people can pull up "all blue widgets" or "all large widgets". It's all dynamic, but rewritten to look like this:

/widgets/selector/blue-widgets
/widgets/selector/large-widgets

I want to add a second level of selecting to these, so we'll have pages like this:

/widgets/selector/blue-widgets/large-widgets/

or even

/widgets/selector/large-widgets/blue-widgets/

The point of these pages isn't to spam the search engines. It's to make lists of different widgets that people might want, and I've rewritten them because the raw URLs look like this:

catalog.php?lang=en&item=widget1&selector=blue_widget

My question is: do you think these pages will get flagged by duplicate content filters, or will they show up when people search for the products contained within them? There will obviously be some overlap between pages, but I don't think any of them will be identical. Does anyone have experience with something like this?
A duplicate content flag is triggered when the search engines find two or more pages that are identical or almost identical to each other. You will certainly raise a duplicate flag if you have two similar pages, even if they sit in different directories or sub-directories. Most of the time, only the page that Google considers the most important will show up. If you have too many similar pages, then depending on the link structure and on how severe the search engines consider the duplication, the duplicate pages, the whole sub-directory, the directory, or even the domain can get penalized.

You must also be careful, because letting the search engines index too many fresh pages at once will raise a second penalty, according to Google's new patent. This has become a very big problem for dynamic site owners lately.

P.S. PM me the link.
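
One way to keep the two orderings from the question (/blue-widgets/large-widgets/ and /large-widgets/blue-widgets/) from being indexed as separate pages is to pick a single canonical order and redirect the other one to it. Below is a minimal sketch of that idea in Python, not the site's actual setup: the function names are hypothetical, sorting the selectors alphabetically is just one possible convention, and the trailing slash on the canonical form is an assumption. Only the /widgets/selector/ prefix comes from the question.

```python
from typing import Optional

# Path prefix taken from the question; everything else here is illustrative.
SELECTOR_PREFIX = "/widgets/selector/"


def canonical_selector_path(path: str) -> Optional[str]:
    """Return the canonical form of a selector URL path.

    Returns None when the path is not a selector URL at all.
    Assumption: selectors are ordered alphabetically and the
    canonical form always ends with a trailing slash.
    """
    if not path.startswith(SELECTOR_PREFIX):
        return None
    # Split the remainder into selector segments, dropping empty pieces
    # produced by leading, trailing, or doubled slashes.
    segments = [s for s in path[len(SELECTOR_PREFIX):].split("/") if s]
    if not segments:
        return None
    # Remove repeated selectors and sort, so each combination of
    # selectors has exactly one URL regardless of the order requested.
    ordered = sorted(set(segments))
    return SELECTOR_PREFIX + "/".join(ordered) + "/"


def redirect_target(path: str) -> Optional[str]:
    """Return the path to redirect to, or None if no redirect is needed."""
    canonical = canonical_selector_path(path)
    if canonical is None or canonical == path:
        return None
    return canonical
```

With this convention, a request for /widgets/selector/large-widgets/blue-widgets/ would yield a redirect target of /widgets/selector/blue-widgets/large-widgets/, which the application could send as a permanent (301) redirect, while a request that is already in canonical form yields no redirect. Whether this is enough to satisfy a duplicate content filter is a separate question; the sketch only addresses the identical-list-under-two-URLs case.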