I've run an XML sitemap on my site, and it's listing some duplicate (or nearly duplicate) pages. For example, on each product page, there's a link saying "Email A Friend". When you click that, it takes you to a page where you can fill out email info. The product information is also displayed on this page. On the XML sitemap I'm getting one page like: http://www.mysite.com/product1 And another like: http://www.mysite.com/emailafriend/product1 I definitely want my product page to be indexed, but I don't know that having the email a friend page indexed adds anything. Should I exclude these duplicates from my sitemap?
Why not eother use noindex, robots.txt or configure sitemap crawler list filters. E.g. A1 sitemap Generator supports those, but other capable sitemappers should as well.