Somehow google has taken it upon themselves to index my site's pages multiple times. For example: www.mysite.com/url-path-to-the-video.html www.mysite.com/url_path_to_the_video.html Never happened before but all of a sudden google has decided to put underscores where hyphens should be and they count it as 2 separate pages. Obviously this will be considered duplicate content. No penalties have been enforced yet but I don't want to wait around for that to happen. What can I do?
Maybe submit a new sitemap.xml and use a robots.txt file to disallow the urls you don't want to be crawled/indexed? Never seen that before, but just the other day (and it hasn't happened since) my homepage (index) appeared twice in search results for the same keyword - once on P1 and once on P2. Weird.
Try contacting google about it, or have the 2nd page removed. ( http://www.google.com/support/webmasters/bin/topic.py?topic=8459 ) Also check your with your webhost too to make sure there aren't any configuration problems.