Hi hope someone can help me. Most of my website links are like this one http://www.mysite.com/article.aspx?page=1&id=12&fid=32 Today I went to Google Webmaster Tools and saw that google crawled 3812 Not found pages in my website WTF... then I saw the links and the not found pages are http://www.mysite.com/article.aspx&page=1&id=12&fid=32 ... google replaced "?" by "&" ...
I really doubt Google mixed those up, especially if they pick up the links verbatim from your a href code. They're really good at crawling the web and organizing information they find there; it really doesn't make sense they would go messing with a good URL to break it. Do you have access to your server logs? The raw ones? I'd be really curious if they have referral info attached, or anything else that might be useful...? I wouldn't think the & char would be legal for part of a file or path name ... I'd really expect IIS to turn control over to your article.aspx page, with a messed up query string. Apparently not, but that's pretty bizarre. Is it possible someone you linked to on that page tried to add it to some directories, or mentioned it in a blog, to try to get more link juice? Or that a competitor threw up a bunch of links to invalid pages, like if they were trying to do a denial of service using a spider as a proxy? I don't know, that's kind of a reach, but this is a pretty bizarre situation.
there is a real about the googlebot . googlebot does not like the urls like /article.aspx&page=1&id=12&fid=32 .
Google never like special characters(?, &) in the URL, so we make htaccess file for dynamic urls. Now can can make a robots.txt file & put the url there showing not found page error. or U can go for google auto URl removal tool to deindex the URLs.