Hello, I'm working on www.iluminatia.com.ar and I'm using URLs that contain a "ñ" character. I know this is not very standard, but I've seen other sites using ñ in their URLs and Google recognizing it. For example, check this out: http://www.google.com./search?hl=en&lr=&q=inurl%3A%C3%B1andu+123&btnG=Search If you enter the URL, the "ñ" character is transformed into "%C3%B1", which is its percent-encoded UTF-8 form. My site has the path www.iluminatia.com.ar/diseño/diseño-de-sitios-web.php, which is transformed to www.iluminatia.com.ar/dise%C3%B1o/dise%C3%B1o-de-sitios-web.php, and that should be correct. The problem is that Google isn't indexing this page, and Google Webmaster Tools reports a 404 for it. Of course, the URL is in the sitemap (see sitemap.xml on the domain). Do you know why this is happening and how to fix it? Regards, Albion
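For reference, the "ñ" to "%C3%B1" transformation described above can be reproduced with Python's standard library. This is only a minimal sketch: the path is the one from the post, and the Latin-1 variant is shown purely for contrast (nothing in the thread says the site or any link actually uses it).

```python
from urllib.parse import quote, unquote

# Path taken from the post above.
path = "/diseño/diseño-de-sitios-web.php"

# quote() percent-encodes using UTF-8 by default and leaves "/" untouched.
utf8_path = quote(path)

# Same path encoded as Latin-1 (ISO-8859-1), for contrast only.
# This is an illustrative assumption, not something reported in the thread.
latin1_path = quote(path, encoding="latin-1")

print(utf8_path)    # /dise%C3%B1o/dise%C3%B1o-de-sitios-web.php
print(latin1_path)  # /dise%F1o/dise%F1o-de-sitios-web.php

# Decoding the UTF-8 form gives back the original path.
print(unquote(utf8_path) == path)  # True
```

The two encoded forms are different byte sequences, so a server may treat them as different paths even though both are meant to spell "diseño".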
You can check to see if the page is already indexed by Google. If so, there really is no need to worry about putting it in a sitemap.
Nope, it's not indexed. What's more curious is that both the "ñ" and the "%C3%B1" ways of accessing the pages are valid for an IE or FF user. In fact, Google crawls Wikipedia, and they use the "%C3%B1" encoding (UTF-8). So why are my pages not being indexed? =(
Maybe some of your inbound links are broken. If Googlebot can't reach your pages via these broken links, it will report a 404 error.
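One way an inbound link to a "ñ" URL could be broken is if it was percent-encoded in a charset other than UTF-8. That is only an assumption here, not something confirmed in this thread. A rough sketch for checking a link's path, where `classify_path` is a hypothetical helper name and the Latin-1 sample is made up for illustration:

```python
from urllib.parse import unquote_to_bytes

def classify_path(path: str) -> str:
    """Report whether a percent-encoded path decodes as valid UTF-8."""
    raw = unquote_to_bytes(path)  # undo the %XX escapes, keep raw bytes
    try:
        raw.decode("utf-8")
        return "utf-8"
    except UnicodeDecodeError:
        return "not utf-8"

# The UTF-8 form from the original post.
print(classify_path("/dise%C3%B1o/dise%C3%B1o-de-sitios-web.php"))  # utf-8

# A hypothetical link encoded as Latin-1 instead.
print(classify_path("/dise%F1o/dise%F1o-de-sitios-web.php"))  # not utf-8
```

If any link pointing at the page falls into the second group, the server may not map it to the same file, which would be consistent with a 404 report.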