Hello, I add a directory called /forms/ to disallow list in my robot file Disallow: /forms/, which I do not want Google to index, but it is still indexed by Google. And when I use site:/mydomain.com/forms/ on Google, it returns over 86000 pages. But my whole website has only 83000 indexed by Google? (When I use site:mydomain.com, it returns 83000). How can this happen? If I add noindex to all pages under /forms/, can Google ignore them? Appreciate you time.
Hi PremierFabio Disallow: /forms/ will work but, Google will take some time to remove from pages from its database till then u can try Noindex meta tag if u r not happy with robots Disallow
If you search site:mydomain.com, the indexed pages would be greater than the indexed pages when you search site:mydomain.com/forms/. There is no possibility that this can be reversed. As far as the already indexed pages are concerned, I would like to apprise you that Crawlers will take some time to remove these pages from the index. If you want to remove them now, you can remove them from Google webmasters tool.
You suggest me use "remove url" in Google Webmaster tool? It is said only pages direct to 404 or 401 can be removed, but my pages can be opened as normal pages
Block it by robots.txt then submit removal request from your Google Webmasters Tools. Google will remove it from SERP by 24 hours.