How to get the 404 pages out from index?

Discussion in 'Google' started by Kassi, Feb 17, 2010.

  1. #1
    I work with Google for quite a while, and the main problem that I had so far was to get indexed, however now I have the need to do the contrary thing, to get pages de-indexed.

    For example, I have domen.com, and there was a weak living forum for couple years. I moved the forum to another place and now I have a store instead of it. I couldn’t move it via the webmaster panel, as I was moving to a subdomain, and the tool was not doing correctly with subdomains. I thought that if I put 404 the problem would be solved by itself.

    Unfortunately, I got the problem with the Google index. By the time of moving there were 100K pages in the Google index.

    I installed the store, and it has filters for the goods. The filter links are restricted in the robots.txt, the filtering results are closed by the “noindex” meta tag.

    It seamed that everything should be working fine, but not.

    Currently, the site:domain.com shows 389,000 pages. I waited for 3 months, and there is no sense to wait longer. Calling inurl shows that all forum tags are still there, moreover, all filters (that supposed to be closed in all possible ways) are also in the Google index.

    In the actual SERP there is only 300 pages. Everything else is in the supplimental results, and under some kind of very strict filter, which is proved by the stats of the webmaster tools

    [​IMG]

    sitemap.xml has only 526 pages indexed out of 3700 pages.

    The Google bot runs all over the site, actually not going out, including the products, but the rubbish is not getting out.

    How to get this rubbish out from the index?
     
    Kassi, Feb 17, 2010 IP