Old results cached in Google (Google experts, I need your help)

Discussion in 'Site & Server Administration' started by ridesign, Jun 14, 2006.

  1. #1
    Is there a reason why Google has begun displaying old pages that were cached in June 2005?

    Also, I had a redirect from domain.com to www.domain.com, which was fine in Google, but now Google has my old page cached for domain.com and the new page for www.domain.com.
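
    For reference, a redirect like the one described here is commonly set up as a permanent (301) redirect from the bare host to the www host. The following is only a minimal sketch of that logic; the function name canonical_redirect and the constant CANONICAL_HOST are hypothetical, and the actual server configuration used on the site is not known.

```python
from urllib.parse import urlsplit, urlunsplit

# Assumption: www.domain.com is the preferred (canonical) host,
# and domain.com is the bare host that should redirect to it.
CANONICAL_HOST = "www.domain.com"
BARE_HOST = "domain.com"


def canonical_redirect(url):
    """Return (301, target_url) if the URL is on the bare host, else None."""
    parts = urlsplit(url)
    if parts.hostname == BARE_HOST:
        # Keep scheme, path, query and fragment; swap only the host.
        target = urlunsplit(
            (parts.scheme, CANONICAL_HOST, parts.path or "/", parts.query, parts.fragment)
        )
        return 301, target
    # Already on the canonical host (or some other host): no redirect.
    return None


if __name__ == "__main__":
    print(canonical_redirect("http://domain.com/page.html"))
    print(canonical_redirect("http://www.domain.com/page.html"))
```

    If both hosts answer with a 200 instead, a search engine can treat them as two separate sites, which would match the two different cached copies.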

    Also, my site has recently had a lot of supplemental results in Google for an old folder, so I have removed all the pages in that folder so that they return a 404 page. Is this recommended, and should I also put a disallow in robots.txt?

    And new pages don't seem to be getting indexed, even though I have submitted a Google sitemap.

    Any help much appreciated. :)
     
    ridesign, Jun 14, 2006
  2. wibr

    #2
    Well as I have discovered recently on this forum and many others, this problem isn't new. So you're not alone. I'm having the exact same issue. Hunt around the forums. There's no answer. Yet...
     
    wibr, Jun 14, 2006
  3. minstrel

    #3
    No answer, no. But there are several other threads here already discussing this.
     
    minstrel, Jun 14, 2006
  4. Jean-Luc

    #4
    Hi,

    I would not disallow the nonexistent pages in robots.txt. If robots are not allowed to access the pages, they never request them, so they cannot see that the pages no longer exist.
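
    To illustrate the point, here is a small sketch using Python's standard urllib.robotparser. The folder name /old-folder/, the sample URL, and the helper crawler_may_fetch are made up for the example; it only shows how a well-behaved crawler reads the rules, not how Google actually handles its index.

```python
from urllib.robotparser import RobotFileParser


def crawler_may_fetch(robots_txt, url, agent="Googlebot"):
    """Return True if a robots.txt-respecting crawler may request the URL."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


# Hypothetical robots.txt that blocks the old folder.
BLOCKED = "User-agent: *\nDisallow: /old-folder/\n"
# Hypothetical robots.txt that blocks nothing.
OPEN = "User-agent: *\nDisallow:\n"

# Hypothetical URL of a page that has been deleted and now returns 404.
OLD_URL = "http://www.domain.com/old-folder/page.html"

if __name__ == "__main__":
    # With the disallow rule the crawler never requests the page,
    # so it never gets to see the 404 status.
    print("blocked rules, may fetch:", crawler_may_fetch(BLOCKED, OLD_URL))
    # Without the rule the crawler can request the page and see the 404.
    print("open rules, may fetch:", crawler_may_fetch(OPEN, OLD_URL))
```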

    Jean-Luc
     
    Jean-Luc, Jun 15, 2006
  5. minstrel

    #5
    But if the object is to remove the pages from the index, disallowing them in robots.txt should have the desired effect.

    Or I should say, prior to Big Daddy that would have worked.

    (For a recent example, see the case of WebmasterWorld.)
     
    minstrel, Jun 15, 2006