Duplicate content but disallowed in robot.txt

Discussion in 'Google' started by John LaO, Dec 30, 2008.

  1. #1
    Hello guys,

    i have a folder in my root directory which contains the backup files of my entire website..

    my question is, will google still "unindex" my site because of that duplicate content even if i disallow it in my robot.txt?

    just wanted to make sure though..:eek:
     
    John LaO, Dec 30, 2008 IP
  2. suman817

    suman817 Well-Known Member

    Messages:
    1,777
    Likes Received:
    378
    Best Answers:
    0
    Trophy Points:
    175
    #2
    No, those backup files will not be indexed by search engs, if there are no links with the files in backup folder. Be sure that, no links established from any other files in root directory with the files in backup folder. Then Google will not unindex your main pages and it will not notify about the duplicate files in backup folder.
     
    suman817, Dec 30, 2008 IP
  3. hasbehas

    hasbehas Well-Known Member

    Messages:
    726
    Likes Received:
    24
    Best Answers:
    0
    Trophy Points:
    190
    #3
    it would not index it, so there will not be a duplicate. No worries..
    What user-agent did u use (*) general or just (google-bot) ?
     
    hasbehas, Dec 30, 2008 IP
  4. John LaO

    John LaO Peon

    Messages:
    278
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    0
    #4
    i used *.. so all bots wont access those pages... its actually a duplicate content of a forum.. i copy and pasted everything in my root folder and store it in another folder.
     
    John LaO, Dec 30, 2008 IP
  5. hasbehas

    hasbehas Well-Known Member

    Messages:
    726
    Likes Received:
    24
    Best Answers:
    0
    Trophy Points:
    190
    #5
    then it should be fine... dont worry..
     
    hasbehas, Dec 31, 2008 IP
  6. gameutopia

    gameutopia Peon

    Messages:
    975
    Likes Received:
    7
    Best Answers:
    0
    Trophy Points:
    0
    #6
    If they are just backups you could maybe set the file/folder permissions to a level that they can't be viewed or accessed, like 000 maybe. Otherwise you might pass protect with .htaccess just to prevent bots, and even snoops. Possibly even .htaccess deny all except your ip address. Just a few thoughts.
     
    gameutopia, Dec 31, 2008 IP