How To Stop Indexing of Unwanted Pages in Google?

Discussion in 'Google' started by webmaster-32B2, Jan 14, 2013.

  1. #1
    Hi,

    Can anyone tell me please how to stop indexing of the unwanted and automatically script generated pages in Google.

    Also how to de-index such pages?

    Any help will be appreciated.

    Regards
    WM
     
    webmaster-32B2, Jan 14, 2013 IP
  2. wizardofx

    wizardofx Well-Known Member

    Messages:
    572
    Likes Received:
    10
    Best Answers:
    0
    Trophy Points:
    140
    #2
    Add a special character to the link and disallow it in robots.txt . Since our shopping cart links
    all have a question mark I use:

    Disallow: /*?

    wiz
     
    wizardofx, Jan 14, 2013 IP
  3. spider_web

    spider_web Greenhorn

    Messages:
    306
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    18
    #3
    301 is definitely the way to go, especially if you have inbound links to the unwanted pages. That will preserve most of your rank and transfer it to the actual page.
     
    spider_web, Jan 15, 2013 IP
  4. rakesh.swaransoft

    rakesh.swaransoft Greenhorn

    Messages:
    119
    Likes Received:
    2
    Best Answers:
    0
    Trophy Points:
    18
    #4
    Just add a code in robots.txt file

    Disallow: /*?
     
    rakesh.swaransoft, Jan 15, 2013 IP
  5. coakleylincoln

    coakleylincoln Greenhorn

    Messages:
    286
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    18
    #5
    Thanks. I don't know this note before. Simple code!
     
    coakleylincoln, Jan 16, 2013 IP
  6. swift551

    swift551 Greenhorn

    Messages:
    50
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    6
    #6
    try disavow tool to remove bad links for your website
     
    swift551, Jan 16, 2013 IP
  7. chaithu

    chaithu Member

    Messages:
    218
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    48
    #7
    You can do with that in two ways..!
    ! By doing with ROBOT.TXT
    !! "No follow" attribute to the page or image that you don't want to index...!
     
    chaithu, Jan 17, 2013 IP
  8. MihaiStamate

    MihaiStamate Peon

    Messages:
    27
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    1
    #8
    Robots.txt must be modified in order to disallow some pages to be indexed.
     
    MihaiStamate, Jan 17, 2013 IP
  9. zari123

    zari123 Peon

    Messages:
    3
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #9
    Identify site pages from google webmaster tools and use tags "no index" and "no follow" to completely remove pages from google index.
     
    zari123, Jan 17, 2013 IP
  10. manoj998

    manoj998 Greenhorn

    Messages:
    33
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    6
    #10
    You have two ways to do, following those easy ways:
    1: Go to your root folder, there must have robots.txt file and put this line:
    Disallow: / your page name or folder name (which you want to no index from search engine)
    2: On your page meta section write '<META NAME="ROBOTS" CONTENT="NOINDEX, NOFOLLOW">'

    Hope this will help you out.

    Enjoy... :)
     
    manoj998, Jan 18, 2013 IP
  11. Amit Kumar Mondal

    Amit Kumar Mondal Well-Known Member

    Messages:
    148
    Likes Received:
    5
    Best Answers:
    0
    Trophy Points:
    140
    #11
    By using robots.txt you can able to hide any unwanted page from search engine bot or web crawler.
     
    Amit Kumar Mondal, Jan 24, 2013 IP
  12. rudradhar

    rudradhar Member

    Messages:
    39
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    38
    #12
    Two methods I know for de-indexing and stopping crawlers to crawl webpages.
    • Robots.txt
    User-agent: *(name of bot)
    Disallow: (name of a file followed by /)
    If you want to block particular directory than Disallow: /directory-name/ will work

    • Meta Robots tag
      • <meta name="robots" content="noindex, nofollow" />
      • <meta name="robots" content="index, nofollow" />
      • <meta name="robots" content="noindex, follow" />
     
    rudradhar, Jan 24, 2013 IP
  13. kaliborobb

    kaliborobb Member

    Messages:
    20
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    31
    #13
    How can you call yourself "WEBMASTER" if you cant answer your own basic questions. 301 is definitely the way to go, especially if you have inbound links to the unwanted pages. That will preserve most of your rank and transfer it to the actual page.
    As far as blocking goes, you can use a robots.txt or a robots meta tag on your page with a NOFOLLOW,NOINDEX
     
    kaliborobb, Jan 25, 2013 IP
  14. Shu006

    Shu006 Well-Known Member

    Messages:
    16
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    111
    #14
    use robots.txt to set unfollow for some links
     
    Shu006, Feb 3, 2013 IP
  15. chetan261992

    chetan261992 Member

    Messages:
    52
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    36
    #15
    Here are two easy resolution and fast to stop search engines from indexing your site -
    1st - Use a specific meta tag
    For each page that you don't want to appear in search engine results, have only one <meta> tag. Not a description, not some keywords, only a single <meta> tag for robots.
    <meta name="robots" content="noindex,nofollow,noarchive" />
    Put that in <head> of each page, and you're telling search engines not to index the page, do not follow any links on the page and do not archive the page.
    2nd - Create a robots.txt file
    If the pages are in a separate directory, you can also block using a robots.txt file to search engines.
    Create a text file and in it, are not all of the directories that you want to protect:
    User-agent: *Disallow: /nameofdirectory
    Disallow: /anothernameofdirectory
    Do it for all the directories that you want, and then save the file as robots.txt file and upload it to the root directory. Search engine bots will hit your robots.txt file, find out which directories you do not want them in and skip them.
    So here you go. Two little things that can save the world from the problem.
    Take a choice you wish to do and have fun!
     
    chetan261992, Feb 3, 2013 IP