Hi all, I have some pages on a site that I need to keep out of the index, so I use the meta robots "noindex" tag. Now I need to link to these pages. Will they still stay unindexed if they have incoming links from other domains? I will also add a rule to robots.txt disallowing crawling of these pages.
Nofollow and robots.txt are the best way to stop Google from crawling links and pages. You are on the right track.
Yep, if your robots.txt is written correctly, the site in question shouldn't get indexed. But to make sure, I would add an .htaccess and an .htpasswd file to keep search engine crawlers out. Matt Cutts has a nice video about this topic too.
If you use the noindex tag, search engines will not index the page. "index, nofollow" will index your page but not follow the links on it; "index, follow" will index the page and follow all of the links within it; "noindex, nofollow" will neither index the page nor follow its links. Using robots.txt is the best way, rather than using the noindex meta tag in the code.
Use this HTML code at the top of each page you don't want to be indexed, dude:
<html>
<head>
<title>...</title>
<META NAME="ROBOTS" CONTENT="NOINDEX, NOFOLLOW">
</head>
The <meta name="robots" content="noindex"> element will prevent the page from being indexed no matter who links to it (you or other sites) or how many links it has. I would NOT add a Disallow: in your robots.txt. It's not necessary if you already have <meta name="robots" content="noindex"> in the <head> of the page, and it can lead to undesirable side effects: a crawler that is blocked from fetching the page never sees the noindex tag, so the URL can still end up indexed from links alone. Also, I would use <meta name="robots" content="noindex"> or <meta name="robots" content="noindex,follow">. I would NOT use <meta name="robots" content="noindex,nofollow">.
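If it helps to see how a robots.txt Disallow and a noindex tag interact, here is a rough Python sketch using only the standard library. It is a simplified model of a well-behaved crawler, not how any particular search engine is implemented. The names `RobotsMetaParser` and `crawler_sees_noindex`, the example.com URL, and the /private/ path are all made up for illustration.

```python
from html.parser import HTMLParser
from urllib.robotparser import RobotFileParser


class RobotsMetaParser(HTMLParser):
    """Collects directives from <meta name="robots" content="..."> tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        # Attribute names arrive lowercased; values may be None or mixed case.
        attr = {name: (value or "") for name, value in attrs}
        if attr.get("name", "").lower() == "robots":
            for part in attr.get("content", "").split(","):
                self.directives.add(part.strip().lower())


def crawler_sees_noindex(robots_txt, url, page_html, agent="*"):
    """Return True only if a polite crawler may fetch the page AND finds noindex.

    Hypothetical helper: models a crawler that obeys robots.txt first and
    reads meta tags only on pages it is allowed to fetch.
    """
    rules = RobotFileParser()
    rules.parse(robots_txt.splitlines())
    if not rules.can_fetch(agent, url):
        # Blocked by robots.txt: the page is never fetched, so the
        # noindex tag inside it is never read.
        return False
    parser = RobotsMetaParser()
    parser.feed(page_html)
    return "noindex" in parser.directives


# Illustrative inputs (assumed, not taken from any real site).
url = "https://example.com/private/page.html"
page = '<html><head><meta name="robots" content="noindex"></head></html>'
blocking_rules = "User-agent: *\nDisallow: /private/\n"

print(crawler_sees_noindex("", url, page))              # no Disallow
print(crawler_sees_noindex(blocking_rules, url, page))  # with Disallow
```

With no Disallow rule the crawler can read the tag. With the Disallow rule in place it never gets that far, which is the side effect to watch out for when combining the two.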