I have a website that I want to clone and give a facelift. The two sites would be exactly the same content-wise. I'm guessing I will get hit for duplicate content by Google. How can I avoid this? Thanks.
If you want to prevent all robots from indexing the page, use the following meta tag: <meta name="robots" content="noindex">. If you want to prevent only Google's robots from indexing the page, use: <meta name="googlebot" content="noindex">. To also tell robots not to follow the links on the page, use: <meta name="robots" content="noindex, nofollow">. All of these go in the page's <head>, not in robots.txt; robots.txt controls crawling, not indexing.
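If you go the meta tag route, it may help to check that the tag actually ends up in the clone site's pages. Here is a rough sketch in Python using only the standard library's html.parser. The function name robots_directives and the sample page CLONE_PAGE are made up for illustration; in practice you would feed it the HTML of your own pages.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collects directives from <meta name="robots"> and <meta name="googlebot"> tags."""

    def __init__(self):
        super().__init__()
        self.directives = {}  # crawler name -> set of lowercase directives

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        # Attribute values can be None (e.g. <meta name>), so normalise them.
        attr = {key: (value or "") for key, value in attrs}
        name = attr.get("name", "").lower()
        if name in ("robots", "googlebot"):
            parts = {
                part.strip().lower()
                for part in attr.get("content", "").split(",")
                if part.strip()
            }
            self.directives.setdefault(name, set()).update(parts)


def robots_directives(html):
    """Return the robots directives found in an HTML document (hypothetical helper)."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return parser.directives


# Hypothetical page from the clone site; note the tag sits inside <head>.
CLONE_PAGE = """<!DOCTYPE html>
<html>
<head>
  <title>Clone site</title>
  <meta name="robots" content="noindex, nofollow">
</head>
<body><p>Same content as the original site.</p></body>
</html>"""

found = robots_directives(CLONE_PAGE)
print("noindex" in found.get("robots", set()))
```

Tag and attribute names are matched case-insensitively, so the uppercase <META NAME="ROBOTS" ...> form is picked up as well.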
I got the sense the OP wanted Google to index the second site as well, just without penalizing it. I may have misunderstood. Why are you creating two identical sites in the first place? Why not give the original the facelift?
Exactly, it sounds odd. IMHO the OP has no way to avoid getting hit for duplicate content in that case.
This way, you risk both of your sites being penalized. As yfs1 says, the canonical tag is for pages on the same domain, so I suggest "noindex, nofollow" tags for your clone site.
You can try http://www.copyscape.com/ to find out whether your articles have duplicate content issues. If you are sure your content is duplicated, you should remove it before any official penalty.