1. Advertising
    y u no do it?

    Advertising (learn more)

    Advertise virtually anything here, with CPM banner ads, CPM email ads and CPC contextual links. You can target relevant areas of the site and show ads based on geographical location of the user if you wish.

    Starts at just $1 per CPM or $0.10 per CPC.

Don't Want Ads on a Certain Page

Discussion in 'Co-op Advertising Network' started by ResearchTechs, Dec 22, 2004.

  1. #1
    Though my robots.txt Disallows access to www.researchtechs.com/fast.htm somehow Google decided to index it. Since the ad network was showing zero weight without the ads on this site, I put them on for now, but I would really like not to have to put them there. Is there any way to exclude a single address from being required? Note: I recently added the meta tags noindex and nofollow as well, but they had not previously been there. Does Google totally ignore robots.txt?
     
    ResearchTechs, Dec 22, 2004 IP
  2. T0PS3O

    T0PS3O Feel Good PLC

    Messages:
    13,219
    Likes Received:
    777
    Best Answers:
    0
    Trophy Points:
    0
    #2
    In the Webmaster Guidelines is a links somewhere to get unindexed I believe. Have you tried changing the robots file to say /fast.htm? Not sure if that's it but seems better to me.
     
    T0PS3O, Dec 23, 2004 IP
  3. jim

    jim Well-Known Member

    Messages:
    816
    Likes Received:
    53
    Best Answers:
    0
    Trophy Points:
    153
    #3
    I know Google doesn't totally ignore robots.txt, but when Google finds a direct link to a page they might be indexing it without reading the robots.txt...
     
    jim, Dec 23, 2004 IP
  4. dustin

    dustin Peon

    Messages:
    69
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #4
    I'm pretty sure google respects robots.txt for all their stuff. People would jump on them in a second if they didn't.
     
    dustin, Dec 23, 2004 IP
  5. zamolxes

    zamolxes Peon

    Messages:
    176
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #5
    There is a post on Searchenginewatch about Google and robots.txt

    http://forums.searchenginewatch.com/showthread.php?t=3436

    Unfortunatelly my experience confirms that google shows at least some first level pages excluded by robots.txt as URL only listings.

    I know of a few pages on different sites that can prove it and anyone can check. At first I thought that happened because some of them might have been excluded in the robots.txt (or the robots meta tag) after being indexed. However that is not the case as I know myself of 2 pages that have been excluded in the robots.txt from the very beginning - when the site was first uploaded- and they still apear in the google index as URL only listings.
     
    zamolxes, Dec 23, 2004 IP
  6. crew

    crew Peon

    Messages:
    225
    Likes Received:
    7
    Best Answers:
    0
    Trophy Points:
    0
    #6
    You can add 'noindex' to the page itself in a Meta tag, and then somewhere on Google's site, you can request that they de-index a page. Googlebot will come take a look at the page, make sure the 'noindex' is there, and then remove the page from Google's index pretty quickly.

    I did this a few weeks ago for a single page and it was gone in about 24 hours.
     
    crew, Dec 23, 2004 IP
  7. zamolxes

    zamolxes Peon

    Messages:
    176
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #7
    The problem is that some sites add/update/change regularly a large number of pages (100s or 1000s). How do you do it then as de-indexing by url becomes virtually impossible!
    On top of this google now "indexes" even many pages excluded in the robots.txt file as "url only" listings.

    As far as I know, the coop validation process can check at random any page on your site that is in google's index (including "url only" listings and pages no longer existent)
     
    zamolxes, Dec 23, 2004 IP
  8. crew

    crew Peon

    Messages:
    225
    Likes Received:
    7
    Best Answers:
    0
    Trophy Points:
    0
    #8
    My guess is that the API does not return URL Only listings.

    (I realize this was not a good solution for hundreds of pages. The original poster only had problems with a single page)
     
    crew, Dec 23, 2004 IP
  9. zamolxes

    zamolxes Peon

    Messages:
    176
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #9
    I'm afraid in my experience it does! I got spot checks even on the pages that usually appear in the:
     
    zamolxes, Dec 23, 2004 IP