Google not following robots.txt

Discussion in 'robots.txt' started by ian_ok, Jun 7, 2005.

  1. #1
    I've just noticed a big jump in the number of pages Google has cached, but some of them are pages that my robots.txt says NO to.

    Why is this happening?

    User-agent: *
    Disallow: /forum/admin/
    Disallow: /forum/db/
    Disallow: /forum/images/
    Disallow: /forum/includes/
    Disallow: /forum/language/
    Disallow: /forum/templates/
    Disallow: /forum/common.php
    Disallow: /forum/config.php
    Disallow: /forum/faq.php
    Disallow: /forum/groupcp.php
    Disallow: /forum/login.php
    Disallow: /forum/memberlist.php
    Disallow: /forum/modcp.php
    Disallow: /forum/posting.php
    Disallow: /forum/privmsg.php
    Disallow: /forum/profile.php
    Disallow: /forum/search.php
    Disallow: /forum/viewonline.php
    Disallow: /mail/

    My forum is at:
    w w w.sanlucar-de-barrameda.com/forum/index.php and the robots.txt above looks correct to me!
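
    One way to sanity-check rules like the ones posted above is to run them through a parser locally. Below is a minimal sketch using Python's standard urllib.robotparser. The helper name is_allowed and the sample paths are made up for illustration, and Python's matching is not guaranteed to mirror Googlebot's exactly, so treat the result as a rough check only.

```python
from urllib.robotparser import RobotFileParser

# The rules exactly as quoted in the post.
ROBOTS_TXT = """\
User-agent: *
Disallow: /forum/admin/
Disallow: /forum/db/
Disallow: /forum/images/
Disallow: /forum/includes/
Disallow: /forum/language/
Disallow: /forum/templates/
Disallow: /forum/common.php
Disallow: /forum/config.php
Disallow: /forum/faq.php
Disallow: /forum/groupcp.php
Disallow: /forum/login.php
Disallow: /forum/memberlist.php
Disallow: /forum/modcp.php
Disallow: /forum/posting.php
Disallow: /forum/privmsg.php
Disallow: /forum/profile.php
Disallow: /forum/search.php
Disallow: /forum/viewonline.php
Disallow: /mail/
"""


def is_allowed(path, user_agent="Googlebot"):
    """Return True if the rules above permit user_agent to fetch path.

    is_allowed is a hypothetical helper name, not part of any library.
    """
    parser = RobotFileParser()
    # parse() accepts an iterable of lines, so no network request is made.
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, path)


if __name__ == "__main__":
    # Sample paths are assumptions chosen for illustration.
    for path in ("/forum/index.php",
                 "/forum/search.php?search_id=newposts",
                 "/forum/admin/index.php",
                 "/mail/"):
        print(path, "allowed" if is_allowed(path) else "blocked")
```

    If a path is reported as blocked here but still shows up in Google's cache, the rules themselves are probably not the problem.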

    Thanks

    Ian
     
    ian_ok, Jun 7, 2005 IP
  2. pwaring

    #2
    When did you add the robots.txt file? It might take a while for Google to pick up the new version (they certainly don't request it every time) and longer still for them to automatically purge any pages left in their results/cache.
     
    pwaring, Jun 7, 2005 IP
  3. noppid

    #3
    I used to use the PHP file excludes too. They seemed to be ignored back then as well.

    Mine were exactly the same as yours. I dunno, I just forgot about it till I read this.
     
    noppid, Jun 7, 2005 IP
  4. ian_ok

    #4
    It was uploaded just this past weekend, a very slight change from the previous version, with 3 additions: db, groupcp and modcp.

    Guess we can't really do anything about it.

    On another site, Google had indexed an OLD file that no longer existed. I wanted to add the co-op ads to it but couldn't, as I didn't have an .htaccess file, so I waited about 2 months before it left Google's index!

    Thanks Ian
     
    ian_ok, Jun 7, 2005 IP
  5. someonewhois

    #5
    someonewhois, Jul 8, 2005 IP
  6. ziandra

    #6
    Google started ignoring my noindex,nofollow,noarchive metas for a couple of days. Heck, they even removed them from the cached copies!

    But then a day or two later things went back to normal. I almost posted something on it but it fixed itself so I didn't bother.
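
    For anyone wanting to confirm what a page is actually serving before blaming Google, here is a rough sketch that pulls the robots directives out of a page's meta tags with Python's standard html.parser. The class and function names and the sample HTML are assumptions for illustration only; it reads a string you supply and says nothing about what Google has stored.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collects directives from <meta name="robots"> style tags.

    Hypothetical helper class, written for this example only.
    """

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        # Attribute values can be None when an attribute has no value.
        attr = {name: (value or "") for name, value in attrs}
        if attr.get("name", "").lower() in ("robots", "googlebot"):
            for directive in attr.get("content", "").split(","):
                directive = directive.strip().lower()
                if directive:
                    self.directives.add(directive)


def robots_directives(html):
    """Return the set of robots meta directives found in an HTML string."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return parser.directives


# Made-up sample page using the directives mentioned in the post.
SAMPLE = (
    "<html><head>"
    '<meta name="robots" content="noindex,nofollow,noarchive">'
    "</head><body>hello</body></html>"
)

if __name__ == "__main__":
    print(sorted(robots_directives(SAMPLE)))
```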
     
    ziandra, Jul 13, 2005 IP