1. Advertising
    y u no do it?

    Advertising (learn more)

    Advertise virtually anything here, with CPM banner ads, CPM email ads and CPC contextual links. You can target relevant areas of the site and show ads based on geographical location of the user if you wish.

    Starts at just $1 per CPM or $0.10 per CPC.

How to Block All Robots to Crawl/Index All html Pages within My Website

Discussion in 'robots.txt' started by vema123, Jul 9, 2019.

  1. #1
    How to block all robots from indexing/crawling all the html pages in my website. I can't block the folders because all those html pages are distributed in so many folders. So I only want all robots NOT to index/crawl all html pages within my website.
    SEMrush
     
    vema123, Jul 9, 2019 IP
    SEMrush
  2. vema123

    vema123 Active Member

    Messages:
    43
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    91
    #2
    Is this correct (to block all robots from crawling all html pages):

    User-agent: *
    Disallow: /.html

    Allow: /wp-admin/admin-ajax.php

    Sitemap: mydomain/sitemap.xml
     
    vema123, Jul 9, 2019 IP
  3. mmerlinn

    mmerlinn Notable Member

    Messages:
    2,199
    Likes Received:
    277
    Best Answers:
    6
    Trophy Points:
    260
    #3
    There are only three ways to block all bots. 1) password protect your pages, 2) block the bots on your server, or 3) do not put your pages on the public internet. Anything else that 'works' will block honorable robots but will not stop rogue robots. Since you specifically stated ALL, you will need to do #1 #2, or #3 above.
     
    mmerlinn, Jul 9, 2019 IP