Cuil Spider Question

Discussion in 'All Other Search Engines' started by wsn, Aug 17, 2009.

  1. #1
    i just added a mod to my site that allows me to view the spiders that are currently crawling my site and came acrosse this spider the cuil spider i googled it and found that its a aparent website killing spider

    if this is true is there a way for me to block this bad boy

    either way if somebody could tell me about this spider good or bad i would appreciate it


    thanks rob
     
    wsn, Aug 17, 2009 IP
  2. golb12

    golb12 Peon

    Messages:
    132
    Likes Received:
    2
    Best Answers:
    0
    Trophy Points:
    0
    #2
    You must be bad at googleing, cuil is a search engine. cuil.com
     
    golb12, Aug 17, 2009 IP
  3. wsn

    wsn Peon

    Messages:
    23
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #3
    all i typed in search was cuil spider and found that
     
    wsn, Aug 17, 2009 IP
  4. xtreme fever

    xtreme fever Active Member

    Messages:
    472
    Likes Received:
    8
    Best Answers:
    0
    Trophy Points:
    95
    #4
    Cuil's spider is Twiceler. To block, use your robots file. Find more info about using your robots file at http://www.robotstxt.org
     
    xtreme fever, Aug 17, 2009 IP
  5. Gold_Hunter

    Gold_Hunter Active Member

    Messages:
    147
    Likes Received:
    2
    Best Answers:
    0
    Trophy Points:
    63
    #5
    Cuil claims their bot does respect robots.txt only after 7 days

    You can try to block the cuil IP address via your .htaccess file or block the spider itself

    To block an IP (using your .htaccess file):

    order allow,deny
    deny from 67.218.116.162
    deny from 67.218.116.163
    deny from 67.218.116.164
    deny from 67.218.116.165
    deny from 67.218.116.166
    allow from all
    Code (markup):
    or block cuil spider agent

    RewriteEngine on
       RewriteCond %{HTTP_USER_AGENT} ^Twiceler
       RewriteRule ^.* - [F,L]
    
    Code (markup):
    Your hosting must has mod_rewrite installed
     
    Gold_Hunter, Sep 2, 2009 IP