Nintendo, would you know the best rewrite code to use to redirect their own IP address back onto their homepage?
Why bother redirecting back to their site? I guess there is the vindictive angle to it. I'll just use this to block them with forbidden. RewriteEngine On RewriteCond %{REMOTE_ADDR} ^128.111.52.62$ RewriteRule .* - [F,L] Code (markup): I suppose if you really wanted to you could replace the "-" in the final line with a URL of your choice and remove the "F," ...
Here is a sample code to do that via PHP: if ($_SERVER["REMOTE_ADDR"] == "128.111.52.62") { header("Location: http://www.com-web.org/"); } PHP: You need to add it at the top of your .php file. There is a problem though. Their IP changes, I have tested. My question is if there are any valid users coming from the 128.*.*.* IP range. If you dare blocking all 128.* traffic then here is what you need: $range = substr($_SERVER["REMOTE_ADDR"],0,3); if ($range == "128") { header("Location: http://www.com-web.org/"); } PHP: Please note that if your page has already been mirrored or you tested it with cob-web a short while ago, allow some time to pass before you try again since they are updating their cache hourly I think. Hope you'll find it useful.
With all the super computers, super gurus and super algo, google can still allow for this. Unbelievable ...
I guess that this is where google get's the info to cache(?) their pages: http://72.14.253.104/search?q=cache...tml+site:cob-web.org&hl=en&gl=us&ct=clnk&cd=2 Not sure if anyone at Google is aware but today the results show up as "In order to show you the most relevant results, we have omitted some entries very similar to the 2 already displayed. If you like, you can repeat the search with the omitted results included. "
I would really like to read about the purpose of the project, who all is involved etc. I almost get the impression that it was an experment gone awry.
Has anyone seen this: http://www.cs.cornell.edu/people/egs/beehive/cobweb/sites.php Really confused over this..can't figure out this cob-web
if you go here http://www.google.com.cob-web.org:8888/ and do a search for your site for example. Maybe your links to your site. Then look at the results. Look at the links and go to sites that link to you. All links are rewritten and the cob web is added to the links. Even in the so called google search.... What if Google crawls these pages?
Purpose? I see one for now. Google is dumping dounle content from its index and banning websites involved in it. cob-web could well be some stupid time-wasters that have got nothing else to do than drive their rovers and play online. Or they may be purposely paid people to break google ranking so that competition websites all go down..
I don't know if it a hack (probably not!) nor a hijack but if Google gets hold of pages from that site, what would happen? Maybe we all should email that university and tell them to stop doing it? How many emails would that be a day?
Maybe they'll be PhD's one day and work for Google If you use "their" Google page (google.com.cob-web.org:8888) and you do a site:cob-web.org it is not showing anything from their site. It rewrites the query to site%3 !!!! Now, why would you do that if you don't have anything to hide? AND an UPDATE! Google IS indexing the sites! http://www.google.com/search?hl=en&lr=&q="south+pole+clothes"
http://www.planet-lab.org.cob-web.org:8888/doc/TechsGuide.php#id270081 Read section 5.3.1, and check out the links there. You might look at these, too: http://www.google.com/search?q=site:cob-web.org:8888/&filter=0 At number 4, you will see Google listed with the title 'try it'... which comes from this site: http://www.cs.cornell.edu/people/egs/beehive/cobweb/instructions.php See the anchor text in the Rewriting URLS section? This is how the title gets overwritten with keywords in the serps... interesting. It's probably link pages like the one you pointed out that got the sites listed, or a stats page that got left open to crawling. Another guess would be this site: http://www.quantcast.com/cob-web.org Following the links on the left side of the page brings up another quantcast url with the stats on a cob-web.org proxied page, and a link to it... But I don't see any problems with these proxies themselves, as I have in the past with proxy trash. The only site I have found cached is this: http://209.85.165.104/search?q=cach...578.html+cob-web.org&hl=en&gl=us&ct=clnk&cd=4 They seem to have blocked it... clicking on the actual proxy URL brings up a page explaining this.
It's really difficult to check all the IPs when you have like a million in log. What is the best solution and where should we look for proxy ip listing?