THOUSANDS OF SITE BEING HIJACKED!! Is yours?

Discussion in 'Google' started by MikeSwede, Jan 20, 2007.

  1. goscript

    goscript Prominent Member

    Messages:
    2,753
    Likes Received:
    306
    Best Answers:
    0
    Trophy Points:
    315
    #101
    I dont getit... why would someone place that in the title of a webpage? Or... that was placed by ... cobweb
     
    goscript, Jan 23, 2007 IP
  2. jackburton2006

    jackburton2006 Peon

    Messages:
    5,296
    Likes Received:
    282
    Best Answers:
    0
    Trophy Points:
    0
    #102
    This happened to me a few years back. Five years worth of original content hijacked by some kid who didn't know any better. Took me months to get him to remove all of it and for Google to take action. To their credit, Google finally did and restored me in their index.
     
    jackburton2006, Jan 23, 2007 IP
  3. goscript

    goscript Prominent Member

    Messages:
    2,753
    Likes Received:
    306
    Best Answers:
    0
    Trophy Points:
    315
    #103
    goscript, Jan 23, 2007 IP
  4. MikeSwede

    MikeSwede Peon

    Messages:
    601
    Likes Received:
    16
    Best Answers:
    0
    Trophy Points:
    0
    #104
    Google knows about it since they probably are paying research money for it.
    It wouldn't be too bad if it was used for caching but nowhere should cob-web.org show up. NOWHERE! It should not be in links, titles, in SERP's!
    It seems that Google strated showing results from cache servers and treat it as real sites or subdomains to cob-web.org.
    Anybody using sub-domains? I am guessing that the way they have to fix it is to disregard every sub domain OR they have to manually go in and remove cob-web.org from SERP's.
    Google seems not too worried about it yet but it could be a huge problem if it isn't already. Maybe we just need to accept that our domains have been taken over by a cob web, right?
     
    MikeSwede, Jan 23, 2007 IP
  5. NetMidWest

    NetMidWest Peon

    Messages:
    1,677
    Likes Received:
    151
    Best Answers:
    0
    Trophy Points:
    0
    #105
    http://www.factbites.com/topics/IME shows a non-clickable url to a cob-webbed page.

    It looks to me like mmakers.org has scraped content via CobWeb/3.1 kupl1.ittc.ku.edu
    Dang those scrapers, saving someone bandwidth... :D

    http://www.google.com/search?q="Now...eeling+imprisoned+and+bad."&safe=off&filter=0

    And pushing down the rankings of so many other sites using the same article... ;)

    The example I gave before: http://www.google.com/search?q=tight+shiny+clothes as showing a cob-web url ranking well is not in the top 50 at last check.
     
    NetMidWest, Jan 23, 2007 IP
  6. MattD

    MattD Peon

    Messages:
    161
    Likes Received:
    5
    Best Answers:
    0
    Trophy Points:
    0
    #106
    Guys I think you are taking this far too seriously. I severely doubt that this is an attempt to "hijack" your site, nor that it is an "out of control" experiment (if nothing else turn it off...)

    If you are really bothered about people accidentally visiting the cached version (fairly unlikely I'd say) just put some javascript in your site's header to redirect, e.g.

    
    <script type='text/javascript'>
    var s = new String(window.location);
    if (s.indexOf("cob-web.org") >= 0) {
       window.location = "http://www.myoriginalsite.com";
    }
    </script> 
    Code (markup):
    The javascript will get cached along with everything else.
     
    MattD, Jan 23, 2007 IP
  7. mcfox

    mcfox Wind Maker

    Messages:
    7,526
    Likes Received:
    716
    Best Answers:
    0
    Trophy Points:
    360
    #107
    One thing is for sure - IP blocking won't provide a solution. Have a look at this page: http://www.cob-web.org/status.html and click on any link and choose Meridian Status - e.g. http://planetlab-9.cs.princeton.edu:3865/

    See how many nodes are at work? Try blocking them all, and the ones that get added.

    What's interesting are the instructions on how to configure your browser to proxy through cob-web.org - http://www.cs.cornell.edu/people/egs/beehive/cobweb/instructions.php

    and how the technology actually works -
    CobWeb operates as a ring of cooperative caching proxy servers, each of which is capable of serving any HTTP request. When web objects are requested, they are fetched from their origin servers and inserted into the system. Through an analysis of web object popularity, size, and update rate, CobWeb then computes an optimal replication strategy for each object to provide low lookup latency.

    Get a glimpse of the future fellow webmasters.
     
    mcfox, Jan 23, 2007 IP
  8. NetMidWest

    NetMidWest Peon

    Messages:
    1,677
    Likes Received:
    151
    Best Answers:
    0
    Trophy Points:
    0
    #108
    And the url in window.location will get changed along with everything else as well... resulting in a loopback. :D
     
    NetMidWest, Jan 23, 2007 IP
  9. mcfox

    mcfox Wind Maker

    Messages:
    7,526
    Likes Received:
    716
    Best Answers:
    0
    Trophy Points:
    360
    #109
    This is the future architecture of the web you are witnessing. It's called CoDoNS and it's about Replacing the DNS Hierarchy with Peers.
     
    mcfox, Jan 23, 2007 IP
  10. cormac

    cormac Peon

    Messages:
    3,662
    Likes Received:
    222
    Best Answers:
    0
    Trophy Points:
    0
    #110
    Very true Mc Fox but the cob-web is only one of many others taking part in PlanetLab. Here is some old news here and here.
     
    cormac, Jan 23, 2007 IP
  11. MikeSwede

    MikeSwede Peon

    Messages:
    601
    Likes Received:
    16
    Best Answers:
    0
    Trophy Points:
    0
    #111
    And MC said "George, I’ll check out cob-web.org. Man, that’s a slow site."
    Guess he doesn't know it's a cache server and that google uses it
     
    MikeSwede, Jan 23, 2007 IP
  12. MikeSwede

    MikeSwede Peon

    Messages:
    601
    Likes Received:
    16
    Best Answers:
    0
    Trophy Points:
    0
    #112
    but all these servers and whatever they do should be transparent and not show up in search results at all with rewritten URL's.
    I still think that something went wrong and Google indexed the wrong things and gave a lot of sites a penalty for duplicate content.
    It's like a virus created at a lab that shouldn't have got outside the walls but did anyway and now it's spreading all over the world.
     
    MikeSwede, Jan 23, 2007 IP
  13. john269

    john269 Notable Member

    Messages:
    6,229
    Likes Received:
    116
    Best Answers:
    0
    Trophy Points:
    235
    #113
    I don't know enough about it, but what ever happens I hope the following doesn't happen.

    As this site becomes alot bigger and every time someone uses it and visits a site it will mean that all the links contained within that site will have a cob-web domain attached to it. Now if you spam or are believed to have spammed or broke Googles Webmaster Guidelines they penalize you from their search results. Some people get hurt when they are not in the wrong. That doesn't matter so much if your site has traffic coming from other sites accrossed the web.

    Now if Google has control over that aswell as they are making the links on everyones site something like: domain.com.cob-web.org, then I have a feeling that Google will also do something so that it also makes them links either unclickable or will do it so they redirect to their own site or something.

    How I see it is this site is messing around with our sites linking structure and we have not given premission to do this.

    Although, I would just like to say that I have only mentioned Google above as people say here that they have invested money into this project or may have invested money into it and Google are the people that seem to be doing most to change the web and also try and own it.
     
    john269, Jan 23, 2007 IP
  14. NetMidWest

    NetMidWest Peon

    Messages:
    1,677
    Likes Received:
    151
    Best Answers:
    0
    Trophy Points:
    0
    #114
    This should work for proxies that rewrite the url in the same manner:
    Code removed at request of poster due to an error.  The corrected one is here:
    http://forums.digitalpoint.com/showpost.php?p=2181382&postcount=125
    
    Code (markup):
    Change the number 25 to the number of characters in your full url, http through the end of your tld. Change the 26 to that number plus one. IF it reads a period at the end of your domain, it will redirect back to your homepage. You should have a 301 redirect in your .htaccess from non-www to www or vice-versa as well.

    It might loop if the visitor has set up their browser to use the proxy... I don't know.
     
    NetMidWest, Jan 23, 2007 IP
  15. MikeSwede

    MikeSwede Peon

    Messages:
    601
    Likes Received:
    16
    Best Answers:
    0
    Trophy Points:
    0
    #115
    MikeSwede, Jan 23, 2007 IP
  16. MikeSwede

    MikeSwede Peon

    Messages:
    601
    Likes Received:
    16
    Best Answers:
    0
    Trophy Points:
    0
    #116
    Google think this is a new trick to fool them? What exactly are we doing here? Just redirecting a subdomain from another site you your own? Will Googlebot read it? What happens if your site is indexed and you have this in your files when you are crawled?
     
    MikeSwede, Jan 23, 2007 IP
  17. NetMidWest

    NetMidWest Peon

    Messages:
    1,677
    Likes Received:
    151
    Best Answers:
    0
    Trophy Points:
    0
    #117
    It won't do a thing to Google, it's javascript. Someone might trip the qualifier looking at the cache in Google or something, but it's client-side. I see no problem for Googlebot.

    Google is going to pick up the cob-web urls as it crawls the web, but with no content to index. Cob-web.org has blocked that. Since Google does apply importance to anchor text, it will list those urls with titles of the anchor text it was found under. It will get some freshboost, but with no caching by Google and no content, it will dive into the bottom quickly.

    I'll agree that someone linking to you with a cob-webbed url does not give you credit for the link. It can mess up your forms, java, etc. But Google is not penalizing for these urls being found.

    From the Yahoo example you gave, Mike, I would wonder what is wrong with Yahoo in that they ARE caching the content, though the cache looks like a real mess they might not have picked up anyway... as with other examples given in this thread. Perhaps cob-web needs to look at how yahoo's crawlers are being fed content.
     
    NetMidWest, Jan 23, 2007 IP
  18. Darkhodge

    Darkhodge Well-Known Member

    Messages:
    2,111
    Likes Received:
    76
    Best Answers:
    1
    Trophy Points:
    185
    #118
    Apparently:

    From www.cob-web.org

    *Edit* Sorry I didn't realize this thread was quite old - I didn't read the full thread before posting... Oops :eek:
     
    Darkhodge, Jan 23, 2007 IP
  19. mcfox

    mcfox Wind Maker

    Messages:
    7,526
    Likes Received:
    716
    Best Answers:
    0
    Trophy Points:
    360
    #119
    Agreed, the new infrastructure should of course, be transparent.

    I'm not sure if anything has gone wrong or if some testing is underway or what's going on exactly but it does give a nice glimpse at the future.
    It shouldn't make any real difference. I have no doubt that some canny webmasters will be able to use the cob-web url to give their sites a boost.
     
    mcfox, Jan 23, 2007 IP
  20. Pat Gael

    Pat Gael Banned

    Messages:
    1,331
    Likes Received:
    68
    Best Answers:
    0
    Trophy Points:
    0
    #120
    Is Google hacked as well?

    I got a "problems" message earlier at gmail and now I can't get Google or any Google related site loaded.
     
    Pat Gael, Jan 23, 2007 IP