1. Advertising
    y u no do it?

    Advertising (learn more)

    Advertise virtually anything here, with CPM banner ads, CPM email ads and CPC contextual links. You can target relevant areas of the site and show ads based on geographical location of the user if you wish.

    Starts at just $1 per CPM or $0.10 per CPC.

Plagiarism Checking Tool Help

Discussion in 'PHP' started by megeh_09, May 10, 2013.

  1. #1
    I am building a plagiarsm checking tool for my company but have problems on calculating the uniqueness factor.

    Calculation works by searching the snippet into Google using GSERP. The script then checks if there where results then snippet is not unique.

    This is my code;

    . . .

    $snippet = '"' . join(" ", array_slice($contentArray, $start, $limit)) . '"';

    $start += $limit;
    $end += $limit;
    $counter++;

    $url = '';
    $lang = 'en';

    $gserp = (g_serp($snippet, $url, $lang));
    $gserpCount = count($gserp);

    . . .

    . . .

    error_reporting(E_ALL ^ E_NOTICE);

    //helper function -- file_get_contents using curl
    function file_get_contents_curl($url, $referer = '', $ua = '') {
    $ch = curl_init($url);

    curl_setopt($ch, CURLOPT_HEADER, FALSE);
    curl_setopt($ch, CURLOPT_RETURNTRANSFER, TRUE);

    if ($referer != '') {
    curl_setopt($ch, CURLOPT_REFERER, $referer);
    }

    if ($ua != '') {
    curl_setopt($ch, CURLOPT_USERAGENT, $ua);
    }
    SEMrush
    curl_setopt($ch, CURLOPT_FOLLOWLOCATION, TRUE);
    curl_setopt($ch, CURLOPT_TIMEOUT, 30);

    $data = curl_exec($ch);

    curl_close($ch);

    return $data;
    }

    //this is the main function
    function g_serp($keyword, $url, $lang = 'en') {
    $results = array();
    $g_url = 'http://ajax.googleapis.com/ajax/services/search/web?v=1.0&q=' . urlencode($keyword) .
    '&rsz=large&userip=' . $_SERVER['REMOTE_DDR'] . '&hl=' . $lang;

    for ($i = 0; $i < 64; $i+=8) {
    $start = $i;
    $referer = $_SERVER['HTTP_REFERER']; //change this into your real domain
    $rawdata = file_get_contents_curl($g_url . '&start=' . $start, $referer, $_SERVER['HTTP_USER_AGENT']);
    $decoded = json_decode($rawdata, TRUE); //decode as assoc array

    if (is_array($decoded['responseData']['results'])) {
    $pos = $start;

    foreach ($decoded['responseData']['results'] as $result) {
    //if (substr_count(strtolower($result['url']), $url)) {
    // $GLOBALS['index'] = $pos + 1;
    // }

    $res['position'] = $pos + 1;
    $res['title'] = $result['titleNoFormatting'];
    $res['url'] = $result['unescapedUrl'];

    array_push($results, $res);

    $pos++;
    }
    }
    }

    return $results;
    }

    . . .


    Anyone have any idea what I may be doing wrong or how best I can go about this?
     
    megeh_09, May 10, 2013 IP
    SEMrush
  2. EricBruggema

    EricBruggema Well-Known Member

    Messages:
    1,731
    Likes Received:
    26
    Best Answers:
    13
    Trophy Points:
    175
    #2
    please use the [ code ] or [ php ] tags for displaying your code.
     
    EricBruggema, May 10, 2013 IP