Plagiarism Checking Tool Help

PHP programming forum. Ask questions or help people concerning PHP code. Don't understand a function? Need help implementing a class? Don't understand a class? Here is where to ask. Remember to do your homework!

Moderator: General Moderators

Post Reply
megeh_09
Forum Newbie
Posts: 2
Joined: Fri May 10, 2013 4:37 am

Plagiarism Checking Tool Help

Post by megeh_09 »

I am building a plagiarsm checking tool for my company but have problems on calculating the uniqueness factor.

Calculation works by searching the snippet into Google using GSERP. The script then checks if there where results then snippet is not unique.

This is my code;

. . .
***** Please use the PHP Code tag for source code *****

Code: Select all

$snippet = '"' . join(" ", array_slice($contentArray, $start, $limit)) . '"';

$start += $limit;
$end += $limit;
$counter++;

$url = '';
$lang = 'en';

$gserp = (g_serp($snippet, $url, $lang));
$gserpCount = count($gserp);

. . .

. . .

error_reporting(E_ALL ^ E_NOTICE);

//helper function -- file_get_contents using curl
function file_get_contents_curl($url, $referer = '', $ua = '') {
    $ch = curl_init($url);

    curl_setopt($ch, CURLOPT_HEADER, FALSE);
    curl_setopt($ch, CURLOPT_RETURNTRANSFER, TRUE);

    if ($referer != '') {
        curl_setopt($ch, CURLOPT_REFERER, $referer);
    }

    if ($ua != '') {
        curl_setopt($ch, CURLOPT_USERAGENT, $ua);
    }

    curl_setopt($ch, CURLOPT_FOLLOWLOCATION, TRUE);
    curl_setopt($ch, CURLOPT_TIMEOUT, 30);

    $data = curl_exec($ch);

    curl_close($ch);

    return $data;
}

//this is the main function
function g_serp($keyword, $url, $lang = 'en') {
    $results = array();
    $g_url = 'http://ajax.googleapis.com/ajax/services/search/web?v=1.0&q=' . urlencode($keyword) .
            '&rsz=large&userip=' . $_SERVER['REMOTE_DDR'] . '&hl=' . $lang;

    for ($i = 0; $i < 64; $i+=8) {
        $start = $i;
        $referer = $_SERVER['HTTP_REFERER']; //change this into your real domain
        $rawdata = file_get_contents_curl($g_url . '&start=' . $start, $referer, $_SERVER['HTTP_USER_AGENT']);
        $decoded = json_decode($rawdata, TRUE); //decode as assoc array

        if (is_array($decoded['responseData']['results'])) {
            $pos = $start;

            foreach ($decoded['responseData']['results'] as $result) {
                //if (substr_count(strtolower($result['url']), $url)) {
//                    $GLOBALS['index'] = $pos + 1;
//                }
                
                $res['position'] = $pos + 1;
                $res['title'] = $result['titleNoFormatting'];
                $res['url'] = $result['unescapedUrl'];
                
                array_push($results, $res);

                $pos++;
            }
        }
    }
    
    return $results;
}
. . .


Anyone have any idea what I may be doing wrong or how best I can go about this?
jasonporter
Forum Newbie
Posts: 1
Joined: Wed Apr 11, 2018 9:56 am

Re: Plagiarism Checking Tool Help

Post by jasonporter »

Here you go. Plagiarizm checker at https://paperleaf.ca/plagiarizm-checker/ is the best for me. I used it for my project and it helped a lot. Let's hope that it will help you as well.
Post Reply