Plagiarism checker in PHP ?

dewshare
Forum Newbie
Posts: 4
Joined: Fri Jan 21, 2011 3:51 am

Plagiarism checker in PHP ?

Post by dewshare »

Hi

I want to make a PHP function that takes text as input and then checks for that text on Google. If it finds results it returns true, otherwise false.

Is it possible? Please give me some help to start work on it.

I shall be thankful to you.
Peter Kelly
Forum Contributor
Posts: 143
Joined: Fri Jan 14, 2011 5:33 pm
Location: England

Re: Plagiarism checker in PHP ?

Post by Peter Kelly »

Everything's possible. I think you might want to look into the Google APIs at http://code.google.com/apis/ where you might be able to find an API to search Google with, but how you get the results I wouldn't know.
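For anyone landing here later, this is a minimal sketch of the true/false function from the first post. It assumes Google's Custom Search JSON API (not something named in this thread), an API key and a search engine ID that you supply, and a response with a `searchInformation.totalResults` field. The function name `textFoundOnline` and the `$fetch` parameter are hypothetical, added only so the HTTP call can be swapped out.

```php
<?php
declare(strict_types=1);

/**
 * Returns true if the search API reports at least one hit for the exact phrase.
 *
 * $fetch is a hypothetical injection point: any callable that takes a URL
 * and returns the response body as a string (or false on failure).
 */
function textFoundOnline(string $text, string $apiKey, string $engineId, ?callable $fetch = null): bool
{
    $fetch = $fetch ?? 'file_get_contents';

    // Drop embedded double quotes, then quote the whole text as an exact phrase.
    $phrase = '"' . trim(str_replace('"', '', $text)) . '"';

    // Assumed endpoint and parameter names (Custom Search JSON API).
    $query = http_build_query([
        'key' => $apiKey,
        'cx'  => $engineId,
        'q'   => $phrase,
    ]);
    $body = $fetch('https://www.googleapis.com/customsearch/v1?' . $query);
    if ($body === false) {
        throw new RuntimeException('Search request failed');
    }

    $data = json_decode($body, true);

    // Assumed response shape: searchInformation.totalResults is a numeric string.
    return (int) ($data['searchInformation']['totalResults'] ?? 0) > 0;
}
```

Called as `textFoundOnline($paragraph, $myKey, $myEngineId)` it makes one request per call, so quota limits and very long input text would need handling in a real tool.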
greyhoundcode
Forum Regular
Posts: 613
Joined: Mon Feb 11, 2008 4:22 am

Re: Plagiarism checker in PHP ?

Post by greyhoundcode »

As Peter says you can (no doubt) do this with Google, but for what it is worth I found Yahoo's search APIs to be very straightforward to implement. Are you developing something similar to Copyscape?
dewshare
Forum Newbie
Posts: 4
Joined: Fri Jan 21, 2011 3:51 am

Re: Plagiarism checker in PHP ?

Post by dewshare »

Yes, I am developing something like Copyscape, but I think Copyscape does not work with Google search.

I checked this with Google. Google returned 1 result for a paragraph but Copyscape returned 7 results, so I believe Copyscape uses a different technique.
greyhoundcode
Forum Regular
Posts: 613
Joined: Mon Feb 11, 2008 4:22 am

Re: Plagiarism checker in PHP ?

Post by greyhoundcode »

If all you are doing is copying and pasting a paragraph to use in a regular Google search that doesn't surprise me.
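One common alternative to pasting a whole paragraph is to search for several shorter exact phrases instead. The sketch below is only an illustration of that idea, not how Copyscape works (nobody in this thread knows that). The function name `phraseQueries` and the 6 and 12 word limits are arbitrary assumptions.

```php
<?php
declare(strict_types=1);

/**
 * Splits text into sentences and returns each sufficiently long one
 * as a quoted exact-phrase query string.
 */
function phraseQueries(string $text, int $minWords = 6, int $maxWords = 12): array
{
    // Drop embedded double quotes so they cannot break the phrase quoting.
    $text = str_replace('"', '', $text);

    // Naive sentence split: '.', '!' or '?' followed by whitespace.
    $sentences = preg_split('/(?<=[.!?])\s+/', trim($text), -1, PREG_SPLIT_NO_EMPTY);

    $queries = [];
    foreach ($sentences as $sentence) {
        $words = preg_split('/\s+/', trim($sentence, " \t\n\r.!?"), -1, PREG_SPLIT_NO_EMPTY);
        if (count($words) < $minWords) {
            continue; // too short to be distinctive
        }
        // Keep only the first $maxWords words of a long sentence.
        $queries[] = '"' . implode(' ', array_slice($words, 0, $maxWords)) . '"';
    }
    return $queries;
}
```

Each returned string can be sent as its own search; counting how many of them get hits gives a rougher but more forgiving signal than one all-or-nothing paragraph search.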
josh
DevNet Master
Posts: 4872
Joined: Wed Feb 11, 2004 3:23 pm
Location: Palm beach, Florida

Re: Plagiarism checker in PHP ?

Post by josh »

When I used to plagiarize in grade school / high school I'd use a thesaurus to change words around, I'd rewrite every other sentence in my own words, leave whole paragraphs out, change the order of sentences etc.

Your algorithm would basically have to do the inverse of all those operations. The more I scramble it, the harder the problem is for you to solve, until it eventually becomes intractable in practice (I'm using "NP-complete" loosely here: the time required to solve the problem with any currently known algorithm increases very quickly as the size of the problem grows). Not even some humans could solve the puzzle of who wrote Josh's papers ;-)

Even Copyscape fails. It shows me sites my text ads are running on, and I don't consider that plagiarism! lol. If they embed my whole text and put a link back, they aren't plagiarizing anything. This problem is intractable in general, and a whole research paper could be written about why it'll never work 100% (I'd plagiarize the research paper though :twisted: )
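To make the scrambling point concrete, here is a rough sketch of one standard fuzzy-matching technique, word shingles compared with Jaccard similarity. It is swapped in purely as an illustration and is not something anyone in this thread proposed. The function names `shingles` and `jaccardSimilarity` are hypothetical.

```php
<?php
declare(strict_types=1);

/** Returns the set of word n-grams ("shingles") in $text, stored as array keys. */
function shingles(string $text, int $n = 3): array
{
    // Lowercase and keep only ASCII letters, digits and whitespace.
    $clean = preg_replace('/[^a-z0-9\s]+/', '', strtolower($text));
    $words = preg_split('/\s+/', $clean, -1, PREG_SPLIT_NO_EMPTY);

    $set = [];
    for ($i = 0; $i + $n <= count($words); $i++) {
        $set[implode(' ', array_slice($words, $i, $n))] = true;
    }
    return $set;
}

/** Jaccard similarity |A n B| / |A u B|: 0.0 for disjoint sets, 1.0 for identical ones. */
function jaccardSimilarity(string $a, string $b, int $n = 3): float
{
    $sa = shingles($a, $n);
    $sb = shingles($b, $n);

    $union = count($sa + $sb); // array union keeps each distinct key once
    if ($union === 0) {
        return 0.0; // both texts shorter than $n words
    }
    return count(array_intersect_key($sa, $sb)) / $union;
}
```

Swapping the order of two sentences still leaves a sizeable overlap, because most three-word runs survive the move. Thesaurus substitution or a full rewrite breaks the shingles themselves, which is exactly the scrambling described above.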
greyhoundcode
Forum Regular
Posts: 613
Joined: Mon Feb 11, 2008 4:22 am

Re: Plagiarism checker in PHP ?

Post by greyhoundcode »

+1 what Josh said.

Even in looking for verbatim excerpts from your or someone else's work I don't think copying and pasting paragraphs into Google is the way to go; after all, generally people don't enter complete paragraphs when they are conducting searches, they enter key terms that are relevant to what they are looking for.

If I were creating something akin to Copyscape, I think I would take the source URL and use something like the BOSS API (or the equivalent from Google, etc.) to help derive a list of relevant keywords (i.e., the keywords people are using to find the source material). Then I might conduct fresh searches with those same keywords, or variations of them.
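As a rough stand-in for the keyword step, the sketch below pulls the most frequent non-trivial words out of the text itself. This is a plain term-frequency count, not the BOSS API, and it cannot tell you which keywords people actually search with. The function name `topKeywords` and the stopword list are illustrative assumptions.

```php
<?php
declare(strict_types=1);

/**
 * Returns up to $limit of the most frequent non-stopword terms in $text.
 * The stopword list is a tiny illustrative sample, not a complete one.
 */
function topKeywords(string $text, int $limit = 5): array
{
    $stopwords = ['the', 'a', 'an', 'and', 'or', 'of', 'to', 'in', 'is',
                  'it', 'that', 'for', 'on', 'with', 'as', 'are'];

    // Format 1 makes str_word_count return the words as a list.
    $words = str_word_count(strtolower($text), 1);

    // Drop very short words and stopwords.
    $words = array_filter($words, function (string $w) use ($stopwords): bool {
        return strlen($w) > 2 && !in_array($w, $stopwords, true);
    });

    $counts = array_count_values($words);
    arsort($counts); // highest count first

    return array_slice(array_keys($counts), 0, $limit);
}
```

Joining the result with spaces, for example `implode(' ', topKeywords($text))`, gives a query closer to what a person would type than a pasted paragraph. Pages found that way would still need to be compared against the original text.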
topnotch
Forum Newbie
Posts: 1
Joined: Wed Mar 07, 2012 10:16 pm

Re: Plagiarism checker in PHP ?

Post by topnotch »

Since Copyscape is the leader in this industry, it's hard to find an alternative of the same quality, but I know of a plagiarism checker tool developed in PHP that uses the Google search engine to check for duplicate content. Hope that helps a bit.