Text file comparison - algorithm
Posted: Fri Jul 06, 2007 9:53 am
I am looking for a way to determine whether a text file B is an edited version of a text file A.
If no such function exists already, here are some thoughts:
1. The two files have almost the same length (within 2% deviation, or some other small arbitrary threshold).
2. The two files share some rare words.
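To make the two ideas concrete, here is a rough Python sketch of how they might be combined. The function names (`similar_length`, `rare_words`, `looks_like_edit`) are made up for illustration. "Rare" is assumed to mean a word that occurs only once within its own file, and the 50% overlap threshold is an arbitrary guess, not a tested value.

```python
import re
from collections import Counter


def similar_length(text_a, text_b, tolerance=0.02):
    """Idea 1: the lengths differ by at most `tolerance` (2% by default)."""
    longer = max(len(text_a), len(text_b))
    if longer == 0:
        return True  # two empty texts trivially have the same length
    return abs(len(text_a) - len(text_b)) / longer <= tolerance


def rare_words(text, max_count=1):
    """Words occurring at most `max_count` times in `text`.

    Assumption: "rare" is judged within the file itself, not against
    a general word-frequency list for the language.
    """
    counts = Counter(re.findall(r"[a-z']+", text.lower()))
    return {word for word, n in counts.items() if n <= max_count}


def looks_like_edit(text_a, text_b, tolerance=0.02, min_shared=0.5):
    """Combine both ideas; `min_shared` is an arbitrary illustrative threshold."""
    if not similar_length(text_a, text_b, tolerance):
        return False
    rare_a = rare_words(text_a)
    rare_b = rare_words(text_b)
    if not rare_a or not rare_b:
        return False  # nothing to compare
    # Fraction of the smaller rare-word set that also appears in the other one.
    shared = len(rare_a & rare_b) / min(len(rare_a), len(rare_b))
    return shared >= min_shared
```

This would be called as `looks_like_edit(open("a.txt").read(), open("b.txt").read())`. It only flags likely candidates; it does not show what was changed.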
Any suggestions would be appreciated.