Page 2 of 2

Posted: Sat Mar 04, 2006 9:04 pm
by a94060
jshpro2 wrote:
a94060 wrote:doesnt d11wtq run the site?
No single person runs the site, every member here runs the site. The mods vote on important issues though.


What I would recommend is storing a md5 hash of the message in the database, and then checking if the md5 hash of the current message matches any other message. This will knock out exact dupes but you still have that 10% difference and the issue that jcart brought up. Could you tell us how this is going to be used because there are different algorithms for example soundex that could be used here..
That seems to be a great idea. wouldnt the hashing still take time if it were thousands of characters long? it would cause a significant load right?(i dont know how fast these commands but im giving my 2cents :idea:

Posted: Sat Mar 04, 2006 9:04 pm
by Chris Corbyn
a94060 wrote:
scrotaye wrote:d11wtq isn't really a mod........ he just acts like it :lol:
doesnt d11wtq run the site?
I'm just one of you guys ;) (and now I'm going to bed...)