PDF & DOCX binary data searching
Posted: Tue Jul 15, 2008 9:48 am
I posted a question about this the other day in the php section, but it has since been put 4+ pages back without an answer. So, I was hoping for any solution really. Here is my situation. I allow users to upload .doc, .docx, .pdf, .rtf, .txt files to my database and I then pull them out one at a time from my db, as binary, and need to search through them for specific text. Now, the unencrypted file types are a piece of cake but so far I have not been able to find anything that will allow me to take in a string from the user, either encrypt it for pdf or docx and then search for that encrypted string in the binary data or directly search for the input string in the docx or pdf files using functions.
Anyone have any ideas on how this could be done, a place where I can figure out EXACTLY how docx's and pdf's are encrypted (so i can encrypt the string manually), or any other help in general?
Anyone have any ideas on how this could be done, a place where I can figure out EXACTLY how docx's and pdf's are encrypted (so i can encrypt the string manually), or any other help in general?