Hi,
My site has several PDF documents that users can download. However, I would like to be able to index the content of those pdf files, so that the users could do a search for a given argument, and the site would return which pdf files would be relevant.
I was thinking that maybe the best way of doing this would be by parsing the content of the pdf files and save it on a MySQL table. When the user would do the search, the script would look in the table and return the pdf file names relevant for the search.
I would need some guidance on how I could parse a PDF file since I've never done this before. Also, would this be the best way of achieving what I want?
Thanks!
Rfeio
PDF parser
Moderator: General Moderators