PDF parser

Posted: Thu Apr 02, 2009 7:09 am
by rfeio
Hi,

My site has several PDF documents that users can download. I would like to index the content of those PDF files, so that users can search for a given term and the site returns the PDF files that are relevant.

I was thinking that the best way to do this might be to parse the content of the PDF files and save it in a MySQL table. When a user runs a search, the script would look in the table and return the names of the PDF files relevant to the search.
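
To make the idea concrete, here is a rough sketch of the table-and-lookup part in Python. It is only an illustration under several assumptions: the text of each PDF has already been extracted by some separate tool, Python's built-in sqlite3 module stands in for MySQL so the example runs on its own, and the names pdf_index, build_index and search are made up for this example.

```python
import sqlite3


def build_index(conn, documents):
    """Store (file name, extracted text) pairs in a lookup table.

    `documents` is a list of (filename, text) tuples. The text is assumed
    to have been extracted from each PDF beforehand by a separate tool.
    """
    conn.execute(
        "CREATE TABLE IF NOT EXISTS pdf_index ("
        "filename TEXT PRIMARY KEY, content TEXT)"
    )
    conn.executemany(
        "INSERT OR REPLACE INTO pdf_index (filename, content) VALUES (?, ?)",
        documents,
    )
    conn.commit()


def search(conn, term):
    """Return the names of the files whose text contains `term`."""
    # Escape LIKE wildcards so the user's input is matched literally.
    escaped = (
        term.replace("\\", "\\\\").replace("%", "\\%").replace("_", "\\_")
    )
    rows = conn.execute(
        "SELECT filename FROM pdf_index "
        "WHERE content LIKE ? ESCAPE '\\' ORDER BY filename",
        ("%" + escaped + "%",),
    )
    return [row[0] for row in rows]


# Hypothetical sample data standing in for real extracted PDF text.
conn = sqlite3.connect(":memory:")
build_index(
    conn,
    [
        ("manual.pdf", "How to install the widget"),
        ("pricing.pdf", "Price list for 2009"),
    ],
)
print(search(conn, "install"))  # ['manual.pdf']
```

A plain substring match like this is the simplest possible lookup. With a real MySQL table, a FULLTEXT index would probably be a better fit for larger amounts of text, but that is outside what this sketch shows.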

I would need some guidance on how to parse a PDF file, since I've never done this before. Also, would this be the best way of achieving what I want?

Thanks!

Rfeio