can anyone tell me how to retrieve file summary info from .doc and .pdf files. i'm trying to get title, subject, author, category, comments etc?? much like ID3 tags fro mp3's
thanxs
file summary info
Moderator: General Moderators
- feyd
- Neighborhood Spidermoddy
- Posts: 31559
- Joined: Mon Mar 29, 2004 3:24 pm
- Location: Bothell, Washington, USA
both the examples you gave are binary file formats. DOC format has not been disclosed, so extracting metadata could be challenging without some helper utilities. You may be able to ween the information out of other applications like OpenOffice.Org for it. PDF file specification is available from Adobe. Some of the PDF libraries built for PHP may be able to extract the metadata natively, but you may need to dive through the binary data anyways depending on what exactly you want from each...