i have some pdf encoded as image on text (hidden text), now i found out that many contain the wrong text.
is there any way to remove text objects from PDF to avoid the slow re-encoding process?
could this be done using a free toolkit?
thanks
remove text from PDF
Moderator: General Moderators
- Chris Corbyn
- Breakbeat Nuttzer
- Posts: 13098
- Joined: Wed Mar 24, 2004 7:57 am
- Location: Melbourne, Australia
Not sure I understand. You want to remove parts of a PDF without re-encoding it? It can't be possible surely since you'd have to decode the PDF to do it. PDF encoding isn't slow in my experience using PHP. Have you played around with the PDF class by R & OS ? 
http://ros.co.nz/pdf
http://ros.co.nz/pdf
thanks a lot for your reply
i tried using PHP to generate PDF using the class R & O but it can't be used if you want to generate image on text pdf (hidden text) so i generated these pdfs using iText.
i don't want to do it again, i'm asking is it possible to parse the pdf and to remove any text object? if so, using which tool?
thanks a lot for your time
i tried using PHP to generate PDF using the class R & O but it can't be used if you want to generate image on text pdf (hidden text) so i generated these pdfs using iText.
i don't want to do it again, i'm asking is it possible to parse the pdf and to remove any text object? if so, using which tool?
thanks a lot for your time