remove text from PDF
Posted: Mon Jan 30, 2006 2:09 am
by mesh2005
I have some PDFs encoded as image-on-text (a page image with a hidden text layer), and I have now found out that many of them contain the wrong text.
Is there any way to remove the text objects from a PDF, to avoid the slow re-encoding process?
Could this be done using a free toolkit?
Thanks
Posted: Mon Jan 30, 2006 10:13 am
by Chris Corbyn
Not sure I understand. You want to remove parts of a PDF without re-encoding it? Surely that isn't possible, since you'd have to decode the PDF to do it. PDF encoding isn't slow in my experience using PHP. Have you played around with the PDF class by R & OS?
http://ros.co.nz/pdf
Posted: Wed Feb 01, 2006 1:14 am
by mesh2005
Thanks a lot for your reply.
I tried using PHP to generate the PDFs with the R & OS class, but it can't be used to generate image-on-text PDFs (hidden text), so I generated these PDFs using iText.
I don't want to do that again. What I'm asking is: is it possible to parse the PDF and remove any text objects? If so, using which tool?
Thanks a lot for your time.
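
To make "removing text objects" concrete: in a PDF page content stream, text is drawn between the BT and ET operators, and hidden text is usually set with text render mode 3 (3 Tr). Below is a minimal Python sketch, not a complete tool, that strips every BT ... ET block from a content stream. It assumes the stream has already been decompressed by some PDF library, and the function name strip_text_objects is made up for illustration. It also does not handle a BT or ET that happens to appear inside a string literal.

```python
import re

# One text object: a BT operator, then everything up to the nearest ET operator.
# Both operators must be delimited by whitespace or the stream start/end.
TEXT_OBJECT = re.compile(rb"(?<![^\s])BT(?=\s).*?(?<=\s)ET(?![^\s])", re.DOTALL)


def strip_text_objects(content: bytes) -> bytes:
    """Return a decoded content stream with every BT ... ET block removed.

    Hypothetical helper: assumes `content` is already uncompressed.
    """
    return TEXT_OBJECT.sub(b"", content)


if __name__ == "__main__":
    # A page that paints an image, then writes invisible text (3 Tr) over it.
    sample = (
        b"q 100 0 0 100 0 0 cm /Im0 Do Q\n"
        b"BT /F1 12 Tf 3 Tr (wrong text) Tj ET\n"
    )
    print(strip_text_objects(sample))
```

The image-drawing operators (/Im0 Do) are left untouched, so the page image would not need to be re-encoded. Only the content stream would have to be rewritten and saved back with whatever toolkit is used to open the file.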