remove text from PDF

Ye' old general discussion board. Basically, for everything that isn't covered elsewhere. Come here to shoot the breeze, shoot your mouth off, or whatever suits your fancy.
This forum is not for asking programming related questions.

Moderator: General Moderators

Post Reply
mesh2005
Forum Newbie
Posts: 3
Joined: Mon Jan 30, 2006 2:06 am

remove text from PDF

Post by mesh2005 »

i have some pdf encoded as image on text (hidden text), now i found out that many contain the wrong text.
is there any way to remove text objects from PDF to avoid the slow re-encoding process?
could this be done using a free toolkit?
thanks
User avatar
Chris Corbyn
Breakbeat Nuttzer
Posts: 13098
Joined: Wed Mar 24, 2004 7:57 am
Location: Melbourne, Australia

Post by Chris Corbyn »

Not sure I understand. You want to remove parts of a PDF without re-encoding it? It can't be possible surely since you'd have to decode the PDF to do it. PDF encoding isn't slow in my experience using PHP. Have you played around with the PDF class by R & OS ? :)

http://ros.co.nz/pdf
mesh2005
Forum Newbie
Posts: 3
Joined: Mon Jan 30, 2006 2:06 am

Post by mesh2005 »

thanks a lot for your reply
i tried using PHP to generate PDF using the class R & O but it can't be used if you want to generate image on text pdf (hidden text) so i generated these pdfs using iText.
i don't want to do it again, i'm asking is it possible to parse the pdf and to remove any text object? if so, using which tool?
thanks a lot for your time
Post Reply