Quote:
Originally Posted by sealbeater
I guess we'll have to agree to disagree. I've converted PDF to editable formats quite readily and easily. Only problem is if the PDF is composed of images and even then, it's easy enough to convert to html and from there, I assuming to epub. I still haven't looked into a suitable way to create an epub from cmd line.
None of this is news. Nor is it really relevant.
As an aside, you can't write malware in pdf. Nor do I find it likely you can write games in "PDF". You can encapsulate code but I would like to see malware **written** in *pdf*, whatever that means. I'm aware of PostScript being a programming language but not PDF.
I'm assuming you've never used the Poppler tools then. That or your gift for hyperbole is unmatched.
pdftotxt ring a bell? Although personally, I would probably try to convert to XML. to have best chance of perserving italics and bolds.
|
pdftotext is a program that somebody already spent lots of time creating. Using that is hardly writing a pdf to epub converter. That is using somebody else's work to extract raw text and then running an epub encoder, keeping all the imperfections introduced by the text extractor.
Not at all comparable to what was requested.