Quote:
Originally Posted by patrik
Well, if I get a pdf with a clean page printout on each page, run it through Finereader, it would give me a useful epub (with some Sigil work).
It would be way easier to buy the paper book, but... why not...
(This is more driven by my personal curiosity and that I like to learn how-to-do stuff.)
|
If you really wanna learn something you could try using an HTML parser in a language like Python or Perl to process the HTML to remove all the span tags (or whatever else makes it fixed layout) while keeping the rest.