#1 |
Junior Member
Posts: 4
Karma: 6168
Join Date: Jan 2013
Device: none
|
pdf2htmlEX: A precise PDF to HTML converter
Many people describe it as a perfect tool. It's a pity that it doesn't have a Windows version. Maybe someone here can compile one?
Here are the demo links; check them out for yourself:
Typography: http://coolwanglu.github.io/pdf2htmlEX/demo/geneve.html
Formula: http://coolwanglu.github.io/pdf2htmlEX/demo/cheat.html
Magazine: http://coolwanglu.github.io/pdf2html...ssue65_en.html
http://coolwanglu.github.io/pdf2htmlEX/demo/demo.html
#2 |
frumious Bandersnatch
Posts: 7,543
Karma: 19001583
Join Date: Jan 2008
Location: Spaniard in Sweden
Device: Cybook Orizon, Kobo Aura
|
It seems to rely almost exclusively on JavaScript. The actual page markup is something like this:
Code:
<body>
  <div id="sidebar">
    <div id="outline">
    </div>
  </div>
  <div id="page-container">
  </div>
</body>
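To make the point concrete, here is a minimal sketch in Python (standard library only) that parses the skeleton quoted above and collects any readable text in it. The `TextCollector` class and `inspect` helper are names made up for this illustration, and the skeleton is only what was quoted in this post, not the full demo page.

```python
from html.parser import HTMLParser

# Skeleton as quoted in the post; the real demo page also loads scripts,
# which are not reproduced here.
SKELETON = """<body>
<div id="sidebar"><div id="outline"></div></div>
<div id="page-container"></div>
</body>"""


class TextCollector(HTMLParser):
    """Hypothetical helper: records element ids and any non-blank text."""

    def __init__(self):
        super().__init__()
        self.ids = []
        self.text = []

    def handle_starttag(self, tag, attrs):
        attr_map = dict(attrs)
        if "id" in attr_map:
            self.ids.append(attr_map["id"])

    def handle_data(self, data):
        # Ignore whitespace between tags; keep only readable text.
        if data.strip():
            self.text.append(data.strip())


def inspect(markup):
    """Return (ids, text) found in the static markup."""
    collector = TextCollector()
    collector.feed(markup)
    collector.close()
    return collector.ids, collector.text


ids, text = inspect(SKELETON)
print(ids)   # ['sidebar', 'outline', 'page-container']
print(text)  # []
```

The containers are there, but the static markup carries no text at all, so a plain HTML-to-ebook converter would have nothing to work with unless it also ran the page's scripts.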
#3 |
Junior Member
Posts: 4
Karma: 6168
Join Date: Jan 2013
Device: none
|
Why? Since it produces HTML, we can convert the HTML to whatever we like, right?
|
#4 |
frumious Bandersnatch
Posts: 7,543
Karma: 19001583
Join Date: Jan 2008
Location: Spaniard in Sweden
Device: Cybook Orizon, Kobo Aura
|
But the HTML is just what I showed. All the content is generated through JavaScript, and I guess converting that JavaScript to something else is much like converting from the original PDF.
|
#5 |
Digital Amanuensis
Posts: 727
Karma: 1446357
Join Date: Dec 2011
Location: Turin, Italy
Device: Several eReaders and tablets
|
I agree. That tool essentially converts "objects in PDF format" to "objects in HTML format", where "object" more or less means "a box containing text with (x, y) coordinates".
Try scaling the font in any of the examples (most notably, the math cheatsheet one), and you will get the idea of why this approach is not "right" if you really want to produce a (reflowable) eBook.
Last edited by AlPe; 06-09-2013 at 11:33 AM.
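A rough sketch of the font-scaling problem described in this post, in Python. It models each "object" as a text box pinned to fixed (x, y) coordinates and checks what happens when only the font size changes. The `Box` class, the `horizontal_overlaps` helper, the sample coordinates, and the 0.5 glyph-width ratio are all assumptions made up for this illustration; none of it is taken from pdf2htmlEX itself.

```python
from dataclasses import dataclass

# Assumed average glyph width as a fraction of the font size (illustrative only).
CHAR_WIDTH_RATIO = 0.5


@dataclass
class Box:
    """Hypothetical positioned text box: content plus fixed coordinates."""
    text: str
    x: float
    y: float
    font_size: float


def width(box, scale):
    """Estimated rendered width of the box after scaling its font."""
    return len(box.text) * box.font_size * scale * CHAR_WIDTH_RATIO


def horizontal_overlaps(boxes, scale=1.0):
    """Return (left, right) text pairs on the same line that collide
    when fonts are scaled but positions stay where they were."""
    hits = []
    for a in boxes:
        for b in boxes:
            same_line = a is not b and a.y == b.y
            if same_line and a.x < b.x and a.x + width(a, scale) > b.x:
                hits.append((a.text, b.text))
    return hits


line = [Box("Hello,", 0, 100, 10), Box("world", 40, 100, 10)]
print(horizontal_overlaps(line, 1.0))  # []
print(horizontal_overlaps(line, 2.0))  # [('Hello,', 'world')]
```

At the original size the two boxes sit side by side; at double size the first one runs into the second, because nothing tells the second box to move. A reflowable format would lay the words out again instead of keeping their coordinates.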
#6 |
Fuzzball, the purple cat
Posts: 1,298
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Compiling for Windows requires a port of the Poppler PDF rendering library. You might contact the author of PaperCrop (Taesoo Kwon) since he has ported Poppler for that.
|
#7 |
Junior Member
Posts: 1
Karma: 6000
Join Date: Aug 2011
Location: Shanghai, China
Device: ipad
|
Last edited by Alexander Turcic; 08-20-2013 at 06:41 AM. Reason: approved