#16 |
creator of calibre
Posts: 45,382
Karma: 27756918
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
The pdfreflow.so module is a C module that takes a PDF and returns XML. The XML is not quite PDF draw commands (the C code does a little bit of cleanup/consolidation).
The calibre.ebooks.pdf.reflow Python module then takes that XML file and tries to "reflow" it (i.e. do things like unwrap analysis, identifying structure and so on). So the best place for you to do hacking is in calibre.ebooks.pdf.reflow
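
To give a rough idea of what "unwrap analysis" on such an XML file could look like, here is a minimal sketch. It is not the code in calibre.ebooks.pdf.reflow, and the XML layout is an assumption: the `pdfreflow`, `page` and `text` element names, the `width` attribute, and the `factor` parameter are all made up for illustration. The idea is simply that a line clearly shorter than the widest line on the page probably ends a paragraph, while full-width lines are joined with the line that follows.

```python
import xml.etree.ElementTree as ET

# Hypothetical input: these element and attribute names are invented for
# this sketch and are NOT the real output schema of pdfreflow.so.
SAMPLE = """\
<pdfreflow>
  <page number="1" width="600" height="800">
    <text left="50" top="100" width="500">This is a long line that fills the</text>
    <text left="50" top="115" width="498">whole column and wraps onto several</text>
    <text left="50" top="130" width="210">lines of the page.</text>
    <text left="50" top="160" width="495">A second paragraph starts here and</text>
    <text left="50" top="175" width="120">ends soon.</text>
  </page>
</pdfreflow>
"""


def unwrap_page(page, factor=0.9):
    """Join wrapped lines of one page into paragraphs.

    A line narrower than factor * (widest line on the page) is treated
    as the last line of a paragraph. `factor` is a hypothetical knob.
    """
    lines = [
        (float(t.get("width")), (t.text or "").strip())
        for t in page.iter("text")
    ]
    if not lines:
        return []
    max_width = max(width for width, _ in lines)
    paragraphs, current = [], []
    for width, text in lines:
        current.append(text)
        if width < factor * max_width:
            # Short line: close the current paragraph.
            paragraphs.append(" ".join(current))
            current = []
    if current:
        # Page ended on a full-width line; keep what we collected.
        paragraphs.append(" ".join(current))
    return paragraphs


def reflow(xml_text, factor=0.9):
    """Return a list of pages, each a list of unwrapped paragraphs."""
    root = ET.fromstring(xml_text)
    return [unwrap_page(page, factor) for page in root.iter("page")]


if __name__ == "__main__":
    for paragraph in reflow(SAMPLE)[0]:
        print(paragraph)
```

A width-based threshold like this is only one possible heuristic. Real documents also need things like punctuation, indentation and font changes taken into account, which is why the Python side is the more convenient place to experiment.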
Tags: conversion, linebreak, pdf, unwrap