|
|
#16 |
|
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,609
Karma: 28549044
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
The pdfreflow.so module is a C module that takes a PDF and returns an XML. The XML is not quite PDF draw commands (the C code does a little bit of cleanup/consolidation).
The calibre.ebooks.pdf.reflow python module then takes that XML file and tries to "reflow" it (i.e. do things like unwrap analysis, identifying structure and so on). So the best place for you to do hacking in in calibre.ebooks.pdf.reflow |
|
|
|
![]() |
| Tags |
| conversion, linebreak, pdf, unwrap |
|
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| possible bug about.pdf Unwrap | zambosky | Calibre | 5 | 06-20-2010 10:53 AM |
| Line Spacing on PDF to Epub conversion | poodlemama | Calibre | 2 | 05-03-2010 09:28 PM |
| PDF Line Un-Wrap Factor bug? | jotekman | Calibre | 2 | 03-15-2010 12:43 PM |
| PDF line spacing | jjansen | Calibre | 3 | 03-08-2010 12:46 PM |
| PDF to ePub (New line problem) | Dark123 | Calibre | 3 | 02-13-2010 09:41 PM |