View Single Post
Old 01-09-2012, 05:00 PM   #5
frostschutz
Linux User
frostschutz ought to be getting tired of karma fortunes by now.frostschutz ought to be getting tired of karma fortunes by now.frostschutz ought to be getting tired of karma fortunes by now.frostschutz ought to be getting tired of karma fortunes by now.frostschutz ought to be getting tired of karma fortunes by now.frostschutz ought to be getting tired of karma fortunes by now.frostschutz ought to be getting tired of karma fortunes by now.frostschutz ought to be getting tired of karma fortunes by now.frostschutz ought to be getting tired of karma fortunes by now.frostschutz ought to be getting tired of karma fortunes by now.frostschutz ought to be getting tired of karma fortunes by now.
 
frostschutz's Avatar
 
Posts: 2,282
Karma: 6123806
Join Date: Sep 2010
Location: Heidelberg, Germany
Device: none
PDF does not necessarily have a concept of indentation. Text (every line, or every word, or even every letter) is positioned using x,y coordinates on a page. Visually this is fine as you as a human recognize it, but it's a nightmare when it comes to machine-readability.

Enable the Debug feature in Calibre conversion and check the HTML of the first step (input html). If that is properly indented or at least has any kind of indication of indentation or different formatting for indented lines, then you might be able to convert it by using a HTML editor on the input file and search&replace those indicators. That's what I did with my last PDF file and it worked well.
frostschutz is offline   Reply With Quote