View Single Post
Old 07-26-2010, 06:14 PM   #3
user_none
Sigil & calibre developer
user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.
 
user_none's Avatar
 
Posts: 2,487
Karma: 1063785
Join Date: Jan 2009
Location: Florida, USA
Device: Nook STR
Quote:
Originally Posted by bjo View Post
But that's what Calibre is supposed to do, yes? If I've had to do a lot of post-processing, does that just mean I'm using it wrong?
You're not using it wrong. Structure detection is still a work in progress for some formats.

Also it's not as easy as you might think. Take the TXT input for instance. There is no differentiation other than new lines. Detecting the structure becomes very hard. There are heuristics that can be used but they quickly fall short when you move away from the language the heuristic is based on. We could do language specific processing... As Kovid is fond of saying patches are welcome. Python isn't LISP but it's a very easy and fun language.
user_none is offline   Reply With Quote