Register Guidelines E-Books Search Today's Posts Mark Forums Read

Go Back   MobileRead Forums > E-Book Software > Calibre > Conversion

Notices

Reply
 
Thread Tools Search this Thread
Old 03-14-2014, 07:07 AM   #1
vanhout
Junior Member
vanhout began at the beginning.
 
Posts: 2
Karma: 10
Join Date: Jan 2012
Device: Nook Touch
About the algorithm "heuristic"

Hi,

How to find more information about the algorithm "heuristic" used by Calibre?
Can we change it?

thank you in avanced.
vanhout is offline   Reply With Quote
Old 03-14-2014, 07:49 AM   #2
itimpi
Wizard
itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.
 
Posts: 4,552
Karma: 950151
Join Date: Nov 2008
Device: Sony PRS-950, iphone/ipad (Marvin/iBooks/QuickReader)
Quote:
Originally Posted by vanhout View Post
Hi,

How to find more information about the algorithm "heuristic" used by Calibre?
Can we change it?

thank you in avanced.
I think that you would have to look at the Calibre source code to find out more information on exactly what is happening within that option.

Other than the few settings you can set on the dialog that is displayed to tweak its behaviour, you can only change the algorithm by altering the Calibre source code.
itimpi is offline   Reply With Quote
Old 03-14-2014, 02:39 PM   #3
vanhout
Junior Member
vanhout began at the beginning.
 
Posts: 2
Karma: 10
Join Date: Jan 2012
Device: Nook Touch
Quote:
Originally Posted by itimpi View Post
you can only change the algorithm by altering the Calibre source code.
Thanks for your reply itimpi,

If the algorithm is directly implemented in the code then little can be done.

The problem is that the algorithm only seems to work well with English documents, in French for example works pretty bad.
vanhout is offline   Reply With Quote
Old 03-14-2014, 04:38 PM   #4
BetterRed
null operator (he/him)
BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.
 
Posts: 20,459
Karma: 26645808
Join Date: Mar 2012
Location: Sydney Australia
Device: none
Quote:
Originally Posted by vanhout View Post
Thanks for your reply itimpi,

If the algorithm is directly implemented in the code then little can be done.

The problem is that the algorithm only seems to work well with English documents, in French for example works pretty bad.
@vanhout - If you can elaborate on what this - in French for example works pretty bad - actually means in practice, perhaps with some examples, then maybe something can be done about it.

I wonder whether there is an 'algorithm' as such, beyond what's required to implement the various selectable options available in the dialogue, and as is documented here ==>> http://manual.calibre-ebook.com/conv...tic-processing

Quote:
Heuristic Processing provides a variety of functions which can be used to try and detect and correct common problems in poorly formatted input documents. Use these functions if your input document suffers from poor formatting.
BR
BetterRed is offline   Reply With Quote
Old 03-14-2014, 05:27 PM   #5
user_none
Sigil & calibre developer
user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.
 
user_none's Avatar
 
Posts: 2,488
Karma: 1063785
Join Date: Jan 2009
Location: Florida, USA
Device: Nook STR
Pretty much all of the heuristic options are a series of regular expressions that detect and change specific patterns. All of the developers who have worked on heuristic processing (including myself) either speak English as their primary or only language so it's understandable that it's English centric. Though some aren't necessarily language specific.
user_none is offline   Reply With Quote
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
KT "Ghost covers/files" again at 670 books, "stale" image entries in firmware VirgoGirl Kobo Reader 4 04-06-2012 02:10 PM
Heuristic "Remove unnecessary hyphens" not working? therealjoeblow Conversion 2 03-06-2012 10:21 AM
Feature Request: configurable space setting for "Insert blank line" in "Look & Feel" therealjoeblow Calibre 15 07-25-2011 03:14 PM


All times are GMT -4. The time now is 09:29 PM.


MobileRead.com is a privately owned, operated and funded community.