View Single Post
Old 04-16-2011, 03:36 PM   #86
telemetrics
Junior Member
telemetrics began at the beginning.
 
Posts: 8
Karma: 10
Join Date: Apr 2011
Device: IPad
Lightbulb Extract ISBN - Fantastic Feature. Further Suggestions

I just downloaded Calibre and was just wondering about this feature. Thanks a lot.

Feature 1: OCR
Is it possible to extract first and last 3/4 pages of an eBook and run this on an OpenSource (or Free) OCR.
http://code.google.com/p/tesseract-ocr/

Feature 2: Autorun "Download metadata and covers" for all files where ISBN was found.

Feature 3: Detect ISBN in File Name.
ISBN number in File Names are found in some cases. They may not have a the prefix of the string 'ISBN' but just direct number ISBN10 or 13. However we need to clean the special chars like Underscores and Square Brackets.

Feature 4: ReOrder Suggestion based on Name
Incase multiple ISBN numbers are found then we could show the options and let the user select one (in just one click). The Optional ISBN Numbers can be looked up and the titles and authors can be displayed next to it.
However these should be ordered based on the Distance from the Title of the option to the file name of the ebook.
http://en.wikipedia.org/wiki/Levenshtein_distance
telemetrics is offline   Reply With Quote