12-04-2013, 06:50 PM | #1 |
Zealot
Posts: 105
Karma: 414068
Join Date: Feb 2013
Device: iPad Pro, Kobo Aura One
|
Djvu: Extracting ISBN numbers from a large number of books?
Hi all,
I have a large number of old math books and they're all in djvu format and filenames are somewhat messy. I'd like to import them all into Calibre so I can categorize them and fix names etc. Is there an easy(ier) way to extract ISBN numbers other than converting them all to PDF and then running OCR on all of them? I'd really like to avoid that since that would take very long time, would produce large files, and there's still no guarantee that it would OCR all of the ISBNs properly. Option of last resort is of course to manually type all the ISBN numbers. I'd like to avoid that one of course Thanks for any tips! Mel |
12-05-2013, 12:33 PM | #2 |
Grand Sorcerer
Posts: 11,470
Karma: 13095790
Join Date: Aug 2007
Location: Grass Valley, CA
Device: EB 1150, EZ Reader, Literati, iPad 2 & Air 2, iPhone 7
|
I guess it would depend on where the ISBN is stored in the file. This would typically be in some of the metadata. What viewer are you using to look at the file. Can it view the metadata? See DJVU in our wiki for some viewers you might try.
Dale |
12-06-2013, 04:33 PM | #3 |
Zealot
Posts: 105
Karma: 414068
Join Date: Feb 2013
Device: iPad Pro, Kobo Aura One
|
Thank you for a comment, Dale!
Unfortunately, these files don't seem to have any metadata. I've checked it with few viewers. For example, DjView shows this: I'm starting to believe that only some type of manual process is the only thing that will work. I'm looking at various OCR software packages to see if any of them can scan only first N pages but then I'll have to extract ISBN manually anyway… or maybe let Calibre plugin extract it somehow. I'm still thinking about the best way to go about this. |
12-06-2013, 08:34 PM | #4 |
Grand Sorcerer
Posts: 11,470
Karma: 13095790
Join Date: Aug 2007
Location: Grass Valley, CA
Device: EB 1150, EZ Reader, Literati, iPad 2 & Air 2, iPhone 7
|
Where do you see the ISBN data? On a title page, near the beginning of the book? I suspect you will need to open the book to see it.
Dale |
12-06-2013, 08:48 PM | #5 |
Grand Sorcerer
Posts: 11,470
Karma: 13095790
Join Date: Aug 2007
Location: Grass Valley, CA
Device: EB 1150, EZ Reader, Literati, iPad 2 & Air 2, iPhone 7
|
You might try the tools at http://djvu.sourceforge.net/index.html. They have a djvused tool that is somewhat like Unix sed to extract data from a file.
Dale |
12-09-2013, 01:27 PM | #6 |
Guru
Posts: 691
Karma: 3026110
Join Date: Dec 2008
Location: Lancashire, U.K.
Device: BeBook 1, BeBook Pure, Kobo Glo, (and HD),Energy Sistem EReader Pro +
|
I've also found DJVUTOY a useful piece of software for manipulating DJVUs. :
http://www.comicer.com/stronghorse/s...jVuToy_eng.zip is an English version, unfortunately the main site is in Chinese. With it you can split and merge DJVUs, insert bookmarks and manipulate hidden text by exporting it, editing it then re-importing it. I used an early version for creating effectively a clickable TOC for a number of DJVUs. Documentation is sparse so a bit of experimentation is needed to get the best out of it. BobC |
12-21-2013, 07:11 PM | #7 |
Fuzzball, the purple cat
Posts: 1,273
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
@MelBr -- Any chance a sample .djvu file could be posted?
|
04-13-2014, 03:35 AM | #8 | |
Junior Member
Posts: 4
Karma: 12584
Join Date: Apr 2014
Device: none
|
Quote:
check https://www.mobileread.com/forums/showthread.php?t=237519&highlight=[GUI+Plugin]+Extract+ISBN |
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Export ISBN numbers for import to Google books | IMFletch | Calibre | 10 | 06-22-2017 11:06 AM |
Aura How does aura handle large numbers of books? | jlynton | Kobo Reader | 4 | 09-20-2013 03:30 AM |
Large number of books on memory card | pwalker8 | Sony Reader | 8 | 03-24-2009 02:20 PM |
PRS-505 with large number of books? | murraypaul | Sony Reader | 22 | 07-08-2008 01:23 AM |
Hanlin with large number of books? | murraypaul | HanLin eBook | 3 | 06-23-2008 06:54 AM |