12-04-2013, 06:50 PM   #1
MelBr
Zealot
Posts: 105
Karma: 414068
Join Date: Feb 2013
Device: iPad Pro, Kobo Aura One
Djvu: Extracting ISBN numbers from a large number of books?

Hi all,

I have a large number of old math books, all in DjVu format, with somewhat messy filenames. I'd like to import them all into Calibre so I can categorize them, fix the names, etc.

Is there an easier way to extract the ISBNs than converting all the files to PDF and then running OCR on them? I'd really like to avoid that, since it would take a very long time, would produce large files, and still wouldn't guarantee that every ISBN is OCRed properly. The option of last resort is, of course, to type all the ISBNs manually, and I'd like to avoid that one too.

Thanks for any tips!

Mel
12-05-2013, 12:33 PM   #2
DaleDe
Grand Sorcerer
Posts: 11,470
Karma: 13095790
Join Date: Aug 2007
Location: Grass Valley, CA
Device: EB 1150, EZ Reader, Literati, iPad 2 & Air 2, iPhone 7
I guess it would depend on where the ISBN is stored in the file. Typically it would be in the metadata. What viewer are you using to look at the file? Can it view the metadata? See the DJVU page in our wiki for some viewers you might try.

Dale
12-06-2013, 04:33 PM   #3
MelBr
Zealot
Posts: 105
Karma: 414068
Join Date: Feb 2013
Device: iPad Pro, Kobo Aura One
Thank you for the comment, Dale!

Unfortunately, these files don't seem to have any metadata. I've checked with a few viewers. For example, DjView shows this:

[screenshot not preserved]

I'm starting to believe that some type of manual process is the only thing that will work. I'm looking at various OCR software packages to see if any of them can scan only the first N pages, but then I'd still have to extract the ISBN manually anyway… or maybe let a Calibre plugin extract it somehow. I'm still thinking about the best way to go about this.
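
For the "pull the ISBN out of the first N pages" step, a minimal sketch in Python might look like the following. It assumes the OCR (or text-layer) output of those pages is already available as a plain string. The names `find_isbns`, `is_valid_isbn10` and `is_valid_isbn13`, and the regular expression itself, are illustrative and not part of Calibre or any plugin. The check-digit validation is there to reject OCR misreads rather than import wrong numbers.

```python
import re

# Candidate pattern (an assumption, tune it for your scans): the label "ISBN",
# optionally followed by "-10" or "-13" and a colon, then 10 to 17 characters
# made of digits, hyphens and spaces, ending in a digit or X.
ISBN_CANDIDATE = re.compile(
    r"ISBN(?:-1[03])?:?\s*([0-9][0-9\- ]{8,15}[0-9Xx])", re.IGNORECASE
)


def is_valid_isbn10(digits: str) -> bool:
    """Check an ISBN-10: weights 10..1, weighted sum divisible by 11."""
    if len(digits) != 10 or not digits[:9].isdigit():
        return False
    if not (digits[9].isdigit() or digits[9] == "X"):
        return False
    total = sum(
        (10 - i) * (10 if ch == "X" else int(ch)) for i, ch in enumerate(digits)
    )
    return total % 11 == 0


def is_valid_isbn13(digits: str) -> bool:
    """Check an ISBN-13: alternating weights 1 and 3, sum divisible by 10."""
    if len(digits) != 13 or not digits.isdigit():
        return False
    total = sum(int(ch) * (1 if i % 2 == 0 else 3) for i, ch in enumerate(digits))
    return total % 10 == 0


def find_isbns(text: str) -> list[str]:
    """Return checksum-valid ISBNs found in text, without hyphens or spaces."""
    found = []
    for match in ISBN_CANDIDATE.finditer(text):
        digits = re.sub(r"[\- ]", "", match.group(1)).upper()
        valid = is_valid_isbn10(digits) or is_valid_isbn13(digits)
        if valid and digits not in found:
            found.append(digits)
    return found
```

Anything that fails the checksum is silently dropped here, so books that return an empty list would still need a manual look. Old scans with heavy OCR noise (for example "O" read for "0") would need a more forgiving pattern than this one.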
12-06-2013, 08:34 PM   #4
DaleDe
Grand Sorcerer
Posts: 11,470
Karma: 13095790
Join Date: Aug 2007
Location: Grass Valley, CA
Device: EB 1150, EZ Reader, Literati, iPad 2 & Air 2, iPhone 7
Where do you see the ISBN data? On a title page, near the beginning of the book? I suspect you will need to open the book to see it.

Dale
12-06-2013, 08:48 PM   #5
DaleDe
Grand Sorcerer
Posts: 11,470
Karma: 13095790
Join Date: Aug 2007
Location: Grass Valley, CA
Device: EB 1150, EZ Reader, Literati, iPad 2 & Air 2, iPhone 7
You might try the tools at http://djvu.sourceforge.net/index.html. They include djvused, a tool somewhat like Unix sed, which can extract data from a file.
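
As a rough illustration of how those command-line tools could be scripted over many files, here is a minimal Python sketch. It uses `djvutxt`, a companion to djvused in the same DjVuLibre package, and it only helps if the files carry a hidden text layer, which scanned books without OCR will not have. The helper names (`djvutxt_command`, `extract_front_matter`, `lines_mentioning_isbn`) and the 10-page default are assumptions made for the example, and the `-page=` option should be checked against the djvutxt man page for your installed version.

```python
import subprocess


def djvutxt_command(djvu_path, first_page=1, last_page=10):
    """Build the djvutxt command line for a range of pages."""
    return ["djvutxt", f"-page={first_page}-{last_page}", str(djvu_path)]


def extract_front_matter(djvu_path, first_page=1, last_page=10):
    """Return the hidden text of the given pages, or "" if djvutxt fails.

    An empty result usually means the file has no text layer at all.
    """
    result = subprocess.run(
        djvutxt_command(djvu_path, first_page, last_page),
        capture_output=True,
        text=True,
        encoding="utf-8",
        errors="replace",
        check=False,
    )
    return result.stdout if result.returncode == 0 else ""


def lines_mentioning_isbn(text):
    """Return the stripped lines of text that contain the word ISBN."""
    return [line.strip() for line in text.splitlines() if "isbn" in line.lower()]


# Example use over a folder (requires djvutxt to be installed and on PATH):
#
#   from pathlib import Path
#   for path in sorted(Path(".").glob("*.djvu")):
#       print(path.name, lines_mentioning_isbn(extract_front_matter(path)))
```

Files that come back with no lines at all are the ones that would still need OCR or manual entry.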

Dale
12-09-2013, 01:27 PM   #6
BobC
Guru
Posts: 691
Karma: 3026110
Join Date: Dec 2008
Location: Lancashire, U.K.
Device: BeBook 1, BeBook Pure, Kobo Glo, (and HD),Energy Sistem EReader Pro +
I've also found DJVUTOY a useful piece of software for manipulating DJVUs:

http://www.comicer.com/stronghorse/s...jVuToy_eng.zip

That is an English version; unfortunately, the main site is in Chinese.

With it you can split and merge DJVUs, insert bookmarks, and manipulate the hidden text by exporting it, editing it, and then re-importing it. I used an early version to create what is effectively a clickable TOC for a number of DJVUs. Documentation is sparse, so a bit of experimentation is needed to get the best out of it.

BobC
12-21-2013, 07:11 PM   #7
willus
Fuzzball, the purple cat
Posts: 1,273
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
@MelBr -- Any chance a sample .djvu file could be posted?
04-13-2014, 03:35 AM   #8
Noobish
Junior Member
Posts: 4
Karma: 12584
Join Date: Apr 2014
Device: none
I had the same problem. I like my books in PDF, since it is compatible with many OSes and devices and can be compressed, but for some reason the metadata is lost after conversion to PDF. I use free software for the conversion, by the way.

Check https://www.mobileread.com/forums/showthread.php?t=237519&highlight=[GUI+Plugin]+Extract+ISBN