|
|
Thread Tools | Search this Thread |
05-17-2019, 07:18 PM | #1 |
Zealot
Posts: 132
Karma: 13892
Join Date: Mar 2010
Device: Ipad, Kindle 7
|
Scan digital library and extract all Table of Contents from all books?
Is anyone aware of a tool or data extraction method to do this? Anything besides Calibre/plugin?
|
05-17-2019, 10:09 PM | #2 |
C L J
Posts: 2,912
Karma: 21115458
Join Date: Dec 2008
Location: Birmingham UK
Device: Sony e-reader 505, Kindle PW2, Kindle PW3, Kobo Libra2
|
What a peculiar thing to want to do! Please explain, I'm perplexed and curious.
|
Advert | |
|
05-18-2019, 12:41 AM | #3 |
Wizard
Posts: 1,841
Karma: 9547754
Join Date: Jul 2009
Location: Newcastle, Australia
Device: iPhone SE2020
|
Me too!!
|
05-18-2019, 04:20 AM | #4 |
“Tis but a scratch!”
Posts: 87
Karma: 12
Join Date: May 2019
Location: South Australia
Device: Mac 2007
|
|
05-18-2019, 06:16 AM | #5 |
Wizard
Posts: 1,280
Karma: 29121666
Join Date: Mar 2010
Location: UK
Device: Kobo Forma, Icarus, iPad Mini 2, Kobo Touch, Google Nexus 7
|
I can see this being very useful for anthologies.
|
Advert | |
|
05-18-2019, 07:16 AM | #6 |
“Tis but a scratch!”
Posts: 87
Karma: 12
Join Date: May 2019
Location: South Australia
Device: Mac 2007
|
Just looking at some Spiralizer recipe books in KU and obviously such an index record of recipes would be great to pull out eg Pad Thai recipes and see which books had them.
But of course my Kindle Reader (1.71 for Mac) won't allow cut & Paste - and Kindle Cloud (just checked) so just keeping a text file that you can search is out as far as Kindle is concerned. So will continue to watch here for any other ideas. |
05-19-2019, 06:47 PM | #7 | |
Grand Sorcerer
Posts: 11,732
Karma: 128354696
Join Date: May 2009
Location: 26 kly from Sgr A*
Device: T100TA,PW2,PRS-T1,KT,FireHD 8.9,K2, PB360,BeBook One,Axim51v,TC1000
|
Quote:
If you have a lot of books, do the collection in sections. This really is a job for a database manager, even a simple one, or a spreadsheet which is why you want the extracted TOC in rtf or Word format. You can then turn it into a table or csv file. |
|
05-20-2019, 07:32 AM | #8 | |
“Tis but a scratch!”
Posts: 87
Karma: 12
Join Date: May 2019
Location: South Australia
Device: Mac 2007
|
Quote:
It also occurred to me afterward that if you don't want to/use Calibre, for all sorts of reasons, you can do a screenclip of the TOC and do an OCR to populate a spreadsheet file for lets say, Spiraliser recipes. Then when you search for Pad Thai - you can then reload the original KU title and access the particular recipe. That way, if indeed KU actually pays authors for "fair use", no one loses. Amazon get their monthly fee and the author gets their "click" benefit |
|
05-20-2019, 05:07 PM | #9 |
Zealot
Posts: 132
Karma: 13892
Join Date: Mar 2010
Device: Ipad, Kindle 7
|
@fjtorres Thanks. Yeah, I was thinking of something like this, but since it numbers in the thousands it might be a little too much manual work. To answer why, basically I have weird research methods
Would be cool if I could run a script to OCR after TOC and then auto-populate a spreadsheet. Probably too niche to expect a tool like that to be out there and a little out of my scripting skills. Last edited by bounce; 05-20-2019 at 05:12 PM. |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Troubleshooting Kindle Paperwhite goes to table of contents when switching books | JeremyR | Amazon Kindle | 8 | 10-21-2018 09:30 AM |
Any way to add to/edit table of contents for epub books? | bookw0rm | Calibre | 2 | 02-20-2017 11:50 AM |
Force scan of library folder for unadded books | Barty | Library Management | 3 | 08-01-2015 04:18 PM |
Extract table of contents from mobi file | oecherprinte | Kindle Formats | 7 | 04-16-2012 12:10 PM |
Books with TABLE OF CONTENTS | dxt78 | Amazon Kindle | 9 | 12-11-2007 06:51 PM |