Old 06-09-2012, 09:39 PM   #1
dnjmama
Junior Member
dnjmama began at the beginning.
 
Posts: 4
Karma: 10
Join Date: Jun 2012
Device: nook
Searching ebook body/text across library-Is it Possible?

Hope this is the correct forum for this….

I am looking for a way to search the ebook text across a calibre library for a literal string (e.g. the word “maypole” or the name “John Smith”, etc). I do NOT mean the metadata, but the actual ebook text.

Can anyone tell me how I can do this in Calibre?

I know I can search within a given ebook, but I am looking to search across an entire library to find the book(s) containing the string.

The only thing I have found that relates, is in the powerful “search and replace” function under “Convert books”. I realize I could search for a string of “maypole” and replace with “maypole” (e.g. no real change), but I don’t see a list of the books where the change was actually found & replaced*.

Of course, it is risky to use a search and REPLACE function in this way, as one could accidentally replace/damage the original text, but at least it would be a workaround.

*Better still would be the book and a ‘clip’ of the text from each place in each ebook where the string was found. The ‘clip’ would be the search string surrounded by, say, the 15 words before and the 15 words after each occurrence.
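The 'clip' idea described above can be sketched in a few lines of Python. This is only an illustration of the requested behaviour, not something calibre provides: the function name `clips` and its `words_around` parameter are made up for this example, and it assumes the book text has already been extracted to a plain string.

```python
import re

def clips(text, phrase, words_around=15):
    """Return one clip per occurrence of `phrase` (case-insensitive),
    with up to `words_around` whole words of context on each side."""
    words = text.split()
    joined = " ".join(words)              # collapse all runs of whitespace
    phrase = " ".join(phrase.split())     # normalise the phrase the same way
    if not phrase:
        return []
    results = []
    for match in re.finditer(re.escape(phrase), joined, flags=re.IGNORECASE):
        # Counting the spaces before a position gives the index of the word there.
        first_word = joined.count(" ", 0, match.start())
        last_word = joined.count(" ", 0, match.end())
        start = max(0, first_word - words_around)
        results.append(" ".join(words[start:last_word + 1 + words_around]))
    return results

if __name__ == "__main__":
    sample = "On the first of May the whole village danced around the maypole until dusk"
    for clip in clips(sample, "maypole", words_around=3):
        print(clip)
```

Because the match is a literal substring, a search for "maypole" also finds "maypoles"; the clip keeps the whole word in that case.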

Why am I looking for this? As libraries grow, it is useful to search for many things: In fiction, the first/all places a character or location is referenced; In non-fiction, one might be searching for an ingredient(“carbon dioxide” or “chicken thigh”), or a city, or a person (“Albert Einstein”, etc), a technique (“godet” or “miter”), etcetera.

Thanks for reading this—Please let me know if I have missed something, or if there is something else I should try.

Old 06-09-2012, 11:42 PM   #2
kovidgoyal
creator of calibre
Posts: 43,844
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
At the moment there is no way to search inside books in calibre. It's on my todo list to be implemented someday, along with about 300 other things.
Old 06-10-2012, 04:03 AM   #3
kiwidude
Calibre Plugins Developer
 
Posts: 4,636
Karma: 2162064
Join Date: Oct 2010
Location: Australia
Device: Kindle Oasis
You can search across ePub books for text using the Quality Check plugin. It only works on the ePub format, and it doesn't currently show the context for the match, though that has been asked for.
Old 06-10-2012, 06:06 PM   #4
dnjmama
Junior Member
dnjmama began at the beginning.
 
Posts: 4
Karma: 10
Join Date: Jun 2012
Device: nook
Kovidgoyal: Thank you for the reply, nice to know it's on a list & I'm not alone in desiring it--as libraries grow, management tools become more necessary.

Kiwidude: Thank you for mentioning a potential aid--I will check it out. As author of the plugin: Are there any pointers or warnings you'd care to pass on about using your Quality Check plugin for searching within ePubs? Feel free to point me to a posting, if you've already spelled this out somewhere...

(Thank you--I did get that this is for epubs only (& w/o context) but as that happens to be 85% of my materials, it might help--I'll have to see, the other 15% are PDF & I could convert, but probably 5-8% are primarily image based PDFs.)
Old 06-10-2012, 06:50 PM   #5
kiwidude
Calibre Plugins Developer
 
Posts: 4,636
Karma: 2162064
Join Date: Oct 2010
Location: Australia
Device: Kindle Oasis
@dnjmama - not a huge amount of things I could say about it since it is a pretty straightforward dialog. You will only be interested in setting the scope to "HTML content" which I think is the default.

You can assign a keyboard shortcut to it via the Customise plugin -> Keyboard shortcuts option at the bottom of the Quality Check menu, or via Preferences -> Keyboard.

If you only want to search a subset of your library, do a normal calibre search for just the books of interest, use the "Restrict to" option on the top left of the screen and choose "*Current search". From that point onwards any features inside the Quality Check plugin including this Search ePubs option will be limited to just that same subset of books. Use that same Restrict to dropdown to set back to blank to look at all your books again.

When the search is completed, it will display all the books it found matches in. The View log option on the dialog will show you the details of exactly which html file page inside the ePub it found the first match in (it stops looking after finding the first occurrence in an ePub).

Other than that you will see that same plugin has a squillion other features, but they aren't relevant to your query in this thread. See the Help file option on the plugin's menu if you want to learn more about them. Enjoy...
Old 06-12-2012, 01:00 AM   #6
unboggling
Wizard
 
Posts: 1,065
Karma: 858115
Join Date: Jan 2011
Device: Kobo Clara, Kindle Paperwhite 10
Another option is to use the search facility of your Operating System (outside of calibre) to search file content of all files within one or more calibre Library folders, assuming all the Libraries are in one folder called, say "Calibre Libraries". The problem with this method is that "book formats" that are archives (multiple files packaged as one) such as EPUB, MOBI, ZIP, RAR get ignored.
Old 04-08-2013, 11:10 AM   #7
retval
Enthusiast
retval began at the beginning.
 
Posts: 29
Karma: 10
Join Date: Mar 2011
Device: none
Alternatively

I also need this option. It would be great if it were possible to search the entire library and, at the same time, within the texts. Ideally, the results would appear in another window showing the term highlighted in context, so you know immediately whether a hit is useful. If you do research, this would be very important, as it would save many hours of searching.

Until calibre incorporates this, I use Copernic Desktop Search. It is good and allows you to search all file types, but it must be given a long time to index the content.
Old 04-08-2013, 10:56 PM   #8
BetterRed
null operator (he/him)
 
Posts: 20,559
Karma: 26954694
Join Date: Mar 2012
Location: Sydney Australia
Device: none
Quote:
Originally Posted by unboggling
Another option is to use the search facility of your Operating System (outside of calibre) to search file content of all files within one or more calibre Library folders, assuming all the Libraries are in one folder called, say "Calibre Libraries". The problem with this method is that "book formats" that are archives (multiple files packaged as one) such as EPUB, MOBI, ZIP, RAR get ignored.
On OS X with Spotlight, or on Linux with Recoll, EPUB and MOBI files can be content searched, as can most other formats.

Windows can search CHM, DjVu, RAR, ZIP, CAB and other files if you install the IFilters available from MS, IFiltershop, Citeknet and IFilter.org. They're all free, though some of the links at IFilter.org are broken and some are duplicates.

One simple and relatively inexpensive way of content indexing and searching ebooks on Windows is to create a TXT file, via Calibre Conversion. Windows will index the text files, and search will get you to the book folders by displaying a list of text files that meet your search criteria. Open file location will put you into the book folder and you can double click on the epub, mobi whatever - be careful, and don't change anything.
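For anyone who would rather script the TXT step than convert books one by one in the GUI, here is a rough sketch. It assumes calibre's `ebook-convert` command-line tool is installed and on the PATH. Unlike the approach described above, it writes the text copies to a separate folder rather than into the book folders, so nothing inside the calibre library is touched; the helper names `text_target` and `make_text_copies` and the folder arguments are invented for this example.

```python
import subprocess
from pathlib import Path

def text_target(book, library_dir, out_dir):
    """Map a book file inside the library to a .txt path under out_dir,
    keeping the same Author/Title sub-folders."""
    relative = Path(book).relative_to(library_dir)
    return Path(out_dir) / relative.with_suffix(".txt")

def make_text_copies(library_dir, out_dir, extensions=(".epub", ".mobi")):
    """Create one plain-text copy per book so the OS indexer can read it."""
    for book in sorted(Path(library_dir).rglob("*")):
        if book.suffix.lower() not in extensions:
            continue
        target = text_target(book, library_dir, out_dir)
        if target.exists():
            continue  # already converted, possibly from another format of the same book
        target.parent.mkdir(parents=True, exist_ok=True)
        # ebook-convert infers the output format from the .txt extension.
        subprocess.run(["ebook-convert", str(book), str(target)], check=True)

if __name__ == "__main__":
    # Both paths are placeholders; point them at your own folders.
    make_text_copies("Calibre Library", "Calibre Text Index")
```

A search hit in the text folder then tells you the author and title folder to look for in the real library.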

If you're searching your ebooks in Windows I recommend you don't do it from the Start menu; instead, navigate to your library in Windows Explorer and search from there (top right corner). Searching from the Start menu can throw up too many "What the heck is that" and "Where the hell is it then" results.

If you don't keep RAR or ZIP files in the book folders, then you could put the TXT in a ZIP or RAR. It should be quite a lot smaller, and it will get indexed if the appropriate IFilter is installed.

I never made Windows Desktop Search work satisfactorily on XP, Google and Copernic weren't any better. IMO the only one that was any good was the one from Yahoo, but they deprecated it - to the bit bucket

BR
Old 11-18-2014, 06:57 AM   #9
phlame64
Member
phlame64 began at the beginning.
 
Posts: 19
Karma: 10
Join Date: May 2010
Location: The Internet
Device: Kobo Aura One
I don't know if I am allowed to talk about other software here, but you might use the (commercially available) dtSearch Desktop: h*tp://www.dtsearch.com/plf_desktop_2.html

It allows you to index text in your entire library (every single book, even terabytes of data) and search for specific words or sentences in a matter of seconds. It does not support the ePub format natively, but ePubs are nothing more than archives containing .xhtml files, among others, which contain text that dtSearch can extract and index.
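Since an ePub is an ordinary ZIP archive of (X)HTML files, a small script can do a crude version of the same search without any indexer. The sketch below is an illustration only, not how dtSearch or any calibre plugin works: the function names are invented, the tag stripping is a simple regular expression, and DRM-protected books will not yield readable text. It stops at the first matching file in each book and reports its name.

```python
import html
import re
import zipfile
from pathlib import Path

TAG = re.compile(r"<[^>]+>")

def first_match_in_epub(epub_path, phrase):
    """Return the name of the first (X)HTML file inside the ePub whose
    text contains `phrase` (case-insensitive), or None if none does."""
    needle = " ".join(phrase.lower().split())
    with zipfile.ZipFile(epub_path) as archive:
        for name in archive.namelist():
            if not name.lower().endswith((".xhtml", ".html", ".htm")):
                continue
            markup = archive.read(name).decode("utf-8", errors="replace")
            # Crude tag removal; a word split by inline tags will not match.
            text = " ".join(html.unescape(TAG.sub(" ", markup)).split())
            if needle in text.lower():
                return name
    return None

def search_library(library_dir, phrase):
    """Map each matching ePub path to the first internal file that matched."""
    hits = {}
    for epub_path in sorted(Path(library_dir).rglob("*.epub")):
        try:
            name = first_match_in_epub(epub_path, phrase)
        except zipfile.BadZipFile:
            continue  # skip files that are not valid ZIP archives
        if name is not None:
            hits[str(epub_path)] = name
    return hits

if __name__ == "__main__":
    # "Calibre Library" is a placeholder for your own library folder.
    for book, member in search_library("Calibre Library", "maypole").items():
        print(book, "->", member)
```

This reads every book on every search, so it is far slower than an indexed tool on a large library, but it needs nothing beyond a standard Python install.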

You might feel like using very strict filters to get exactly what you want, and results are displayed by relevance/occurrence in a separate window. Not just the word/sentence you were looking for, but an extract of the entire page will be shown in that window.

I use it regularly for research purposes.
Old 11-20-2014, 10:41 AM   #10
LadyKate
Fanatic
 
Posts: 515
Karma: 1470724
Join Date: Jul 2013
Location: Quebec CA
Device: android 4 (samsung tablet and asus tablet)
A freebie utility "searchmyfiles" works well also.
Tags
library, search

