06-28-2012, 08:00 AM   #1
fufu42
Junior Member
Posts: 3
Karma: 10
Join Date: May 2012
Device: Kobo Touch
Best way to search inside ebook library?

Hi, I've been searching for a while in all kinds of places and decided to ask here about the following problem.

I have a collection of ebooks in many different formats, from txt and pdf to epub, mobi, etc., all inside a calibre directory. Unfortunately, calibre has ebook viewing capability but no full-text search function.

From my point of view, the minimum requirements for search functionality would be:
- regular expressions in full text search
- search inside all common ebook formats
- unicode support

optional but not vital:
- pre indexed file content
- search inside archived files
- on the fly indexing

I have collected the following info so far:

- The program Beagle for Linux seems to have met my needs but isn't maintained any longer. I didn't try to install the latest version. Has anybody done so lately?

- Google Desktop is discontinued, and it probably had only boolean operators, not regex.

- Copernic Desktop Search seems interesting, but the developer site says nothing about regular expressions. I haven't tried it. Has anyone?

- Agent Ransack, which I just tried, seems interesting but probably can't search inside epub (and other formats), though it does fast regex full-text search in pdf, archived and other plain-text-like files. (If I decide to use that program, it would somewhat oddly mean that I'd have to convert all non-pdf files to pdf...) Agent Ransack does no indexing ahead of the search.

- I wouldn't hesitate to use command line tools. Basically, grep can do all I need for now. The question then would be how to extract a corpus with the necessary file information from the library, including pdf and epub formats.
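
For the grep-style route, here is a minimal Python sketch of one way to get at the text, not a finished tool. It assumes the library is a directory tree of files, that EPUBs are ordinary zip archives whose chapters are (X)HTML files, and that only .epub and .txt are handled. PDF and MOBI are skipped here and would need an external converter first. The names `epub_text` and `search_library` are made up for this example.

```python
import re
import zipfile
from html.parser import HTMLParser
from pathlib import Path


class _TextExtractor(HTMLParser):
    """Collects the text nodes of an (X)HTML document, dropping the markup."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        self.chunks.append(data)


def epub_text(path):
    """Yield (member_name, plain_text) for each HTML-like file inside an EPUB."""
    with zipfile.ZipFile(path) as archive:
        for name in archive.namelist():
            if name.lower().endswith((".xhtml", ".html", ".htm")):
                parser = _TextExtractor()
                parser.feed(archive.read(name).decode("utf-8", errors="replace"))
                parser.close()
                yield name, "".join(parser.chunks)


def search_library(root, pattern):
    """Return (file, member, line) hits for a regex across .epub and .txt files."""
    regex = re.compile(pattern)  # str patterns are Unicode-aware in Python 3
    hits = []
    for path in sorted(Path(root).rglob("*")):
        suffix = path.suffix.lower()
        if suffix == ".epub":
            sources = epub_text(path)
        elif suffix == ".txt":
            sources = [("", path.read_text(encoding="utf-8", errors="replace"))]
        else:
            continue  # assumption: PDF, MOBI etc. are converted separately
        for member, text in sources:
            for line in text.splitlines():
                if regex.search(line):
                    hits.append((str(path), member, line.strip()))
    return hits
```

Calling `search_library("/path/to/library", r"\bBerg\w*")` returns a list of (file, chapter file, matching line) tuples, which covers the "necessary file information" part. There is no index, so every search re-reads the whole library.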


Any other suggestions?

Thanks in advance!

Last edited by fufu42; 06-28-2012 at 09:57 AM.
12-17-2013, 09:32 PM   #2
Earthlark
Member
Posts: 13
Karma: 2386
Join Date: Nov 2012
Device: Kindle Touch
Did you ever find a solution to your problem? I have some dense ebooks with repeating terms related to different applications, so I've been trying to figure out how to do boolean or regular expression searches within a text (I suppose within sentences... or paragraphs), but I haven't found anything good yet. I guess I can figure out how to use grep with them, but it'd be nice if there were some sort of ebook viewer that allowed for easily doing this kind of search... Did you find anything? Cheers.
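
A boolean AND search within paragraphs can be sketched in a few lines of Python. This is only an illustration, assuming the book's text has already been extracted to a plain string with blank lines between paragraphs. The function name `paragraphs_with_all` is made up for this example.

```python
import re


def paragraphs_with_all(text, terms):
    """Return the paragraphs that contain every term (case-insensitive, whole words)."""
    patterns = [
        re.compile(r"\b" + re.escape(term) + r"\b", re.IGNORECASE) for term in terms
    ]
    # Assumption: paragraphs are separated by one or more blank lines.
    paragraphs = [p.strip() for p in re.split(r"\n\s*\n", text) if p.strip()]
    return [p for p in paragraphs if all(rx.search(p) for rx in patterns)]
```

Passing `["pressure", "valve"]` keeps only the paragraphs that mention both words, which needs no regex knowledge on the caller's side because the terms are escaped before matching.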
12-18-2013, 04:55 AM   #3
HarryT
eBook Enthusiast
Posts: 85,544
Karma: 93383099
Join Date: Nov 2006
Location: UK
Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6
There's no point in asking a question of someone who, as you can see for yourself, hasn't visited MR for over a year.
12-18-2013, 05:09 AM   #4
latepaul
Wizard
Posts: 1,270
Karma: 10468300
Join Date: Dec 2011
Device: a variety (mostly kindles and kobos)
There's a Calibre plugin called "Quality Check" that has a search epubs option. Last time I checked, it only worked on epubs, but that was a while ago.
12-20-2013, 05:58 PM   #5
Earthlark
Member
Posts: 13
Karma: 2386
Join Date: Nov 2012
Device: Kindle Touch
Thanks, latepaul. I checked it out a bit, and it looks pretty useful, especially if one needs to search across multiple volumes for related terms/concepts. My regex skills are limited, but once I figure out the correct input, it should work fine. In the meantime, I'll probably also post something else to see if there are any other options that fit my purposes in a more user-friendly way, without too much regex and allowing me to jump directly to locations, etc. Thanks again. Merry Christmas.
All times are GMT -4.