04-08-2019, 06:11 AM | #916 | |
null operator (he/him)
Posts: 20,587
Karma: 26954694
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
Quote:
My experience is that the indexing using default settings has no discernible effect on calibre performance or functionality on 100,000K book libraries. Search response is pretty much instantaneous, you can 'post the results' to calibre via the Drop Search Results plugin - which will Mark the Books, and from there you can tag, add to reading lists, save , send etc, etc. BR |
|
05-29-2019, 08:34 AM | #917 |
Zealot
Posts: 132
Karma: 10
Join Date: Oct 2015
Device: Sony Reader, Tolino Shine, Samsung Galaxy S3
|
I have a question regarding the plugin:
Is there any way to do a search and have the plugin disregard the difference between plain and typographic punctuation? Because that small but important difference is a hindrance to the "search epub" option of the plugin. This is why this question has come up for me: Some time ago, I was trying to figure out which ebook among the several hundreds I have a certain quote was from. I was pretty sure I remembered the sentence verbatim, and just in case tried out several possible variations, so as to help me find the correct ebook with the help of the quality check plugin's "search epubs" feature. I couldn't find the book, so I gave up. Several days later, I *accidentally* found the correct book, and noticed this: The quote I tried to search was indeed verbatim. The reason why the plugin would not find the ebook is the fact that the apostrophes in the quote were typographic ones instead of the plain ones, and me using the plain ones in the phrase I searched with caused the plugin to disregard the ebook. So is there any way to make sure that problem never pops up again? Thank you kindly! |
Advert | |
|
05-30-2019, 06:44 AM | #918 | |
Zealot
Posts: 108
Karma: 810
Join Date: Jul 2012
Device: Kobo
|
Quote:
a) although not specifically what you are referring to, there is already a "plain text content" option that can be selected so that html format features such as italics etc are disregarded in the search, but that does not currently resolve the difference between plain and typographic punctuation; there is also an option to "ignore case" so that capitalization does not matter in the search, b) when searching for specific text, perhaps choose sections of text that do not include punctuation such as apostrohes, c) if it is necessary to use search text that does include quotation marks and/or apostrophes, then for each occurrence of such punctuation, maybe include specific alternatives in your search request in the form of [\'\’ \"\“\”] for each occurrence, d) perhaps the plugin could have a seach option that ignores all punctuation and spaces and line breaks? Last edited by Rob557; 05-30-2019 at 06:46 AM. |
|
05-30-2019, 06:23 PM | #919 |
null operator (he/him)
Posts: 20,587
Karma: 26954694
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
IIRC the epub viewer's Find feature has similar behaviour wrt apostrophes and quote marks. I would not be surprised if QCs search didn't use the same code as the viewer's Find.
I use Windows Search in conjunction with the Drop Search Results plugin, it seems to find curly quotes etc when straight ones are typed in its search box. To search EPUBs you need to install an EPUB iFilter, see ==>> Content Search EPUB. WS is fast and it doesn't clog up calibre with indexes or extraneous formats. And you can widen the search to include non calibre objects - emails, spreadsheets, presentations, documents etc. The equivalent on a Mac is Spotlight, there's a thread somewhere on MR about using it with calibre. BR Last edited by BetterRed; 05-30-2019 at 06:26 PM. |
05-31-2019, 10:10 AM | #920 |
Zealot
Posts: 132
Karma: 10
Join Date: Oct 2015
Device: Sony Reader, Tolino Shine, Samsung Galaxy S3
|
Thank you kindly for your answers, Rob557 and BetterRed! Much appreciated!
While I'm glad it wasn't a case of me overlooking an obvious feature of the plugin, it's sad that this is a weakness of this otherwise great plugin. It's not always applicable to just choose a part of a sentence that doesn't have punctuation and the like, because what do you do when the only part you are 100% certain on is something that does include them? But thank you kindly for the tips! Especially the idea with using variants of the typographic punctuation in the search in future. That certainly would have helped a lot if I had thought about it at the time. Unfortunately, because I tend to make sure to edit all my ebooks so that they all have plain punctuation, the few times I do miss doing that edit, it trips me up like this. If the plugin could be modified to ignore all typographic punctuation that would be so great! I wonder if it's doable. BetterRed: Yeah, the viewer has that selfsame behaviour. Thank you kindly for the link! But I'm guessing it won't be applicable for me, since my OS is Ubuntu. And the system standard search tool in ubuntu doesn't allow for epubs to be searched anyway, hence why I am using the quality check plugin. Maybe there are other search tools that do allow for epub searching and I just haven't found them yet? Hmm... |
Advert | |
|
05-31-2019, 10:32 AM | #921 | |
hopeless n00b
Posts: 5,111
Karma: 19597086
Join Date: Jan 2009
Location: in the middle of nowhere
Device: PW4, PW3, Libra H2O, iPad 10.5, iPad 11, iPad 12.9
|
Quote:
e.g. prophecy.+mule |
|
05-31-2019, 11:19 AM | #922 |
Zealot
Posts: 132
Karma: 10
Join Date: Oct 2015
Device: Sony Reader, Tolino Shine, Samsung Galaxy S3
|
|
05-31-2019, 12:06 PM | #923 | |
hopeless n00b
Posts: 5,111
Karma: 19597086
Join Date: Jan 2009
Location: in the middle of nowhere
Device: PW4, PW3, Libra H2O, iPad 10.5, iPad 11, iPad 12.9
|
Quote:
For the most part, I just use it to find books with certain phrases similar to above: Code:
Scope: Plain text content \b(word1|word2).+word3 Code:
Scope: HTML content <img[^>]+src=['"]http |
|
05-31-2019, 12:37 PM | #924 | |
Zealot
Posts: 132
Karma: 10
Join Date: Oct 2015
Device: Sony Reader, Tolino Shine, Samsung Galaxy S3
|
Quote:
It's been a while, so I don't remember what I tried to use, but some regex or other that usually worked for me didn't for QC, so I've been basically doing without because I didn't have the patience to deal with it. Guess I should shelf my assumptions and revisit my decision to do without regex. Thank you kindly for the tips! |
|
05-31-2019, 02:53 PM | #925 | |
hopeless n00b
Posts: 5,111
Karma: 19597086
Join Date: Jan 2009
Location: in the middle of nowhere
Device: PW4, PW3, Libra H2O, iPad 10.5, iPad 11, iPad 12.9
|
Quote:
|
|
05-31-2019, 06:05 PM | #926 | |
null operator (he/him)
Posts: 20,587
Karma: 26954694
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
Quote:
There's a calibre plugin for Recoll integration - link in the Plugin Index. It has been abandoned, so it is not compatible with recent versions of calibre, but you might be able to resurrect it. And try a search for Lucene, its a widely used search engine for *x platforms. Added: IIRC the Multi-column search plugin has content search capabilities, I've not used it, but its developer is very active, so if it exhibits the same behaviour there's a good chance he'll fix it. BR Last edited by BetterRed; 05-31-2019 at 06:12 PM. |
|
05-31-2019, 06:50 PM | #927 |
hopeless n00b
Posts: 5,111
Karma: 19597086
Join Date: Jan 2009
Location: in the middle of nowhere
Device: PW4, PW3, Libra H2O, iPad 10.5, iPad 11, iPad 12.9
|
I do use MCS full text search quite a bit. Note, it can't search for text in ePub. It has to be plain TXT files.
|
05-31-2019, 11:16 PM | #928 |
null operator (he/him)
Posts: 20,587
Karma: 26954694
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
|
06-11-2019, 12:50 PM | #929 |
Junior Member
Posts: 3
Karma: 10
Join Date: Mar 2009
Device: iPad
|
Question about Adobe and DRM
You have a function to search for Adobe DRM and DRM in ebooks. What is the string you are searching for? I am getting books listed that do not contain DRM.
|
06-11-2019, 02:51 PM | #930 |
Junior Member
Posts: 3
Karma: 10
Join Date: Mar 2009
Device: iPad
|
So, more research and I answered it myself ... I installed the plugin "Modify Epub", and it will remove them the DRM meta tags for me. Works great!
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
[GUI Plugin] Clipboard Search | kiwidude | Plugins | 29 | 04-02-2024 10:05 PM |
[GUI Plugin] Search the Internet | kiwidude | Plugins | 433 | 04-01-2024 05:48 PM |
[GUI Plugin] Open With | kiwidude | Plugins | 403 | 04-01-2024 08:39 AM |
[GUI Plugin] Kindle Collections (old) | meme | Plugins | 2070 | 08-11-2014 12:02 AM |
[GUI Plugin] Book Sync **Deprecated** | kiwidude | Plugins | 111 | 06-07-2011 07:47 PM |