Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Calibre

Notices

Reply
 
Thread Tools Search this Thread
Old 08-06-2022, 11:31 AM   #1
albell
Member
albell began at the beginning.
 
Posts: 13
Karma: 10
Join Date: Oct 2016
Device: Onyx Boox N96ML
Apostrophes in Full text search

Still "playing" with this powerful new feature, I ran into a minor problem that I fear insoluble but I will try anyway

As an Italian-speaking user (but also in French there is this problem) I find myself looking for terms that in my books can be preceded by an article or a preposition followed by an apostrophe. For example: "dall'alto monte" (from the high mountain); "un'ottima cena" (a great dinner) and so on.

Now, searching only for words without an article ("alto monte" / "ottima cena") I expected to find those preceded by an article as well. Instead, the books in which these forms are present do not appear among the results. In practice, it seems that the forms preceded by article and apostrophe ("dall'alto", "un'ottima") are considered as unique terms.

I'm afraid it has to do with the apostrophe not being interpreted by the engine as a limit of a word, or what else?

Is there a way to bypass this problem or is it structural?

Many thanks!
albell is offline   Reply With Quote
Old 08-06-2022, 11:24 PM   #2
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 44,030
Karma: 22669822
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
There's no way to bypass it. Tokenization of text into words id done at indexing time, and once done its done. calibre uses the ICU library to do this tokenization and that uses language sensitive rules, for a number of languages including european ones.
kovidgoyal is offline   Reply With Quote
Advert
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Full Text Search query DrChiper Calibre 2 07-26-2022 06:31 AM
Full text search? excaliber Library Management 3 08-07-2017 06:09 AM
Full Text Search? silentguy Calibre 4 02-22-2012 03:03 PM
Full Text Search Engine Fat Abe General Discussions 1 09-21-2010 05:30 PM
Google Book Search to search full-text books online Bob Russell Deals and Resources (No Self-Promotion or Affiliate Links) 1 08-19-2006 12:13 PM


All times are GMT -4. The time now is 06:30 PM.


MobileRead.com is a privately owned, operated and funded community.