Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Sigil

Notices

Reply
 
Thread Tools Search this Thread
Old 12-21-2015, 02:43 PM   #16
KevinH
Sigil Developer
KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.
 
Posts: 8,770
Karma: 6000000
Join Date: Nov 2009
Device: many
Wow,
That is news to me! I am not a big user of index generation and so never learned that.
Thank you!
KevinH is offline   Reply With Quote
Old 12-21-2015, 08:08 PM   #17
roger64
Wizard
roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.
 
Posts: 2,625
Karma: 3120635
Join Date: Jan 2009
Device: Kindle PW3 (wifi)


@Doitsu
Very nice. Thanks for your explanations.

Thanks all for your replies. My apologies for these dumb questions but, as a Linux user, I have been out of the ball game quite a long time. Now because of the kindness of eschwartz who provides a very useful and up-to-date Arch package, I can taste the goodness of the new Sigil.

Last edited by roger64; 12-22-2015 at 04:14 AM. Reason: corrected
roger64 is offline   Reply With Quote
Advert
Old 12-22-2015, 10:12 AM   #18
eschwartz
Ex-Helpdesk Junkie
eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.
 
eschwartz's Avatar
 
Posts: 19,421
Karma: 85400180
Join Date: Nov 2012
Location: The Beaten Path, USA, Roundworld, This Side of Infinity
Device: Kindle Touch fw5.3.7 (Wifi only)
Oh, you're an Arch user?

FWIW, the Arch Linux devs are very good at keeping it up-to-date, but 'tis true if you want to take advantage of bugfixes as they come in, my AUR package which builds from git is pretty handy.
eschwartz is offline   Reply With Quote
Old 12-24-2015, 03:54 AM   #19
roger64
Wizard
roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.
 
Posts: 2,625
Karma: 3120635
Join Date: Jan 2009
Device: Kindle PW3 (wifi)
Quote:
Originally Posted by eschwartz View Post
Oh, you're an Arch user?
Just a nephew (Antergos) but I use Arch and, from time to time Aur repos and packages.

A new question: my paperbook has one entry like this:
Code:
noir(e)
which means the words noir and noire have both to be indexed. No hierarchical meaning here.

How should I write this entry under the "Text to include" tab?

Last edited by roger64; 12-24-2015 at 03:55 AM. Reason: typo
roger64 is offline   Reply With Quote
Old 12-24-2015, 04:07 AM   #20
Doitsu
Grand Sorcerer
Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.
 
Doitsu's Avatar
 
Posts: 5,729
Karma: 24031401
Join Date: Dec 2010
Device: Kindle PW2
Quote:
Originally Posted by roger64 View Post
How should I write this entry under the "Text to include" tab?
You can use regular expressions in the Text to include box. I.e. you could use:

Text to include: noir[es]*
Index Entries: noir

EDIT: I forgot the final asterisk; without it the regex won't mach "noir." I added it to this post.

Last edited by Doitsu; 12-24-2015 at 07:26 AM.
Doitsu is offline   Reply With Quote
Advert
Old 12-24-2015, 04:15 AM   #21
roger64
Wizard
roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.
 
Posts: 2,625
Karma: 3120635
Join Date: Jan 2009
Device: Kindle PW3 (wifi)
I am confused. In the online help, it is written [Ww]illow which means either Willow or willow
If I use:

Text to include: noir[es]

does not mean: index only noire and noirs ?
or it will index noir noire noirs?

In fact I am looking for noir and noire. the second one is feminine.

Last edited by roger64; 12-24-2015 at 04:23 AM. Reason: confused
roger64 is offline   Reply With Quote
Old 12-24-2015, 07:02 AM   #22
Divingduck
Wizard
Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.
 
Posts: 1,166
Karma: 1410083
Join Date: Nov 2010
Location: Germany
Device: Sony PRS-650
Quote:
Originally Posted by KevinH View Post
Wow,
That is news to me! I am not a big user of index generation and so never learned that.
Thank you!
The index generator is one of the things why I use Sigil from time to time. It is the only program that take care of it outside a text editor (as fare as I know)
Divingduck is offline   Reply With Quote
Old 12-24-2015, 07:24 AM   #23
Doitsu
Grand Sorcerer
Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.
 
Doitsu's Avatar
 
Posts: 5,729
Karma: 24031401
Join Date: Dec 2010
Device: Kindle PW2
Quote:
Originally Posted by roger64 View Post
does not mean: index only noire and noirs [..] or it will index noir noire noirs?
I made a stupid beginner's mistake in my initial post. I forgot to add the final asterisk, which excluded "noir" from the matches.

noir[es]* will match noir, noire, noirs and noires

Quote:
Originally Posted by roger64 View Post
In fact I am looking for noir and noire.
if you don't want to include the masculine and feminine plural forms, simply use noir[e]*.

Last edited by Doitsu; 12-24-2015 at 07:26 AM.
Doitsu is offline   Reply With Quote
Old 12-24-2015, 08:24 AM   #24
roger64
Wizard
roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.
 
Posts: 2,625
Karma: 3120635
Join Date: Jan 2009
Device: Kindle PW3 (wifi)
@Doitsu

Excellent! Thanks for your explanation and patience
roger64 is offline   Reply With Quote
Old 12-24-2015, 08:28 AM   #25
Jellby
frumious Bandersnatch
Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.
 
Jellby's Avatar
 
Posts: 7,549
Karma: 19500001
Join Date: Jan 2008
Location: Spaniard in Sweden
Device: Cybook Orizon, Kobo Aura
Quote:
Originally Posted by Doitsu View Post
noir[es]* will match noir, noire, noirs and noires
and noirse, and noireee, and noirsssessessese

Quote:
if you don't want to include the masculine and feminine plural forms, simply use noir[e]*.
it wil also match noiree, noireee, noireeee...

I'd use noire?s? or noire? (if I'm right with the regex dialect). The ? means that the previous item (previous letter in this case) may appear only 0 or 1 time.

Or, if you don't want to think too much: noir|noire
Jellby is offline   Reply With Quote
Old 12-24-2015, 09:09 AM   #26
Doitsu
Grand Sorcerer
Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.
 
Doitsu's Avatar
 
Posts: 5,729
Karma: 24031401
Join Date: Dec 2010
Device: Kindle PW2
Quote:
Originally Posted by Jellby View Post
I'd use noire?s? or noire? (if I'm right with the regex dialect).
Sigil uses PCRE. I.e. both question marks and asterisks will work for zero or more occurrences.

Quote:
Originally Posted by Jellby View Post
Or, if you don't want to think too much: noir|noire
IMHO, that would be the safest option.

@roger64: You might want to adopt Jellby's suggestion and simply search for all variants using the following syntax:

noir|noire|noirs|noires

Of course you could also simply generate one entry for each variation. For example:

Code:
Text to include Index entries
noir            noir
noire           noir
noirs           noir
noires          noir
Doitsu is offline   Reply With Quote
Old 12-24-2015, 11:58 AM   #27
Jellby
frumious Bandersnatch
Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.
 
Jellby's Avatar
 
Posts: 7,549
Karma: 19500001
Join Date: Jan 2008
Location: Spaniard in Sweden
Device: Cybook Orizon, Kobo Aura
Quote:
Originally Posted by Doitsu View Post
Sigil uses PCRE. I.e. both question marks and asterisks will work for zero or more occurrences.
According to http://www.pcre.org/current/doc/html...ntax.html#SEC9:
? zero or one
* zero or more
Jellby is offline   Reply With Quote
Old 12-24-2015, 12:08 PM   #28
Doitsu
Grand Sorcerer
Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.
 
Doitsu's Avatar
 
Posts: 5,729
Karma: 24031401
Join Date: Dec 2010
Device: Kindle PW2
Quote:
Originally Posted by Jellby View Post
According to http://www.pcre.org/current/doc/html...ntax.html#SEC9:
? zero or one
* zero or more
I stand corrected.
Doitsu is offline   Reply With Quote
Old 12-24-2015, 08:33 PM   #29
roger64
Wizard
roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.
 
Posts: 2,625
Karma: 3120635
Join Date: Jan 2009
Device: Kindle PW3 (wifi)
Quote:
Originally Posted by Jellby View Post
Or, if you don't want to think too much: noir|noire
Excellent.

I am preparing -slowly- an index file with about 500 entries. Your remarks are quite useful to help me get through.

Thank you both.
roger64 is offline   Reply With Quote
Old 12-24-2015, 11:54 PM   #30
roger64
Wizard
roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.
 
Posts: 2,625
Karma: 3120635
Join Date: Jan 2009
Device: Kindle PW3 (wifi)
A practical test

Hi

I did a test using 30 entries (out of a 550 medium-size index) from a French book. It was mostly successful though some quirks seem to need to be ironed out. Please find attached here in the zip file:
- petiteliste.txt which is the text file I used to import and generate the index
- index.xhtml which is the index output that Sigil 9.2. (Linux Arch build) produced.
You can reproduce it by copying the text list in any EPUB text file. I also provided a screenshot of the index file in Sigil.

I - Two points that probably need to be considered.

1. Entries sorted out of the beginning of the alphabetical order.

For no reason than I can explain, some entries were classed right at the beginning of the alphabetical order under no heading. Here you can see it happen for Armani, but not Acton and for apparat and ascot but for not other words beginning with an a.

In the medium-sized index, this phenomenon did happen too and the entries concerned (between 5 to 10% of the total amount) seemed to be chosen randomly.

2. Letters with diacritics classed at the end of the alphabetical order.

Some words beginning with an accented letter (like âge or écossais) are sorted out under their own heading at the end of the alphabetical order. For the French language at least, they should be classed like if there was no diacritic: for example to be on the light side, in a French dictionary you can find âge between say affreux and agonie.

II - Practical tips.

3. Using exclusively the vertical bar (pipe).

In previous tries, I had problems with "Text to include" entries containing these very valid regex: (?i), [es] which were not dealt with but reproduced as such in the entry list. I dropped them totally and used exclusively the pipe (|) which works quite reliably for processing the index and can cope with all the cases I need.

I do not know if this problem comes from my Linux build or can be reproduced with other instances of Sigil.

4. Using a fixed value for the tab.

I used a Linux text editor named gedit to prepare the text file. I found handy to specify it in the Preferences to keep a fixed value for the tab (I chose 24). This allows for a more unified presentation.
Attached Thumbnails
Click image for larger version

Name:	indexscreenshot.png
Views:	202
Size:	45.4 KB
ID:	144888  
Attached Files
File Type: zip index.zip (2.3 KB, 108 views)

Last edited by roger64; 12-25-2015 at 05:56 AM. Reason: pipe and alphabetical order
roger64 is offline   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Question about indexing on basic e-reader bonacker Amazon Kindle 9 02-01-2015 04:15 AM
Troubleshooting Indexing latepaul Amazon Kindle 13 01-15-2013 05:22 PM
Question about disable indexing permanently by disabling access to "Search Indexes" WS64 Kindle Developer's Corner 1 12-17-2011 05:51 PM
kindle 3 indexing question kpfeifle Amazon Kindle 2 09-06-2010 12:07 AM
Question about indexing Dragoro Amazon Kindle 4 02-25-2009 03:39 PM


All times are GMT -4. The time now is 07:46 PM.


MobileRead.com is a privately owned, operated and funded community.