Old 12-26-2015, 09:40 AM   #46
KevinH
Sigil Developer
Posts: 8,494
Karma: 5703586
Join Date: Nov 2009
Device: many
Qt's QString does have a locale-aware comparison function, but it always uses the user's locale, which cannot be set in the function call. On Mac OS X it uses the International preference settings to determine which locale to use.

The only issue then would be that an ebook author must run Sigil in the locale where the book is to be read. For authors running Sigil in their native language, there should be no problem. For ebook professionals who produce ebooks in multiple translations, this is going to be an issue.

I can try to change the code to use a locale-aware comparison, but handling this in a plugin may be better, as it gives you more control over the source and destination locales.
Old 12-26-2015, 09:55 AM   #47
roger64
Wizard
Posts: 2,624
Karma: 3120635
Join Date: Jan 2009
Device: Kindle PW3 (wifi)
Quote:
The only issue then would be that an ebook author must run Sigil in the locale where the book is to be read. For authors running Sigil in their native language, there should be no problem. For ebook professionals who produce ebooks in multiple translations, this is going to be an issue.
Expecting a [locale]* publisher to use a [locale] interface of Sigil does not seem to be an undue requirement. This would already be a big step forward for the "accented" world...
* insert your own language here.

Quote:
I can try to change the code to use a locale-aware comparison, but handling this in a plugin may be better, as it gives you more control over the source and destination locales.
Let's see if there are any proposals...

Last edited by roger64; 12-26-2015 at 09:57 AM. Reason: locale
Old 12-26-2015, 08:14 PM   #48
roger64
Wizard
I've found this article

https://en.wikipedia.org/wiki/Alphab...ic_conventions

which gives some insight into this question. It explains why it is sound to abide by language-specific conventions when building alphabetically ordered lists.

Of course there is also this, but for experts only.
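
To illustrate the language-specific conventions the article describes, here is a minimal sketch in Python. The two key functions and the sample words are hypothetical and hand-written for this example; they cover only two well-known rules (German dictionary order treats umlauts like their base vowels, while Swedish places å, ä, ö after z). Real collation should rely on a proper collation library rather than rules like these.

```python
import unicodedata

def german_key(word):
    # German dictionary order: an umlaut sorts like its base vowel,
    # so decompose each letter and drop the combining marks.
    decomposed = unicodedata.normalize("NFD", word.lower())
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))

SWEDISH_ALPHABET = "abcdefghijklmnopqrstuvwxyzåäö"

def swedish_key(word):
    # Swedish: å, ä and ö are letters in their own right, placed after z.
    text = unicodedata.normalize("NFC", word.lower())
    return [
        SWEDISH_ALPHABET.index(ch) if ch in SWEDISH_ALPHABET
        else len(SWEDISH_ALPHABET) + ord(ch)  # unknown characters go last
        for ch in text
    ]

words = ["zon", "öl", "ost"]
print(sorted(words, key=german_key))   # ['öl', 'ost', 'zon']
print(sorted(words, key=swedish_key))  # ['ost', 'zon', 'öl']
```

The same three words end up in a different order depending on the convention, which is why the target language of the book matters for an index.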

Last edited by roger64; 12-26-2015 at 08:19 PM.
Old 12-26-2015, 10:28 PM   #49
KevinH
Sigil Developer
roger64,
There is no question about how to go about this, only whether it would be better done in a plugin. Doing it in a plugin would allow the ebook developer to specify the target locale independently of the locale the developer is currently working in, whereas QString's localeAwareCompare in Qt does not.
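
As a rough sketch of what "specifying the target locale independently" could look like in Python (the language Sigil plugins are written in), the following uses only the standard `locale` module. This is not Sigil's plugin API; the helper names `collation_locale` and `sort_for_locale` are made up for illustration. It also assumes the requested locale is installed on the operating system, and locale names such as `fr_FR.UTF-8` differ between platforms.

```python
import locale
from contextlib import contextmanager

@contextmanager
def collation_locale(name):
    """Temporarily switch LC_COLLATE to `name`, then restore the old value.

    Note: setlocale() affects the whole process and is not thread-safe.
    """
    previous = locale.setlocale(locale.LC_COLLATE)  # query the current setting
    locale.setlocale(locale.LC_COLLATE, name)       # raises locale.Error if unavailable
    try:
        yield
    finally:
        locale.setlocale(locale.LC_COLLATE, previous)

def sort_for_locale(entries, target_locale):
    """Sort index entries by the collation rules of `target_locale`."""
    with collation_locale(target_locale):
        # strxfrm() turns each string into a key that compares
        # according to the currently active LC_COLLATE setting.
        return sorted(entries, key=locale.strxfrm)

# The "C" locale exists everywhere; a real call might pass "fr_FR.UTF-8"
# or similar, provided that locale is installed.
print(sort_for_locale(["zebra", "apple", "mango"], "C"))
```

Because the locale is passed as an argument, the result no longer depends on the locale Sigil itself is running in, which is the control a plugin would add over localeAwareCompare.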

Last edited by KevinH; 12-26-2015 at 10:31 PM.
Old 12-27-2015, 09:02 AM   #50
roger64
Wizard
@KevinH
Thanks for your very useful explanation - and patience.
Old 12-30-2015, 08:10 PM   #51
roger64
Wizard
About index-size

This will be my last 2015 question.

It's related to index size, but maybe the answer is not Sigil-related. I have been puzzled by some figures that I checked with both the Calibre editor and Sigil: they report exactly the same figures.

Take an EPUB that includes a medium-size index (530 entries or so).
The total size of this EPUB (images, fonts, everything) is 2129 k.
When I open this EPUB and check the size of index.xhtml (either with Sigil's report tool or directly in the Calibre editor's file navigator), it is 364 k.
I suppress index.xhtml, plus what is in the Index Editor window (I know it's a little more than 7 k).
I save and open again: the new size of the EPUB is now 2098 k.

This process can be reversed: importing a 7 k text file and creating a 364 k index file in this 2098 k EPUB yields a 2129 k EPUB again.

I am just puzzled...

A logical answer would be that the suppressed files were not really suppressed. Is this possible? After deleting, I did save.

Last edited by roger64; 12-30-2015 at 08:25 PM. Reason: italics
Old 12-30-2015, 09:43 PM   #52
KevinH
Sigil Developer
An epub is compressed. If you remove a file from an epub, you only save the size of the compressed version of that file. So your 364k file, once compressed, was roughly 2129k - 2098k = 31k in size. Given that many of the links have similar hrefs, that seems plausible to me.
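
The effect can be reproduced with a small sketch using Python's standard `zipfile` module. Everything here is made up for illustration: the file names, the 530 generated entries, and the random bytes standing in for images and fonts. It is not how Sigil builds an epub, only a demonstration that removing a repetitive file shrinks the archive by far less than the file's uncompressed size.

```python
import io
import os
import zipfile

def archive_size(files):
    """Return the size in bytes of a deflate-compressed ZIP holding `files`."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w", zipfile.ZIP_DEFLATED) as archive:
        for name, data in files.items():
            archive.writestr(name, data)
    return len(buffer.getvalue())

# Hypothetical index: 530 entries whose links differ only in a few digits.
index_xhtml = "".join(
    f'<p><a href="../Text/chapter{i % 20:02d}.xhtml#sigil_index_id_{i}">'
    f"Entry {i}</a></p>\n"
    for i in range(1, 531)
).encode("utf-8")

# Stand-in for images and fonts: random bytes barely compress at all.
other_files = {"OEBPS/Images/cover.jpg": os.urandom(200_000)}

with_index = archive_size({**other_files, "OEBPS/Text/index.xhtml": index_xhtml})
without_index = archive_size(other_files)
saved = with_index - without_index

print(f"index.xhtml uncompressed: {len(index_xhtml)} bytes")
print(f"archive shrinks by:       {saved} bytes when it is removed")
```

The second number should come out as a small fraction of the first, which matches the 364k versus 31k observation.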

Perhaps I do not understand what you are asking?
Old 12-31-2015, 03:24 AM   #53
Doitsu
Grand Sorcerer
Posts: 5,687
Karma: 24031401
Join Date: Dec 2010
Device: Kindle PW2
Quote:
Originally Posted by roger64 View Post
I suppress index.xhtml, plus what is in the Index Editor window (I know it's a little more than 7k).
I save and open again: the new size of the EPUB is now 2098k.
I'm also not quite sure what you're asking about. (I'm assuming that by suppress you mean supprimer, which is remove in English.)

If you generate an index, Sigil will add a special index id to each paragraph in which an entry occurs. It has the following format:

Code:
id="sigil_index_id_1"
Even if you delete Index.xhtml and the Index editor entries, these special ids won't be automatically deleted. I.e., each initial index generation will increase the file size of an ePub.
If the file size of an ePub is important to you, you'll have to manually remove these ids from the epub file using a regular expression:

Code:
id="sigil_index_id_\d+"
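
For anyone who prefers to script this, here is a minimal sketch applying the same regular expression in Python. The helper name `strip_index_ids` and the sample markup are hypothetical. As an assumption of this sketch, the pattern also swallows the whitespace in front of the attribute so that no stray space is left in the tag. Only do this once the index itself is gone, since its links point at these ids.

```python
import re

# Doitsu's pattern, preceded by \s+ to also remove the separating whitespace.
INDEX_ID = re.compile(r'\s+id="sigil_index_id_\d+"')

def strip_index_ids(xhtml):
    """Remove every Sigil-generated index id attribute from `xhtml`."""
    return INDEX_ID.sub("", xhtml)

sample = '<p id="sigil_index_id_12">Some indexed text.</p><p id="intro">Kept.</p>'
print(strip_index_ids(sample))
# <p>Some indexed text.</p><p id="intro">Kept.</p>
```

Other ids, such as `id="intro"` above, are left alone because the pattern only matches the `sigil_index_id_` prefix followed by digits.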
Old 12-31-2015, 05:10 AM   #54
roger64
Wizard
Thanks for your answers, and sorry for being unclear. Yes, by "suppress" I really meant "remove"...

OK: I will thoroughly clean this EPUB of all its index tags and do the maths again... in early 2016.

I wish you all and Sigil a happy New Year.
Old 12-31-2015, 09:31 PM   #55
roger64
Wizard
This question arose from a casual look at a Sigil report on the EPUB to which I had added an index: I noticed that the total size of all text files was 473 k and that the size of the index file was 364 k. Really? An index whose size is 3/4 of the text files? I wanted to know more about it.

Here is the answer: basically, as Kevin said, it's all about compressed versus uncompressed sizes. But unless you check, you don't know...

Without any index at all,
the compressed EPUB size is 2092 k
the uncompressed EPUB size is 2608 k
But once indexed,
the compressed EPUB size increases to 2129 k (+37 k)
the uncompressed EPUB size increases to 2978 k (+370 k)

These figures suggest that index data compress particularly well (from 370 k overall* uncompressed down to 37 k compressed).

@Doitsu
Additional information: the compressed size of the 1657 index tags was about 7 k, but this figure adds only to the text files, not to the index.

* By "overall", I mean the 364 k of the index.xhtml file plus the 7 k of text data above.
The data left in the window of the Index Editor belong to Sigil.
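
The arithmetic behind these figures, written out as a short Python sketch. The numbers are the ones reported above, in k; the variable names are only for illustration.

```python
# Sizes in k, as reported above.
without_index = {"compressed": 2092, "uncompressed": 2608}
with_index = {"compressed": 2129, "uncompressed": 2978}

added_compressed = with_index["compressed"] - without_index["compressed"]
added_uncompressed = with_index["uncompressed"] - without_index["uncompressed"]

# How strongly the index data shrink when compressed.
index_ratio = added_uncompressed / added_compressed

# For comparison: how strongly the rest of the EPUB shrinks.
rest_ratio = without_index["uncompressed"] / without_index["compressed"]

print(f"index adds {added_uncompressed} k uncompressed, {added_compressed} k compressed")
print(f"index compression ratio:       {index_ratio:.1f} to 1")
print(f"rest of EPUB compression ratio: {rest_ratio:.2f} to 1")
```

The index data shrink about tenfold, while the rest of the EPUB shrinks only by about a quarter.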

Last edited by roger64; 01-03-2016 at 11:36 PM. Reason: data
All times are GMT -4.