![]() |
#46 | |
Resident Curmudgeon
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 79,771
Karma: 145864619
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
Quote:
The reason publishers have gotten away with incorrect encoding is because ADE forced the ePub to be rendered with UTF-8. Other programs that read ePub might not do so and they would have an incorrectly displayed ePub. I've also seen some cases where the wrong encoding causes the ePub to display in bold where it should not have. This was on a Sony Reader 505 using an old version RMDSK (ADE). So no, do not thing that the wrong encoding is OK just because whatever software you are seeing the eBook in looks like it should. Not every program will handle the wrong encoding like it was the correct encoding, |
|
![]() |
![]() |
![]() |
#47 |
Karmaniac
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 2,553
Karma: 11499146
Join Date: Oct 2008
Location: Miami FL
Device: PRS-505, Jetbook, + Mini, +Color, Astak Ez Reader Pro, PPW1, Aura H2O
|
When I did a search online, I did see how utf-8 is really taking over since the Millennium on web pages.
Seems like IBM introduced a lot of the coding work with ISO-8859-1, and the decline of it, and rise of UTF-8 seems to be linked with the decline of IBM, and rise of microsoft and apple, finally turning the tables with android being introduced to the market. The numbers/stats correlation might be coincidental, meaning it might be not directly related to IBM, but is still noteworthy that the newer, non-industrial companies, providing software for the home, were introduced at a time ISO-8859-1 was in decline. Last edited by ProDigit; 12-21-2015 at 07:27 AM. |
![]() |
![]() |
Advert | |
|
![]() |
#48 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 4,520
Karma: 121692313
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-T1, Kobo Touch, Kobo Aura
|
It has nothing to do with IBM, but with internationalization. More and more websites and applications (don't even mention databases) need to be able to handle multiple languages and characters. The old ISO pages cannot handle that due to restrictions. They invented all kind of tricks in the beginning, creating nightmares (I have seen enough). The adoptation of Unicode solved a lot, but it was a slow process. Funny that you mention Microsoft, since they were actually quite late to the party with regards to applications and Windows.
Quite frankly there is no reason at all at the moment not to use UTF. |
![]() |
![]() |
![]() |
#49 | |
Curmudgeon
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 629
Karma: 1623086
Join Date: Jan 2012
Device: iPad, iPhone, Nook Simple Touch
|
Quote:
Because of that limitation, different language families required different encodings. You couldn't, for example, have French text and Greek text in the same document. Also, you couldn't support languages like Chinese at all, because there are many thousands of Chinese characters in common use. Worse, you couldn't determine just by looking at a file whether it was ISO-8859-1, ISO-8859-2, ISO-8859-16, etc., so you had to externally specify the character set in some way, or else French could turn into a gibberish Greek-English hybrid. In the case of HTML, they specified the character set in a meta tag, which meant that the browser would read the file up to that point, say "Oh, crap, I'm using the wrong encoding", then reread the entire file using the right encoding. It was an absolute mess, and I'm being generous here. The Unicode standard fixed all of this by supporting over a million possible characters. As a result, you can use a single character set to represent text in every language on Earth. With Unicode, you can have Chinese text, English text, French text, and Russian text on a single web page. UTF-8 is the most common way to encode Unicode text, because English characters, numbers, and punctuation are encoded using a single byte, making English UTF-8 content fully backwards-compatible with software that supports traditional 7-bit ASCII (and ISO-8859-*). Last edited by dgatwood; 12-21-2015 at 08:49 PM. |
|
![]() |
![]() |
![]() |
#50 |
temp. out of service
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 2,815
Karma: 24285242
Join Date: May 2010
Location: Duisburg (DE)
Device: PB 623
|
Not to forget codepage workarounds like mazovia encoding because the codepage for Polish had the block graphic symbols screwed up.
So yes. The pre-UTF times were definitely absolutely not funny. First, as soon as you linguistically left western Europe things tended to became ugly or needlessly complicated. Second, a working multilingual setup was something to dream about. |
![]() |
![]() |
Advert | |
|
![]() |
#51 |
Karmaniac
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 2,553
Karma: 11499146
Join Date: Oct 2008
Location: Miami FL
Device: PRS-505, Jetbook, + Mini, +Color, Astak Ez Reader Pro, PPW1, Aura H2O
|
The question is, if you really need it?
Without looking at any statistics, I would say English books make up of over 50% of the world's books, most of them coming from USA, but also some from Australia, Great Britain, and other parts of the world. Again, without seeing any real statistics, I'd guess that 99% of these books don't need to use anything above the ISO-8859 standard. I'm trying to think about how many books have I read before, that used more than one language (except perhaps an occasional short sentence paraphrased)? Almost none, but I do see the potential for Latin writers writing in English, and having some "Spanglish" moments in their writing. I'm not trying to advocate for ISO-8859-1, but just mentioning that some people can make a big deal out of using UTF-8, when in reality, the book could have had the ISO-5889 coding and still looked exactly the same. But it is what it is... Big corporations determine what we need, even if we don't, and the flock just has to follow along... Kind of reminds me of my cellphone. Android decided that those phones really needed 64 bit, and a processor in it, so the phone barely lasts one day on a charge, while I still could think most apps and browsing on mobile, could still be done on 16 bit operating systems (32bit already was an unnecessary overhead). |
![]() |
![]() |
![]() |
#52 | |
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 11,470
Karma: 13095790
Join Date: Aug 2007
Location: Grass Valley, CA
Device: EB 1150, EZ Reader, Literati, iPad 2 & Air 2, iPhone 7
|
Quote:
Dale |
|
![]() |
![]() |
![]() |
#53 | |
Ex-Helpdesk Junkie
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 19,421
Karma: 85400180
Join Date: Nov 2012
Location: The Beaten Path, USA, Roundworld, This Side of Infinity
Device: Kindle Touch fw5.3.7 (Wifi only)
|
Quote:
ISO-8859-1 and UTF-8 are pretty interchangeable, if all you are using is ASCII... So given that there is no downside whatsoever to using UTF-8, it is very difficult to know why anyone would want to avoid it. And at the end of the day, that is why people "can make a big deal out of using UTF-8". |
|
![]() |
![]() |
![]() |
#54 |
Karmaniac
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 2,553
Karma: 11499146
Join Date: Oct 2008
Location: Miami FL
Device: PRS-505, Jetbook, + Mini, +Color, Astak Ez Reader Pro, PPW1, Aura H2O
|
I wasn't looking forward to learn CSS, but now I think I'll make my epub in a program, and just modify (trim) the files inside the epub, to make it smaller, and repack it.
Anyone has a suggestion on a very basic epub creator program? I have downloaded Sigil, but wanted to know if there may be simpler versions around? |
![]() |
![]() |
![]() |
#55 |
Bookaholic
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 14,391
Karma: 54969924
Join Date: Oct 2007
Location: Minnesota
Device: iPad Mini 4, AuraHD, iPhone XR +
|
|
![]() |
![]() |
![]() |
#56 |
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 13,533
Karma: 78910202
Join Date: Nov 2007
Location: Toronto
Device: Libra H2O, Libra Colour
|
And remember if you use a tool like Sigil (or calibre editor) the CSS will just contain what you put in it; no more, no less.
|
![]() |
![]() |
![]() |
#57 | |
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 5,729
Karma: 24031401
Join Date: Dec 2010
Device: Kindle PW2
|
Quote:
Both Sigil and Calibre Editor also support running Python scripts. I.e., you could write your own Python script to automatically run additional cleanup tasks. |
|
![]() |
![]() |
![]() |
#58 |
Ex-Helpdesk Junkie
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 19,421
Karma: 85400180
Join Date: Nov 2012
Location: The Beaten Path, USA, Roundworld, This Side of Infinity
Device: Kindle Touch fw5.3.7 (Wifi only)
|
calibre has a new builtin feature to losslessly optimize images using jpegtran or optipng, available from either the Editor or Polish book. And has long offered a font subsetting feature.
So it isn't just "Sigil or extra standalone programs". ![]() |
![]() |
![]() |
![]() |
#59 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 4,520
Karma: 121692313
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-T1, Kobo Touch, Kobo Aura
|
Please be aware that a lot of people, myself included, do not like the UI (and other things) of the Calibre Editor. It is all personal preference.
|
![]() |
![]() |
![]() |
#60 |
Hmm.
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 124
Karma: 2016606
Join Date: Oct 2015
Device: Android 4.2 Google Play Reader
|
|
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Kindle not starting on start | warenjc | Amazon Kindle | 3 | 07-18-2013 12:08 PM |
Kindle not starting on start | warenjc | Kindle Formats | 6 | 07-17-2013 12:48 PM |
Kindle preview starting at #start in all formats but Fire | petercrowell | Kindle Formats | 6 | 05-09-2012 09:20 PM |
Kindle preview not starting at #start on all formats | petercrowell | Kindle Formats | 3 | 05-03-2012 09:58 AM |
PRS-T1 New reader start up questions | amyb | Sony Reader | 9 | 10-24-2011 03:54 PM |