02-25-2023, 11:16 AM | #16 |
Addict
Posts: 287
Karma: 2534928
Join Date: Nov 2022
Location: Canada
Device: Kobo Aura 2
|
According to this screenshot, the characters display fine when rendered as HTML, and are only strange in the source files. Is that accurate? If so then I am very curious about the contents of the stylesheets and in particular the rules governing the class "CharOverride-1". Or perhaps the relevant rule is related to the class in the surrounding div, the classname written in Russian which I will not attempt to type out for myself.
|
02-25-2023, 12:15 PM | #17 |
Sigil Developer
Posts: 8,094
Karma: 5450184
Join Date: Nov 2009
Device: many
|
Good point. Perhaps the CodeView font set in Sigil Preferences simply does not support those characters?
|
02-25-2023, 01:04 PM | #18 |
Grand Sorcerer
Posts: 27,901
Karma: 198500000
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
|
That's very possible. Monospace fonts with full Unicode coverage can be hard to come by.
|
02-25-2023, 01:05 PM | #19 | |
Addict
Posts: 287
Karma: 2534928
Join Date: Nov 2022
Location: Canada
Device: Kobo Aura 2
|
Quote:
…that's what I think is going on, but without source files to check for myself, I'm just guessing from that singular Sigil screenshot. |
|
02-25-2023, 02:30 PM | #20 | |
Grand Sorcerer
Posts: 5,635
Karma: 23191067
Join Date: Dec 2010
Device: Kindle PW2
|
Quote:
The OP simply needs to convert the HTML files from CP1251 to UTF-8. |
|
02-25-2023, 11:29 PM | #21 | |
Enthusiast
Posts: 26
Karma: 10
Join Date: Feb 2023
Device: none
|
Quote:
|
|
02-26-2023, 02:19 AM | #22 |
Grand Sorcerer
Posts: 5,635
Karma: 23191067
Join Date: Dec 2010
Device: Kindle PW2
|
My recommendation was based on the text in the screenshot in post 8, and it will definitely work for such files if you follow my instructions. Your screenshot shows this for Глава 1 (= Chapter 1):
Spoiler:
Code:
<p [...]>Ãëàâà 1</p>
After conversion from CP1251 to UTF-8 it'll become:
Code:
<p [...]>Глава 1</p> |
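For anyone who wants to check the example above outside an editor, here is a minimal Python sketch. It assumes the garbled text is CP1251 bytes that were displayed as CP1252 (Western European); the function name `fix_mojibake` is made up for illustration and is not part of Sigil or any plugin.

```python
def fix_mojibake(text: str) -> str:
    """Undo CP1251 text that was mis-read as CP1252 (assumed scenario)."""
    # Encoding as CP1252 recovers the original bytes;
    # decoding those bytes as CP1251 yields the intended Cyrillic.
    return text.encode("cp1252").decode("cp1251")


print(fix_mojibake("<p>Ãëàâà 1</p>"))  # prints <p>Глава 1</p>
```

If the text contains characters that do not exist in CP1252 (for example, Cyrillic that is already correct), `encode` raises `UnicodeEncodeError` instead of silently damaging it.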
02-26-2023, 04:23 AM | #23 |
Sigil Developer
Posts: 8,094
Karma: 5450184
Join Date: Nov 2009
Device: many
|
I truly think Doitsu is correct. His example shows all of the characteristics of your screenshots and having an embedded Win-1251 font explains why Preview displays correctly while CodeView does not. His solution really should work for you.
Your epub is NOT protected. It is just very poorly made, without proper UTF-8 text encoding or Unicode-based fonts. |
02-26-2023, 06:36 AM | #24 | |
Resident Curmudgeon
Posts: 75,860
Karma: 134368292
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
Quote:
|
|
02-26-2023, 06:49 AM | #25 | |
Enthusiast
Posts: 26
Karma: 10
Join Date: Feb 2023
Device: none
|
Quote:
|
|
02-26-2023, 06:53 AM | #26 |
Enthusiast
Posts: 26
Karma: 10
Join Date: Feb 2023
Device: none
|
Here is the original file of the book.
Last edited by DiapDealer; 02-26-2023 at 10:46 AM. |
02-26-2023, 07:59 AM | #27 | |
Enthusiast
Posts: 26
Karma: 10
Join Date: Feb 2023
Device: none
|
Quote:
Did I do everything right? If not, please explain more clearly. Maybe there is a video on YouTube that clearly solves this problem. |
|
02-26-2023, 10:48 AM | #28 |
Grand Sorcerer
Posts: 27,901
Karma: 198500000
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
|
Moderator Notice
Please do not post copyrighted ebooks to MobileRead. There are scrambling plugins that can be used if the structure of entire copyrighted epubs needs to be shared. Last edited by DiapDealer; 02-26-2023 at 10:50 AM. |
02-26-2023, 11:32 AM | #29 |
Evangelist
Posts: 495
Karma: 2267928
Join Date: Nov 2015
Device: none
|
Just extract the book as a single HTML file, then convert it to CP1252, then open it as CP1251 and save it as UTF-8. There are many editors that can do that, including Notepad++ and VSCode.
|
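The same three steps (convert to CP1252, reopen as CP1251, save as UTF-8) can also be scripted. This is only a sketch under the assumption that the extracted HTML file is currently saved as UTF-8 with the garbled Latin characters in it; `repair_file` and the file names are hypothetical.

```python
from pathlib import Path


def repair_file(src: Path, dst: Path) -> None:
    """Rewrite a garbled HTML file as proper UTF-8 (assumed scenario)."""
    # Step 0: read the file as it is now (assumed to be UTF-8).
    garbled = src.read_text(encoding="utf-8")
    # Step 1: "convert it to CP1252" to get the original bytes back.
    raw = garbled.encode("cp1252")
    # Step 2: "open it as CP1251" to interpret those bytes as Cyrillic.
    repaired = raw.decode("cp1251")
    # Step 3: "save as UTF-8".
    dst.write_text(repaired, encoding="utf-8")
```

Usage would look like `repair_file(Path("book.html"), Path("book_fixed.html"))`. Writing to a separate output file keeps the original untouched in case the assumption about the encodings turns out to be wrong.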
02-26-2023, 01:24 PM | #30 |
Zealot
Posts: 114
Karma: 10
Join Date: Sep 2019
Location: Ukraine
Device: Computer, iPad
|
Online converters can convert text fragments, but they report different source encodings.
If you then paste the converted fragment into Sigil, the text is readable in both the Code View window and the Preview window, except that Preview shows it without styles. I think this is because the conversion changes the character codes. For example: before conversion, the character "ñ" has code 241, which is the byte for the Cyrillic letter "с" in CP1251; after conversion, the character "с" has code 1089, which is its Unicode code point (the value that UTF-8 encodes). It is just that converting from CP1251 to UTF-8 gives a deplorable result. |
|
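The codes 241 and 1089 mentioned above can be checked with a few lines of Python. This is just an illustration of the numbers, assuming the "ñ" came from a CP1251 byte shown as CP1252; the variable names are arbitrary.

```python
latin = "ñ"
print(ord(latin))  # 241: the code of "ñ", and the CP1251 byte for Cyrillic "с"

# Same byte, reinterpreted as CP1251 instead of CP1252.
cyrillic = latin.encode("cp1252").decode("cp1251")
print(cyrillic, ord(cyrillic))  # с 1089: the Unicode code point U+0441

# In a UTF-8 file that one character is stored as two bytes.
print(cyrillic.encode("utf-8"))  # b'\xd1\x81'
```

So a different code after conversion is expected: 241 is a single-byte value in the old code page, while 1089 is the Unicode code point of the same letter.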