Go Back   MobileRead Forums > E-Book Formats > ePub

Old 02-22-2023, 09:52 AM   #1
KIE18
Enthusiast
KIE18 began at the beginning.
 
Posts: 26
Karma: 10
Join Date: Feb 2023
Device: none
The document opens with symbols

The document opens as unreadable symbols. Is there any way to fix this in Sigil?
It looks about the same as in the picture, only in Sigil.
Attached image: 9cfcbf0188a8da25b857aeefe9b07158.jpeg (221.9 KB)

Last edited by theducks; 02-22-2023 at 10:20 AM. Reason: converted to attachment
Old 02-22-2023, 09:59 AM   #2
KevinH
Sigil Developer
KevinH ought to be getting tired of karma fortunes by now.
 
Posts: 7,469
Karma: 5432724
Join Date: Nov 2009
Device: many
Could this be an incorrect text encoding? It could also be DRM encryption, but there are too many clear-text English phrases for that.

How was this text encoded originally?

Sigil is probably not the correct tool if this is a PDF file.

Last edited by KevinH; 02-22-2023 at 10:07 AM.
Old 02-22-2023, 12:26 PM   #3
Doitsu
Grand Sorcerer
Doitsu ought to be getting tired of karma fortunes by now.

Posts: 5,582
Karma: 22735033
Join Date: Dec 2010
Device: Kindle PW2
Quote:
Originally Posted by KIE18 View Post
The document opens as unreadable symbols. Is there any way to fix this in Sigil?
Looks like Mojibake. You might be able to fix it with ftfy.
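For anyone unsure what mojibake means in practice, here is a minimal sketch using only the Python standard library. It assumes the file holds Windows-1251 (CP1251) bytes and that the viewer wrongly read them as Latin-1. The wrong codec behind the screenshots in this thread is not known, so Latin-1 is only a stand-in, and the sample string is made up. Tools such as ftfy try to automate this kind of guess.

```python
# Illustration only: how Cyrillic text turns into "symbols" (mojibake)
# when Windows-1251 bytes are read with the wrong codec, and how the
# damage can be undone when the wrong codec is known.

original = "Привет, мир"  # made-up sample text

# The file on disk holds Windows-1251 (CP1251) bytes.
raw_bytes = original.encode("cp1251")

# A program that assumes Latin-1 (an assumption for this demo) shows
# accented Latin letters instead of Cyrillic.
garbled = raw_bytes.decode("latin-1")

# Reversing the wrong step recovers the bytes; decoding them with the
# right codec recovers the text.
repaired = garbled.encode("latin-1").decode("cp1251")

print(garbled)
print(repaired)
```

The round trip only works because Latin-1 maps every byte to a character, so nothing is lost on the way. If the text has already been saved after a lossy step, it may not be recoverable this way.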
Old 02-22-2023, 12:42 PM   #4
Sarmat89
Evangelist
Sarmat89 ought to be getting tired of karma fortunes by now.
 
Posts: 482
Karma: 2267928
Join Date: Nov 2015
Device: none
Set the Encoding for the intermediate text: 2nd icon down, "Text" tab. Set it to CP1251.
Old 02-23-2023, 12:56 AM   #5
KIE18
Enthusiast
KIE18 began at the beginning.
 
Posts: 26
Karma: 10
Join Date: Feb 2023
Device: none
Quote:
Originally Posted by KevinH View Post
Could this be an incorrect text encoding? It could also be DRM encryption, but there are too many clear-text English phrases for that.

How was this text encoded originally?

Sigil is probably not the correct tool if this is a PDF file.
And how do I remove this DRM?
Old 02-23-2023, 12:58 AM   #6
KIE18
Enthusiast
KIE18 began at the beginning.
 
Posts: 26
Karma: 10
Join Date: Feb 2023
Device: none
Quote:
Originally Posted by Sarmat89 View Post
Set the Encoding for the intermediate text: 2nd icon down, "Text" tab. Set it to CP1251.
I didn't understand what to do and where to click.
Old 02-23-2023, 01:01 AM   #7
KIE18
Enthusiast
KIE18 began at the beginning.
 
Posts: 26
Karma: 10
Join Date: Feb 2023
Device: none
Quote:
Originally Posted by Doitsu View Post
Looks like Mojibake. You might be able to fix it with ftfy.
It didn't help.
Old 02-23-2023, 01:31 AM   #8
KIE18
Enthusiast
KIE18 began at the beginning.
 
Posts: 26
Karma: 10
Join Date: Feb 2023
Device: none
Quote:
Originally Posted by KevinH View Post
Could this be an incorrect text encoding? It could also be DRM encryption, but there are too many clear-text English phrases for that.

How was this text encoded originally?

Sigil is probably not the correct tool if this is a PDF file.
Finally figured out how to upload a screenshot.
Attached image: 2023-01-29_15-23-08.png (234.0 KB)
Old 02-23-2023, 02:20 AM   #9
User_Z
Connoisseur
User_Z began at the beginning.
 
Posts: 95
Karma: 10
Join Date: Sep 2019
Location: Ukraine
Device: Computer, iPad
Your question is a bit off-topic for this forum.

Try other ways of changing the text encoding or code page.
For example:

- Change the encoding in Notepad. There it is the "Script" setting ("Набор символов" in the Russian interface) in the font options (Notepad is the program in the first screenshot you posted);

- load your text file into Notepad++ and try setting the correct encoding from the "Encoding" menu;

- open the text file for editing in Far Manager and press "Shift+F8" to try setting the correct one there as well.

It looks roughly like this:
Attached images: bloknot.png (20.5 KB), npp.png (74.0 KB), far.png (103.2 KB)
Old 02-23-2023, 04:57 AM   #10
KIE18
Enthusiast
KIE18 began at the beginning.
 
Posts: 26
Karma: 10
Join Date: Feb 2023
Device: none
Quote:
Originally Posted by User_Z View Post
Your question is a bit off-topic for this forum.

Try other ways of changing the text encoding or code page.
For example:

- Change the encoding in Notepad. There it is the "Script" setting ("Набор символов" in the Russian interface) in the font options (Notepad is the program in the first screenshot you posted);

- load your text file into Notepad++ and try setting the correct encoding from the "Encoding" menu;

- open the text file for editing in Far Manager and press "Shift+F8" to try setting the correct one there as well.

It looks roughly like this:
The first two options didn't help.
Old 02-24-2023, 12:34 AM   #11
Doitsu
Grand Sorcerer
Doitsu ought to be getting tired of karma fortunes by now.

Posts: 5,582
Karma: 22735033
Join Date: Dec 2010
Device: Kindle PW2
Your book is encoded as WIN-1251, but epubs need to be encoded as utf-8 or utf-16. You'll need to unzip the epub file and convert all html files to utf-8. The following article might help: Как перекодировать 1251 в UTF-8? ("How do I re-encode 1251 to UTF-8?")
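If you have Python available, the conversion step can be sketched roughly as follows. This is not the tool from the linked article. The function name `convert_tree`, the list of file suffixes, and the folder layout are assumptions made for illustration, and the source encoding is passed explicitly rather than detected.

```python
from pathlib import Path

# Assumption: the extracted EPUB keeps its text in files with these suffixes.
TEXT_SUFFIXES = {".html", ".htm", ".xhtml"}


def convert_tree(root, source="cp1251", target="utf-8"):
    """Re-encode every text file under root from source to target.

    Returns the list of files that were rewritten. Raises
    UnicodeDecodeError if a file is not valid in the source encoding.
    """
    converted = []
    for path in sorted(Path(root).rglob("*")):
        if not path.is_file() or path.suffix.lower() not in TEXT_SUFFIXES:
            continue
        data = path.read_bytes()
        try:
            # Files that already decode cleanly as the target are skipped,
            # so running the function twice does not garble them.
            data.decode(target)
            continue
        except UnicodeDecodeError:
            pass
        text = data.decode(source)
        path.write_bytes(text.encode(target))
        converted.append(path)
    return converted
```

A call such as `convert_tree("extracted_book")` would rewrite the files in place, so it is worth working on a copy of the extracted folder. The folder name here is hypothetical.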
Old 02-24-2023, 05:01 AM   #12
KIE18
Enthusiast
KIE18 began at the beginning.
 
Posts: 26
Karma: 10
Join Date: Feb 2023
Device: none
Quote:
Originally Posted by Doitsu View Post
Your book is encoded as WIN-1251, but epubs need to be encoded as utf-8 or utf-16. You'll need to unzip the epub file and convert all html files to utf-8. The following article might help: Как перекодировать 1251 в UTF-8? ("How do I re-encode 1251 to UTF-8?")
Using the program recommended in the article, I selected the folder with the file and clicked "start", but the file remained the same. Maybe that's because the epub needs to be unzipped somehow? How do I do that?
Old 02-24-2023, 06:45 AM   #13
Doitsu
Grand Sorcerer
Doitsu ought to be getting tired of karma fortunes by now.

Posts: 5,582
Karma: 22735033
Join Date: Dec 2010
Device: Kindle PW2
Quote:
Originally Posted by KIE18 View Post
Maybe that's because the epub needs to be unzipped somehow? How do I do that?
Simply rename the .epub file to .zip and extract all files. After the conversion you'll need to zip it up again and rename it to .epub.
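The same unzip and re-zip round trip can also be done without renaming anything, since Python's `zipfile` module reads an .epub directly. The sketch below is one possible way to do it. The function names `unpack_epub` and `repack_epub` are made up for this example. The one detail worth knowing is that the EPUB container format expects the `mimetype` file to be the first entry in the archive and to be stored uncompressed.

```python
import zipfile
from pathlib import Path


def unpack_epub(epub_path, out_dir):
    """Extract every file in the EPUB (a ZIP archive) into out_dir."""
    with zipfile.ZipFile(epub_path) as archive:
        archive.extractall(out_dir)


def repack_epub(src_dir, epub_path):
    """Zip src_dir back into an EPUB.

    The "mimetype" file is written first and without compression, as the
    EPUB container format expects; everything else is deflated.
    """
    src = Path(src_dir)
    with zipfile.ZipFile(epub_path, "w") as archive:
        mimetype = src / "mimetype"
        if mimetype.is_file():
            archive.write(mimetype, "mimetype", compress_type=zipfile.ZIP_STORED)
        for path in sorted(src.rglob("*")):
            if path.is_file() and path != mimetype:
                archive.write(
                    path,
                    path.relative_to(src).as_posix(),
                    compress_type=zipfile.ZIP_DEFLATED,
                )
```

Write the new .epub somewhere outside the folder being zipped, otherwise the half-written archive may end up inside itself.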
Old 02-25-2023, 09:08 AM   #14
KIE18
Enthusiast
KIE18 began at the beginning.
 
Posts: 26
Karma: 10
Join Date: Feb 2023
Device: none
Quote:
Originally Posted by Doitsu View Post
Simply rename the .epub file to .zip and extract all files. After the conversion you'll need to zip it up again and rename it to .epub.
It didn't work out. Everything remains the same. Before that, the file was also in UTF-8.
Attached images: 2023 до.png (146.3 KB), 2023-02-25_16-44-36.png (133.0 KB)
Old 02-25-2023, 10:21 AM   #15
Doitsu
Grand Sorcerer
Doitsu ought to be getting tired of karma fortunes by now.

Posts: 5,582
Karma: 22735033
Join Date: Dec 2010
Device: Kindle PW2
Quote:
Originally Posted by KIE18 View Post
It didn't work out. Everything remains the same. Before that, the file was also in UTF-8.
In that case the automatic codepage detection of the converter failed because the original files contained the following declaration:
Code:
<?xml version="1.0" encoding="utf-8" standalone="no"?>
As User_Z has already suggested, you could use Notepad++ to fix the encoding:
  • Open the file with it (Notepad++ should detect the encoding as Cyrillic > Macintosh or Cyrillic > Win-1251. If it doesn't, select Win-1251.)
  • Press CTRL+A to select all text, then press CTRL+C to copy the text to the clipboard.
  • Close the original file.
  • Select File > New and press CTRL+V to paste the clipboard contents into the new file.
  • Save the new file under the same name as the original file.
This will definitely work. However, since the epub contains multiple files, you might want to search the Russian Internet for batch converters that support Cyrillic encodings and let you manually select the input and output encodings.
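The mismatch described above, a declaration that says utf-8 on top of bytes that are really something else, can also be checked for directly. The following is only a rough sketch with made-up helper names (`declared_encoding`, `declaration_is_honest`). It assumes the XML declaration sits at the very start of the file with no byte order mark in front of it.

```python
import re

# Matches the encoding attribute of an XML declaration such as
# <?xml version="1.0" encoding="utf-8" standalone="no"?>
DECLARATION = re.compile(rb'<\?xml[^>]*?encoding=["\']([A-Za-z0-9._-]+)["\']')


def declared_encoding(data):
    """Return the encoding named in the XML declaration, or None."""
    match = DECLARATION.match(data)
    return match.group(1).decode("ascii").lower() if match else None


def declaration_is_honest(data):
    """True if the bytes really decode under the declared encoding.

    Files without a declaration are checked against utf-8.
    """
    name = declared_encoding(data) or "utf-8"
    try:
        data.decode(name)
    except (UnicodeDecodeError, LookupError):
        return False
    return True
```

A `False` result means the declaration cannot be trusted and the source encoding has to be chosen by hand. A `True` result proves less: text that was garbled and then saved as UTF-8 still decodes without errors.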
All times are GMT -4. The time now is 07:52 AM.

