View Single Post
Old 12-23-2024, 05:26 PM   #1
Shohreh
Addict
Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.
 
Posts: 207
Karma: 304158
Join Date: Jan 2016
Location: France
Device: none
Question [SOLVED] Making sense of faulty HTML to EPUB conversion

Hello,

I'm using a browser extension to convert HTML pages into EPUB files.

It works fine when the web page is in utf-8 but it doesn't like web pages encoded in iso-8859-1, where accented characters are replaced with question marks — the attached screenshots are when opening the EPUB file in SumatraPDF and Sigil.

To make matters worse, the extension replaces the encoding meta line with "charset="iso-8859-1".

I'd like to understand why accented characters are replaced with question marks. Is it a font issue? Or a problem with byte values?

Thank you.

---
Edit: I should have typed "the extension replaces the encoding meta line with "charset="utf-8"
Attached Thumbnails
Click image for larger version

Name:	145F168B-E62F-4872-91E4-AA56612D6581.png
Views:	174
Size:	35.5 KB
ID:	212622  

Last edited by Shohreh; 12-24-2024 at 08:51 AM.
Shohreh is offline   Reply With Quote