View Single Post
Old 06-16-2010, 10:10 PM   #17
DoctorOhh
US Navy, Retired
DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.
 
DoctorOhh's Avatar
 
Posts: 8,863
Karma: 12755553
Join Date: Feb 2009
Location: North Carolina
Device: Nexus 7
Quote:
Originally Posted by rheostaticsfan View Post
I bought the book then ran it through ereader2html then input the html into calibre. The output of ereader2html looks fine, but when I click V in calibre it shows me a winzip folder (it imported as zip). If I open the book file in there the em dashes and apostrophes are replaced with squares.
You have to set the input encoding prior to adding the html file to calibre. Once you see little squares you know to remove the html set a different encoding and try adding the html back to calibre.

Quote:
Originally Posted by rheostaticsfan View Post
Quote:
Originally Posted by kovidgoyal View Post
http://calibre-ebook.com/user_manual/faq.html#id15
I saw that. It seems to tell me to input the proper encoding. But that's how I started the thread. How do I know what the proper encoding is?

Or is there something I'm missing???
Kovid is pointing you to the area of the manual that tells you to set the encoding prior to adding the html to calibre.

Quote:
2. When adding HTML files to calibre, you may need to tell calibre what encoding the files are in. To do this go to Preferences->Plugins->File Type plugins and customize the HTML2Zip plugin, telling it what encoding your HTML files are in. Now when you add HTML files to calibre they will be correctly processed. HTML files from different sources often have different encodings, so you may have to change this setting repeatedly. A common encoding for many files from the web is cp1252 and I would suggest you try that first. Note that when converting HTML files, leave the input encoding setting mentioned above blank. This is because the HTML2ZIP plugin automatically converts the HTML files to a standard encoding (utf-8).

Last edited by DoctorOhh; 06-16-2010 at 10:14 PM.
DoctorOhh is offline   Reply With Quote