Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Calibre > Conversion

Notices

Reply
 
Thread Tools Search this Thread
Old 03-07-2011, 11:08 PM   #16
ldolse
Wizard
ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.
 
Posts: 1,337
Karma: 123455
Join Date: Apr 2009
Location: Malaysia
Device: PRS-650, iPhone
Quote:
Originally Posted by user_none View Post
It's already supposed to...
Bug then? I did a bit of googling and it seems like Python has some quirks in it's handling of BOMs across different unicode encodings. I Didn't look very hard though.

Last edited by ldolse; 03-08-2011 at 02:15 AM.
ldolse is offline   Reply With Quote
Old 03-08-2011, 02:12 AM   #17
getajob
Junior Member
getajob began at the beginning.
 
Posts: 7
Karma: 10
Join Date: Oct 2010
Location: Australia
Device: Kindle 3, iPhone 3G, iPad 2 (on order)
Quote:
Originally Posted by ldolse View Post
You're mostly correct except for one part of your solution:

UTF-8 is unicode...
Fair point. Perhaps I should have said:

If you have to use Unicode as your 'Save as' option in WordPad, then set Input character encoding to UTF-16

It was not intuitive to me that if I select the 'Unicode' option in the Save As dialog of WordPad then that meant that I needed to set the Input character encoding to UTF-16. UTF-16 doesn't even appear on the dropdown as an option...
Quote:
so there is no need to use UTF-16 ever.
Probably true.

Regarding both 'ANSI' and 'Unicode', I was actually referring to labels that Microsoft has chosen to use with both WordPad and Notepad, nothing more... I was trying to assist anyone that might fall into the trap that I just fell into.

Thanks everyone who helped me - I learnt a lot.
getajob is offline   Reply With Quote
Advert
Old 03-08-2011, 06:56 AM   #18
user_none
Sigil & calibre developer
user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.
 
user_none's Avatar
 
Posts: 2,487
Karma: 1063785
Join Date: Jan 2009
Location: Florida, USA
Device: Nook STR
Quote:
Originally Posted by ldolse View Post
Bug then? I did a bit of googling and it seems like Python has some quirks in it's handling of BOMs across different unicode encodings. I Didn't look very hard though.
Then it's in chardet which TXT input uses for detecting the encoding. I haven't had time but I need to look at the input file to see if it actually has a BOM. chardet will only detect UTF-16 if it has a valid BE or LE BOM.
user_none is offline   Reply With Quote
Old 03-09-2011, 07:34 AM   #19
getajob
Junior Member
getajob began at the beginning.
 
Posts: 7
Karma: 10
Join Date: Oct 2010
Location: Australia
Device: Kindle 3, iPhone 3G, iPad 2 (on order)
Quote:
Originally Posted by getajob View Post
WordPad has four 'Save As' options: ANSI, Unicode, Unicode big endian and UTF-8. Notepad is the same.
I have no explanation for this but yesterday WordPad had an Encoding dropdown option in the Save As exactly the same as Notepad - but not today.

Notepad still allows me to Save As with Encoding options of ANSI, Unicode, Unicode big endian and UTF-8 but WordPad will no longer co-operate at all.

Damn MS rubbish!

I have switched to Notepad++ - it has much better encoding control.
getajob is offline   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Word Processing on the Kindle 3 cow_trix Amazon Kindle 41 05-17-2011 03:22 AM
Textile conversion broken in 7.45 Perkin Conversion 7 02-12-2011 06:36 PM
New edition of The Textile Planet; read chapter one for free [see post #14] suelange Self-Promotions by Authors and Publishers 14 09-29-2010 10:33 AM
Comic File Processing wonderboy Other formats 1 08-08-2009 04:17 AM
Perl processing alexxxm Sony Reader 3 11-26-2007 06:13 AM


All times are GMT -4. The time now is 01:26 PM.


MobileRead.com is a privately owned, operated and funded community.