12-30-2010, 07:10 AM | #1 |
Captain Courageous
Posts: 239
Karma: 102
Join Date: Apr 2009
Device: calibre, PRS 505
|
Those pesky <?> !
I get these "?" characters in place of ' and " in my txt files when they are converted to epubs. I've tried saving these files as different character encoding including plain ascii, UTF-8 and Unicode and I simply can't get rid of them. It makes my epubs look horrible. When I loaded the txt file into word and chose to import using UTF-8 I didn't have the "?" but instead there were no ' and " at all. When I saved it as an rtf and converted that to epub, the ? were back.
Incidentally, I use The command line command ebook-convert exclusively to do my conversions. Could someone help me with this? Thanks, Paul |
12-30-2010, 07:57 AM | #2 |
Grand Sorcerer
Posts: 6,212
Karma: 16534894
Join Date: Sep 2009
Location: UK
Device: Kobo: KA1, ClaraHD, Forma, Libra2, Clara2E. PocketBook: TouchHD3
|
If you can post a section of your TXT file containing some badly-behaving quotes, I'll have a go. A section of the original TXT before you started experimenting would be preferable.
|
Advert | |
|
12-30-2010, 11:24 AM | #3 |
Captain Courageous
Posts: 239
Karma: 102
Join Date: Apr 2009
Device: calibre, PRS 505
|
Ok here's a sample cut & pasted.
?I think he?s only violent with their dog. But if this Carl does want to take a whack at me, that?s okay, ?cause I have you.? ?Me? I?m an architect.? ?Not tonight, sweetie. Tonight, you?re muscle.? Brian had accompanied her on other missions like this, but never previously after midnight to the home of a crazy violent drunk. ?What if I have a testosterone deficiency?? I don't know what good it will do, but there it is Thanks, Paul |
12-30-2010, 12:26 PM | #4 |
Well trained by Cats
Posts: 29,804
Karma: 54830978
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
|
12-30-2010, 12:36 PM | #5 | |
Wizard
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
|
Quote:
|
|
Advert | |
|
12-30-2010, 03:38 PM | #6 |
Captain Courageous
Posts: 239
Karma: 102
Join Date: Apr 2009
Device: calibre, PRS 505
|
OK, Sorry about that here it is. file save was ANSI
Thanks, Paul |
12-30-2010, 03:55 PM | #7 |
Grand Sorcerer
Posts: 6,212
Karma: 16534894
Join Date: Sep 2009
Location: UK
Device: Kobo: KA1, ClaraHD, Forma, Libra2, Clara2E. PocketBook: TouchHD3
|
I could be wrong, but if that is your original source, it looks as if the quotes characters have already been lost.
|
12-30-2010, 08:00 PM | #8 |
Well trained by Cats
Posts: 29,804
Karma: 54830978
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
|
12-30-2010, 08:50 PM | #9 |
Captain Courageous
Posts: 239
Karma: 102
Join Date: Apr 2009
Device: calibre, PRS 505
|
It was html at first. I used ebook-convert to convert it to text. All appeared normal and I converted it then from txt to epub. That when the ? replaced the quote marks. I went back then saved the html page as text. That's when the quote marks were replaced by ? in the text file. The file you saw was from the html saved as text.
Paul |
12-30-2010, 09:19 PM | #10 | |
Well trained by Cats
Posts: 29,804
Karma: 54830978
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Quote:
'Text' is the lower 128 ASCII chart. Convert the Original HTML with CP1252 specified and all should go well. |
|
12-30-2010, 10:04 PM | #11 |
Captain Courageous
Posts: 239
Karma: 102
Join Date: Apr 2009
Device: calibre, PRS 505
|
Is that Western (Windows-1252) in FF under "View->Character Encoding" ?
Thanks, Paul |
12-31-2010, 03:20 AM | #12 | |
US Navy, Retired
Posts: 9,864
Karma: 13806776
Join Date: Feb 2009
Location: North Carolina
Device: Icarus Illumina XL HD, Nexus 7
|
Quote:
Sorry I can't help you with the command line interface. |
|
01-01-2011, 01:23 AM | #13 |
Captain Courageous
Posts: 239
Karma: 102
Join Date: Apr 2009
Device: calibre, PRS 505
|
At last I used --input-encoding "Windows-1252" on the command line to force the encoding to 1252. that was the only thing that would work. Even though FF "said" the encoding was 1252, it wasn't. I will do this from now on when I have an HTML file.
Thanks to all my helpers! Paul |
01-01-2011, 01:30 AM | #14 | |
US Navy, Retired
Posts: 9,864
Karma: 13806776
Join Date: Feb 2009
Location: North Carolina
Device: Icarus Illumina XL HD, Nexus 7
|
Quote:
|
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Pesky PDF to RTF | enarchay | 11 | 06-15-2009 12:40 PM |