![]() |
#1 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,184
Karma: 32196
Join Date: Jan 2007
Location: Anchorage, AK
Device: Sony Reader PRS-505, PRS-650, PRS-T3, Pocketbook HD2
|
0.5.14 apostrophe's missing...
I converted a document to LRF just after updating to .5.14 and I noticed in the LRF that none of the apostrophe's carried over. I checked my html document and the apostrophe's aren't using any weird codes. they look like regular apostrophes.
I wasn't sure if I was the only one noticing this problem? edit: checked another file I converted with .5.13 and some of the apostrophe's are missing from it as well. ![]() Last edited by Amalthia; 06-22-2009 at 05:52 PM. |
![]() |
![]() |
![]() |
#2 |
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 644
Karma: 1242364
Join Date: May 2009
Location: The Right Coast
Device: PC (Calibre), Nexus 7 2013 (Moon+ Pro), HTC HD2/Leo (Freda)
|
My guess would be that LRF uses the apostrophe in a specific manner. Either the conversion process became confused (a missing closing apostrophe for instance) or the original file was not specifically designed for LRF output - so the apostrophes were handled incorrectly. Some ebook formats require that special characters use a specific sequence of characters or special coding.
|
![]() |
![]() |
Advert | |
|
![]() |
#3 | |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,184
Karma: 32196
Join Date: Jan 2007
Location: Anchorage, AK
Device: Sony Reader PRS-505, PRS-650, PRS-T3, Pocketbook HD2
|
Quote:
|
|
![]() |
![]() |
![]() |
#4 |
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,176
Karma: 27110894
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
It's most likely caused by the apostrophe being a smart quote
|
![]() |
![]() |
![]() |
#5 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,184
Karma: 32196
Join Date: Jan 2007
Location: Anchorage, AK
Device: Sony Reader PRS-505, PRS-650, PRS-T3, Pocketbook HD2
|
|
![]() |
![]() |
Advert | |
|
![]() |
#6 |
Provocateur
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,859
Karma: 505847
Join Date: Feb 2009
Location: Columbus, OH
Device: Kindle Touch, Kindle 2, Kindle DX, iPhone 3GS
|
If you're using the latest Word, it's Office Button, Word Options, Proofing, AutoCorrect Options. Then uncheck everything in the AutoCorrect and AutoCorrect as you type tabs, specifically, the "straight quotes" with "smart quotes" repalcement option.
|
![]() |
![]() |
![]() |
#7 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,184
Karma: 32196
Join Date: Jan 2007
Location: Anchorage, AK
Device: Sony Reader PRS-505, PRS-650, PRS-T3, Pocketbook HD2
|
thanks! will implement this correction so this doesn't happen again!
|
![]() |
![]() |
![]() |
#8 |
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 875
Karma: 2676800
Join Date: Aug 2008
Location: Taranaki - NZ
Device: Kobo Aura H2O, Kobo Forma
|
Try opening your html in a text editor (not word!!) and changing/converting it to utf-8 instead of ANSI.
|
![]() |
![]() |
![]() |
#9 | |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,184
Karma: 32196
Join Date: Jan 2007
Location: Anchorage, AK
Device: Sony Reader PRS-505, PRS-650, PRS-T3, Pocketbook HD2
|
Quote:
it's the weirdest thing. I'm thinking I had to have changed some setting in Wordpad (but wordpad doens't have many settings to mess up!) I do almost all my html coding in word pad. Though sometimes i open it in Word and do the replace all stuff (am thinking of moving to Notepad++ instead) This is driving me a bit nuts now. the code works for one format conversion but not another and I'm not sure at what stage of the process the commas and dashes got converted to smart quotes or how to remove them from a document and switch them out with regular text. |
|
![]() |
![]() |
![]() |
#10 |
Resident Curmudgeon
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 79,051
Karma: 144284074
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
bump!
|
![]() |
![]() |
![]() |
#11 |
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,176
Karma: 27110894
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
I need the HTML to debug this.
|
![]() |
![]() |
![]() |
#12 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,184
Karma: 32196
Join Date: Jan 2007
Location: Anchorage, AK
Device: Sony Reader PRS-505, PRS-650, PRS-T3, Pocketbook HD2
|
I think what had happened is that the doc file I original got had smart quotes, commas, dashes all over the place. I ended up going back to the original HTML postings of the story and merging it together. in my own Word program and I didn't have any problems from that.
I can try and attach the original file she sent to me if you think that would help? All I did with that is convert to html and load it to Calibre. though I'm now wondering if it's a unicode issue? Because when i've run across HTML pages that have blocks when I open I open it in Word, all I have to do is reopen the file in Notepad save as UTF-8 and then when I re-open the file in Word all the blocks are gone. |
![]() |
![]() |
![]() |
#13 |
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,176
Karma: 27110894
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Certainly sounds lke a unicode issue.
|
![]() |
![]() |
![]() |
#14 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,184
Karma: 32196
Join Date: Jan 2007
Location: Anchorage, AK
Device: Sony Reader PRS-505, PRS-650, PRS-T3, Pocketbook HD2
|
I think the annoying part is I'm not sure how to check the unicode before I convert to html and load to Calibre. I really have to go through the file more carefully now because otherwise I'd miss the missing apostrophe's and the random blocks. Basically, this just threw me for a loop. i've been creating LRF files for some time now and it's only recently that I've had unicode problems pop up but I'm not sure why it's happening now instead of sooner. (I'm going to reformat my computer this upcoming weekend...I'll be starting off with a clean slate...so maybe this won't happen again)
|
![]() |
![]() |
![]() |
#15 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,184
Karma: 32196
Join Date: Jan 2007
Location: Anchorage, AK
Device: Sony Reader PRS-505, PRS-650, PRS-T3, Pocketbook HD2
|
Update: I think I may have figured out what I was doing wrong.
In most HTML conversions this line is added. "<meta http-equiv="Content-Type" content="text/html; charset=UTF-8" />" However, I was just deleting all the metadata and I think for some documents that wasn't a good thing to do? Not sure if this theory will pan out or is correct but I think for the time being I'll make sure I don't delete that piece of metadata code. I only started deleting the metadata recently which is pretty much when I started to notice that I was missing apostrophes and getting blocks instead of commas. |
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
PRS-600 Am I just missing something here?! | linzylou63 | Sony Reader | 0 | 09-03-2010 02:44 PM |
Am I missing something here? | SavalBork | Alternative Devices | 3 | 08-27-2010 09:17 PM |
Um am I missing something? | hpjrt | Kobo Reader | 5 | 08-12-2010 12:27 AM |
Hello, this is what I've been missing. | Elimad | Introduce Yourself | 10 | 06-24-2010 08:06 PM |
Missing covers, missing content. Getting worse with each sync. | Mememememe | Kobo Reader | 7 | 06-16-2010 09:02 AM |