![]() |
Elipsis character displays in code view but not in book view
I'm trying to fix a book that contains many sentences that have no closing punctuation -- in this case an elipsis. These sentences all end a paragraph, and in book view there is no elipsis visible. When I look at the sentences in code view, though, there is a three-dot character that looks like an elipsis, but it's not the exact same elipsis you get when you click special characters. There's more space between the special character elipsis.
I thought it would be a simple fix to have Sigil search for the rouge elipses and replace them with the special character elipses. But when I copy the rogue elipsis out of code view and do a search for them, Sigil doesn't find them. Of course when I try to paste the code into this forum post, the rouge character also doesn't appear, which I guess is why the character doesn't appear in Sigil's book view. The code looks like this: <p class="calibre6">"But I would not be able to get another job, with no academic record, no past work history"</p> In code view, there are three little dots after the word history. The sentences in question aren't broken paragraphs, though. They're supposed to end with an elipsis because the speaker has been interrupted by another person in the book, and his quote follows. If anyone knows what the elipsis-looking characters are and how to replace them, I'd be most grateful. Thanks! |
Quote:
& hellip; or & #8230; (no space in either) |
Quote:
& hellip; and & #8230 are html codes for an ellipsis, correct? |
Quote:
(and they usually show) No fonts in the font folder just means the fonts are not EMBEDED The line in the stylesheet font-family: "Times New Roman", serif; Anything other than serif or sans-serif indicates a specific font face call When the font is not embeded, it is supplied by the system. If the system does not have it, the generic alternate is used (serif or sans-serif part) |
I would be amazed if there is a standard font on a reader that does not have the ellips on board. What you can do is extract the XTHML and open it in a hex editor to check the actual (hex)code of the ellips. That should help in solving this.
|
Quote:
What now? Thanks! |
Quote:
|
Right click on the document in the Book Browser on the left. Select 'Save as'. There are various free hex editors out there. I used XVI32 in the past.
|
Quote:
tell me—" And the corresponding hex numbers: 74 65 6C 6C 20 6D 65 C2 97 22 What now? Thanks! |
Quote:
that & Agrave; should have been a & rdquo; |
Quote:
Hitch |
Quote:
Hitch caught me again :o |
Quote:
|
Oops! I posted the above reply before I saw the latest from Hitch and theducks. I'm still lost, though.
|
Quote:
But Hitch may be closer (She does do this for a living), with the idea that this is a Wordperfect artifact. Did she get that correct? (WP had it's own way of encoding some characters) |
Hmm, it might indeed be a non-unicode conversion issue. What is the source and how did you make an ePUB from it?
|
Quote:
@magmanpi: You say you're "fixing" this book? Presumably for someone? Who's going to, what, publish this? And you're using Calibre to fix it? Hitch |
Quote:
As you suggested, I opened the book in a hex editor, which allowed me to successfully do a search and replace for the circumflex characters. After correcting the errors -- always a missing ellipsis or emdash that caused the words on each side of it to run together, I copied and pasted the corrected file back into Sigil and deleted the original file. The book appears to read fine now. I'm still not sure what caused the rogue characters to appear in the first place, but at least now I have a readable book and I'll know how to fix the problem if it occurs in the future. Thanks, everyone, for all the help! :thanks: |
Quote:
Quote:
Moreover, there's really no reason for this to occur "again in the future" once you understand what caused it, and what you need to do to prevent that from happening. Which might motivate you to tell us what that source was, so someone here can tell you how to get around the issue of all of it appearing in the first place. Particularly if, as I infer from your penultimate paragraph, you're planning on cleaning or fixing or making ePUBs as an ongoing concern. Hitch |
Yup, probably the export did not specify to use UTF-8. I know for my add-in that I do that very specific to avoid issues.
|
| All times are GMT -4. The time now is 10:08 PM. |
Powered by: vBulletin
Copyright ©2000 - 3.8.5, Jelsoft Enterprises Ltd.
MobileRead.com is a privately owned, operated and funded community.