Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Sigil

Notices

Reply
 
Thread Tools Search this Thread
Old 03-01-2014, 01:55 AM   #16
Toxaris
Wizard
Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.
 
Toxaris's Avatar
 
Posts: 4,520
Karma: 121692313
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-T1, Kobo Touch, Kobo Aura
Hmm, it might indeed be a non-unicode conversion issue. What is the source and how did you make an ePUB from it?
Toxaris is offline   Reply With Quote
Old 03-01-2014, 03:36 AM   #17
Hitch
Bookmaker & Cat Slave
Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.
 
Hitch's Avatar
 
Posts: 11,503
Karma: 158448243
Join Date: Apr 2010
Location: Phoenix, AZ
Device: K2, iPad, KFire, PPW, Voyage, NookColor. 2 Droid, Oasis, Boox Note2
Quote:
Originally Posted by Toxaris View Post
Hmm, it might indeed be a non-unicode conversion issue. What is the source and how did you make an ePUB from it?
I'm sure that's exactly what it is, but it feels like Wordperfect to me. Possibly a Pages-->Doc conversion, but WP feels right. (Or...hell, it could be something out of one of those crappy "save your PDF to Word!" websites).

@magmanpi:

You say you're "fixing" this book? Presumably for someone? Who's going to, what, publish this? And you're using Calibre to fix it?

Hitch
Hitch is offline   Reply With Quote
Advert
Old 03-02-2014, 09:47 PM   #18
magmanpi
Enthusiast
magmanpi can extract oil from cheesemagmanpi can extract oil from cheesemagmanpi can extract oil from cheesemagmanpi can extract oil from cheesemagmanpi can extract oil from cheesemagmanpi can extract oil from cheesemagmanpi can extract oil from cheesemagmanpi can extract oil from cheese
 
Posts: 30
Karma: 1000
Join Date: Nov 2012
Device: none
Quote:
Originally Posted by Toxaris View Post
Hmm, it might indeed be a non-unicode conversion issue. What is the source and how did you make an ePUB from it?
The source is an html file that I converted to ePub with Calibre. The conversion is pretty good except for the multitude of run-together words apparently caused by the circumflex characters that are visible only in Sigil's code view and not book view. But even though the circumflex characters are visible in code view, Sigil doesn't find them when I copy them into Sigil's search field.

As you suggested, I opened the book in a hex editor, which allowed me to successfully do a search and replace for the circumflex characters. After correcting the errors -- always a missing ellipsis or emdash that caused the words on each side of it to run together, I copied and pasted the corrected file back into Sigil and deleted the original file. The book appears to read fine now.

I'm still not sure what caused the rogue characters to appear in the first place, but at least now I have a readable book and I'll know how to fix the problem if it occurs in the future.

Thanks, everyone, for all the help!
magmanpi is offline   Reply With Quote
Old 03-03-2014, 04:24 AM   #19
Hitch
Bookmaker & Cat Slave
Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.
 
Hitch's Avatar
 
Posts: 11,503
Karma: 158448243
Join Date: Apr 2010
Location: Phoenix, AZ
Device: K2, iPad, KFire, PPW, Voyage, NookColor. 2 Droid, Oasis, Boox Note2
Quote:
Originally Posted by magmanpi View Post
The source is an html file that I converted to ePub with Calibre. The conversion is pretty good except for the multitude of run-together words apparently caused by the circumflex characters that are visible only in Sigil's code view and not book view. But even though the circumflex characters are visible in code view, Sigil doesn't find them when I copy them into Sigil's search field.
Yes, but what we re all asking is, "HTML file made from WHAT, and how?" An HTML file is (generally) the output of a program--Word, wordperfect, Pages, AbbyyFineReader, etc. Do you have any idea what the source was, just out of curiosity?

Quote:
As you suggested, I opened the book in a hex editor, which allowed me to successfully do a search and replace for the circumflex characters. After correcting the errors -- always a missing ellipsis or emdash that caused the words on each side of it to run together, I copied and pasted the corrected file back into Sigil and deleted the original file. The book appears to read fine now.

I'm still not sure what caused the rogue characters to appear in the first place, but at least now I have a readable book and I'll know how to fix the problem if it occurs in the future.

Thanks, everyone, for all the help!
The conversion from word-processing file (or scanned file, etc.) to HTML, is the likely cause, and some lack of attention to the file encoding when it was subsequently uploaded to Sigil is what caused it. We'd all like to know what your source file was-at least, I would--just because that's the type of stuff we like to know.

Moreover, there's really no reason for this to occur "again in the future" once you understand what caused it, and what you need to do to prevent that from happening. Which might motivate you to tell us what that source was, so someone here can tell you how to get around the issue of all of it appearing in the first place. Particularly if, as I infer from your penultimate paragraph, you're planning on cleaning or fixing or making ePUBs as an ongoing concern.

Hitch
Hitch is offline   Reply With Quote
Old 03-03-2014, 04:45 AM   #20
Toxaris
Wizard
Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.
 
Toxaris's Avatar
 
Posts: 4,520
Karma: 121692313
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-T1, Kobo Touch, Kobo Aura
Yup, probably the export did not specify to use UTF-8. I know for my add-in that I do that very specific to avoid issues.
Toxaris is offline   Reply With Quote
Advert
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Enter text in Book View or Code View ronaldl Sigil 5 10-29-2012 03:12 PM
replace in book view changes view to code view cybmole Sigil 4 10-28-2012 01:20 PM
Sigil highlight Book View No Longer Shows in Code View Themus Sigil 4 10-04-2012 07:54 PM
quotes differences book view & code view cybmole Sigil 13 03-29-2011 01:53 AM
lock book view & code view windows into synch cybmole Sigil 5 01-19-2011 10:30 PM


All times are GMT -4. The time now is 12:58 AM.


MobileRead.com is a privately owned, operated and funded community.