Old 02-21-2022, 09:16 AM   #1
ColMac
Connoisseur
Posts: 59
Karma: 10
Join Date: Apr 2012
Device: Kindle Fire
em dash conversion error

I'm looking at a book in the calibre Edit Book panels.

I have a lot of em dashes which have apparently converted wrongly. In the file preview panel on the right, each one displays as a small empty square. On my Kindle, it displays as a rectangle with two "0"s at the top and 97 below that.

It is easy enough to replace each one I find, but in the HTML view the em dash simply displays as a space. Unfortunately, not all of these em dashes were at the end of a line and followed by a ".

In many cases, they are in the middle of a sentence. I may be able to find them by searching for a double space, but presumably it must be a different character.

Any suggestions on what I should search for, please?
[Attached image: em.jpg]
Old 02-21-2022, 10:23 AM   #2
retiredbiker
Evangelist
Posts: 447
Karma: 3886916
Join Date: May 2013
Location: Ontario, Canada
Device: Kindle KB, Oasis, Pop_Os!, Jutoh, Kobo Forma
That 0097 you see is the character you can't see - it is U+0097 END OF GUARDED AREA, a control character. If you put the cursor just after one of these in the editor and look down to the bottom right of the screen, it should show you the character name.

So it is a little tricky, but you can actually select that invisible character with shift+Left Arrow, copy it, and paste it into the Find box. Then replace it with the em-dash.

There may be other punctuation done with some invisible arcane characters. I have found a very few old books completely punctuated with these bizarre things. I used to be able to "see" these characters on my PC as funny little pictures in some applications, but some update to my OS must have changed something, and they are now invisible everywhere, so it is more difficult.
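
A note on why this happens, plus a way to fix every such character in one pass: byte 0x97 is the em dash in Windows-1252, so text saved in that encoding but decoded as Latin-1 ends up with U+0097 where the dash should be. The sketch below is plain Python, not calibre's own code, and the names `C1_RANGE` and `fix_c1_controls` are made up for illustration. It assumes every C1 control character (U+0080 to U+009F) in the text is a misread Windows-1252 byte, which may not hold for every book.

```python
import re

# C1 control characters: what Windows-1252 bytes 0x80-0x9F turn into
# when they are decoded as Latin-1.
C1_RANGE = re.compile(r"[\x80-\x9f]")


def _as_cp1252(match):
    """Reinterpret one C1 control character as a Windows-1252 byte."""
    ch = match.group(0)
    try:
        return bytes([ord(ch)]).decode("cp1252")
    except UnicodeDecodeError:
        # A few bytes (0x81, 0x8D, 0x8F, 0x90, 0x9D) are undefined in
        # Python's cp1252 codec; leave those characters untouched.
        return ch


def fix_c1_controls(text):
    """Replace stray C1 controls with the punctuation they stood for."""
    return C1_RANGE.sub(_as_cp1252, text)


if __name__ == "__main__":
    sample = "wait\x97what"
    print(ascii(fix_c1_controls(sample)))
```

In the editor itself, a regular-expression search for `\x97` should find the same character without the copy-and-paste step, assuming the search box accepts Python-style hex escapes.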
[Attached image: character.jpg]

Last edited by retiredbiker; 02-21-2022 at 10:26 AM.
Old 02-21-2022, 11:00 AM   #3
ColMac
Many, many thanks for a perfect explanation, and so fast too. I had not spotted the character names at the bottom right, so that's a great tip for me too.

It turned out I had 55 of them, so it was a lot easier using the search & replace. (I had feared there might be more!)
Old 02-21-2022, 04:19 PM   #4
BetterRed
null operator (he/him)
Posts: 21,660
Karma: 29711016
Join Date: Mar 2012
Location: Sydney Australia
Device: none
@ColMac - Curious - what formats did you convert from and to?

The editor's Reports tool can provide a list of the characters used in a book. I use it to determine whether a book has any 'unusual' characters and to track them down if necessary (it's a clickable list):

[Attached image: Screenshot 2022-02-22 080007.jpg]

The list at the bottom, and individual cells (e.g. the codepoint value), can be copied to the clipboard, which can be handy at times.
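
For anyone who wants the same kind of list outside the editor, here is a rough Python sketch of a character report. It is not how calibre builds its report; `character_report` and `suspicious` are hypothetical names, and the "suspicious" rule (any control character other than tab, line feed and carriage return) is only an assumption about what counts as 'unusual'.

```python
import unicodedata
from collections import Counter


def character_report(text):
    """Return (code point, name, count) rows, rarest characters first."""
    counts = Counter(text)
    rows = []
    for ch, n in sorted(counts.items(), key=lambda item: (item[1], item[0])):
        # Control characters such as U+0097 have no formal Unicode name,
        # so fall back to a placeholder instead of raising ValueError.
        name = unicodedata.name(ch, "<no name>")
        rows.append((f"U+{ord(ch):04X}", name, n))
    return rows


def suspicious(text):
    """Code points of control characters other than tab, LF and CR."""
    return sorted(
        {
            f"U+{ord(ch):04X}"
            for ch in text
            if unicodedata.category(ch) == "Cc" and ch not in "\t\n\r"
        }
    )


if __name__ == "__main__":
    for row in character_report("aa\x97"):
        print(row)
    print(suspicious("a\x97\n"))
```

Sorting rarest first puts one-off oddities at the top, which is usually where a stray control character shows up.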

BR
Old 02-21-2022, 04:57 PM   #5
retiredbiker
I've only ever seen these in quite old books from an iffy Russian site. When I find one it is usually totally punctuated this way, and the character report looks like this, with the first five characters being the punctuation. I haven't seen one for a while; kept one epub as a curiosity; can't remember if they were in other formats as well.
[Attached image: character.jpg]
Old 02-21-2022, 05:49 PM   #6
Karellen
Wizard
Posts: 1,611
Karma: 9500498
Join Date: Sep 2021
Location: Australia
Device: Kobo Libra 2
Quote:
Originally Posted by BetterRed View Post
The list at the bottom, and individual cells (e.g. the codepoint value), can be copied to the clipboard, which can be handy at times.
What is that string of characters at the bottom of the list - Withasg.eldrnobfwvp.....
Is that what can be copied? What do you then do with it?
Thanks
Old 02-21-2022, 06:41 PM   #7
BetterRed
My 'handy at times' implies ad-hoc usage.

Like most things in calibre, the string at the bottom has a context menu.

Example: I pasted a string that had rupee ₹ and rial ﷼ symbols onto a sticky note and used it to update Transtools Insert Symbol tool.

BR

Last edited by BetterRed; 02-21-2022 at 06:43 PM.
Old 02-21-2022, 11:57 PM   #8
Karellen
Quote:
Originally Posted by BetterRed View Post
My 'handy at times' implies ad-hoc usage.

Like most things in calibre, the string at the bottom has a context menu.

Example: I pasted a string that had rupee ₹ and rial ﷼ symbols onto a sticky note and used it to update Transtools Insert Symbol tool.

BR
Sorry, I am not on the ball with this.

Yes I can copy this string using the context menu.

Quote:
overid_csCImagJhnGTtlPHELNSB09581XRuUK.42Af,MywW6( )VxFpb3kj’qD©7O“Yz?”!-Q—;é‘:&–/…*Z
But what is it?
What is it used for?
It does not change as I select the different characters in the list above it, but it is on the same page so I guess it is somehow related to the character list, but I have no idea what it is.
Old 02-22-2022, 05:53 AM   #9
ColMac
Quote:
Originally Posted by BetterRed View Post
what formats did you convert from and to?
BR
Unfortunately, I've no idea now; it's been in my library for months, maybe years. In hindsight, maybe I didn't actually convert it at all.

Quote:
Originally Posted by BetterRed View Post
The editor's Reports tool can provide a list of the characters used in a book. I use it to determine whether a book has any 'unusual' characters and to track them down if necessary (it's a clickable list)

BR
I've never used that, but it is very useful. Thanks
Old 02-22-2022, 11:09 AM   #10
Sarmat89
Fanatic
Posts: 515
Karma: 2268308
Join Date: Nov 2015
Device: none
Just replace "UTF-8" with "Latin-1" in the header of the problem file.
Old 02-22-2022, 12:36 PM   #11
DNSB
Bibliophagist
Posts: 45,312
Karma: 168808723
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Libra Colour, Lenovo M8 FHD, Paperwhite 4, Tolino epos
Quote:
Originally Posted by Sarmat89 View Post
Just replace "UTF-8" with "Latin-1" in the header of the problem file.
Sadly, an epub MUST use UTF-8 or UTF-16. Amazon is not happy with an epub sent to it for publishing via KDP that does not use UTF-8, though I've never tried sending one with UTF-16 to see if that also works.
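
One way to keep the file in UTF-8 and still get the dashes back is to transcode rather than relabel: decode the legacy bytes with the codec they were really written in, then save the result as UTF-8. The Python sketch below assumes the source bytes are Windows-1252, which is a guess about this particular book; `to_utf8` is a hypothetical helper name. It also shows why a Latin-1 decode produces the invisible U+0097 while a Windows-1252 decode produces an em dash.

```python
def to_utf8(raw: bytes, source_encoding: str = "cp1252") -> bytes:
    """Decode legacy bytes with the given codec and re-encode as UTF-8."""
    return raw.decode(source_encoding).encode("utf-8")


# 0x97 is the em dash in Windows-1252 but a control code in Latin-1.
legacy = b"wait\x97what"

as_latin1 = legacy.decode("latin-1")  # 0x97 becomes U+0097 (invisible)
as_cp1252 = legacy.decode("cp1252")   # 0x97 becomes U+2014 (em dash)

if __name__ == "__main__":
    print(ascii(as_latin1))
    print(ascii(as_cp1252))
    print(to_utf8(legacy))
```

After transcoding, any encoding declaration in the file header would still need to say UTF-8 for the result to be consistent.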
All times are GMT -4.