09-22-2012, 05:05 PM | #1 |
Enthusiast
Posts: 30
Karma: 12
Join Date: Jun 2012
Device: Ipad mini + Kindle Touch
|
Problem with image converting to text
Hi
I am having a strange problem when converting an EPUB book to AZW3 format. The book has some screen prints on various pages. These are inserted into the EPUB book no problem using the <img> tag and JPG files. When converting to AZW3 from EPUB the image is converted successfully to become part of the AZW3 file. The problem is that two images also convert some of the text showing in the image to additional parts for the AZW3 file? I have tried various things but can't stop this problem. Does anyone know what is causing this? The attachments show a screenshot from an image successfully converted (temp2) and a screenshot from unwanted text/images appearing in addition also (temp1). Thanks for any help. Goggs75 |
09-23-2012, 12:02 AM | #2 |
creator of calibre
Posts: 43,795
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
The extra text will not have come from the image, it will be present in the original file but likely hidden in some way, a way that does not survice the conversion to azw3. If you want more information, follow the instructions here: https://www.mobileread.com/forums/sho...d.php?t=186697
|
Advert | |
|
09-23-2012, 04:40 AM | #3 |
Enthusiast
Posts: 30
Karma: 12
Join Date: Jun 2012
Device: Ipad mini + Kindle Touch
|
Hi David
I am very impressed with Calibre. Attached is Chapter 2 of the EPUB file that is causing the problem. I converted the file to AZW3. The first and second image exist when converting but they also show lots of additional images/text underneath. I have only attached the EPUB file as an AZW3 file could not be attached. Thanks for any help. |
09-23-2012, 08:27 AM | #4 |
Enthusiast
Posts: 30
Karma: 12
Join Date: Jun 2012
Device: Ipad mini + Kindle Touch
|
Hi
I think I can see what is causing the problem but do not know how to solve this. The book that Chapter 2 is from is a book on HTML and CSS. This means there is a lot of HTML and CSS text included alongside the normal text. On conversion Calibre is treating the text below the first and second image (HTMLSTRUC1v3.jpg and HTMLSTRUC2v3.jpg) as though they are actually a string of commands (property tags and properties etc). On checking the debug folder there is no error showing on any of the text files created. The first time I can see it when looking at the AZW3 file created on the Ebook veiwer. |
09-23-2012, 09:34 AM | #5 |
creator of calibre
Posts: 43,795
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
I haven't looked at your file yet, but you should be escaping embedded html using entities. Use < for < and > for > and so on.
|
Advert | |
|
09-23-2012, 10:31 AM | #6 |
Enthusiast
Posts: 30
Karma: 12
Join Date: Jun 2012
Device: Ipad mini + Kindle Touch
|
Hi
The file already escapes embedded HTML using < for < and > for >. It works fine before converting to AZW3. It continues to escape this after converting except for the text following the first two images ? |
09-23-2012, 12:22 PM | #7 |
creator of calibre
Posts: 43,795
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Congratulations, you found a bug Fix will be in the next release.
|
09-23-2012, 02:21 PM | #8 |
Enthusiast
Posts: 30
Karma: 12
Join Date: Jun 2012
Device: Ipad mini + Kindle Touch
|
Hi David
Thanks for your help and program. |
Thread Tools | Search this Thread |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Text over image | kamanza | ePub | 5 | 09-17-2012 12:40 PM |
Image overlayed over text (but text visible if image disabled)? | Kaylee Skylyn | ePub | 5 | 08-01-2012 05:27 PM |
Image and Text problem from epub to mobi for Kindle DX | congngo | Conversion | 0 | 12-05-2011 04:48 PM |
Did you encounter a same problem about converting PDF to text/word/image | Ivymin | 3 | 11-29-2011 08:45 PM | |
text and image | pimpoum | 2 | 05-31-2009 04:26 AM |