View Single Post
Old 01-09-2013, 11:45 AM   #8
Paxman53
Connoisseur
Paxman53 began at the beginning.
 
Posts: 55
Karma: 10
Join Date: Jan 2011
Device: 7" Tablet - Aldiko Reader Premium
Quote:
Originally Posted by DoctorOhh View Post
FYI, sticky posts (like this one) at the top of forums have valuable information.

Hopefully someone will have some insight soon.
Thank you Doctor Ohh,

On that basis I have retried the data and created in Sigil a test Epub file which shows the problem.

Test Subject (Original).epub Test Subject (Converted).epub

Here is the Conversion Log details for that operation:

Initial parse failed, using more forgiving parsers
Parsing OEBPS/Text/Section0002.xhtml as HTML
Reading TOC from NCX...
Merging user specified metadata...
Detecting structure...
Inserting metadata into book...
Flattening CSS and remapping font sizes...
Source base font size is 12.00000pt
Removing fake margins...
Found 2 items of level: div_1
Found 1 items of level: div_2
div_1 left margin stats: Counter()
div_1 right margin stats: Counter()
div_2 left margin stats: Counter()
div_2 right margin stats: Counter()
Cleaning up manifest...
Trimming unused files from manifest...
Creating EPUB Output...
Splitting markup on page breaks and flow limits, if any...
Looking for large trees in jacket.xhtml...
No large trees found
Looking for large trees in OEBPS/Text/Section0001.xhtml...
No large trees found
Looking for large trees in OEBPS/Text/Section0002.xhtml...
No large trees found
Generating default cover
EPUB output written to C:\Users\GRAEME~1\AppData\Local\Temp\calibre_0.9.1 2_tmp_bdiovg\grgztc.epub

Output File above

Description of problem: Instead of getting text in book view:
[Chapter One

This is test material to show how the conversion process in Calibre is working.]

there seems to be a mixture of HTML symbols and individual letters:

[< ! D O C T Y P E h t m l P U B L I C " - / / W 3 C / / D T D X H T M L 1 . 1 / / E N " " h t t p : / / w w w . w 3 . o r g / T R / x h t m l 1 1 / D T D / x h t m l 1 1 . d t d " > < h t m l x m l n s = " h t t p : / / w w w . w 3 . o r g / 1 9 9 9 / x h t m l " > < h e a d > < t i t l e > < / t i t l e > < / h e a d > < b o d y > < h 3 > C h a p t e r O n e < / h 3 > < p > T h i s i s t e s t m a t e r i a l t o s h o w h o w t h e c o n v e r s i o n p r o c e s s i n C a l i b r e i s w o r k i n g . < / p > < / b o d y > < / h t m l >

No error message shown.

No conversion options changed from my normal usage, which is:

Look & Feel: disable font size - unchecked - base font size 12.0 pt.
remove spacing - checked
indent size 1.5 em
left allign
smarten puctuation - checked
keep ligatures - checked
Extra CSS - h2, h3, h4, h5 { text-align: center }

Heuristic Processing: not enabled

Page Setup: Input Profile - Default, Output Profile - Tablet

Structure Detection: Detect Chapters - //*[((name()='h2' or name()='h3') and re:test(., 'chapter|book|section|part|0|1|2|3|4|5|6|7|8|9\s+' , 'i')) or @class = 'chapter']

Remove fake margins - checked
Insert Metadata - checked

Table Of Contents: Level 1 Toc //h:h2, Level 2 Toc //h:h3, Level 3 Toc //h:h4

Search & Replace: CHAPTER to Chapter

Epub Output: Initially Preserve Cover ratio - Checked, then on subsequent conversion - Do not Split On Page Breaks - Checked

Since my initial reporting of the problem I have also tried deleting the Metadata file in Calibre Library and restoring the Library from Backup - has not resolved the problem.

I have also retried existing files and they are now recreating the problem.

I do hope someone can point me in the right direction as I would be lost without Calibre.

Looking forward to hearing from someone.
Paxman53 is offline   Reply With Quote