#1
Junior Member
Posts: 2
Karma: 10
Join Date: Jan 2014
Device: Kobo Aura
Hi there,
I have been converting AZW3 ebooks to EPUB using Calibre, and all was good until the last book. It looks like new lines are being converted to page breaks, so, for example, in the chapters section, each line is on a new page. The book has ended up nearly 5000 pages long, when it should be around 480. Any hints on how to fix this would be appreciated.
PS: I tried turning on Heuristics with the default settings, but this didn't work either.
Last edited by dt_nz; 01-19-2014 at 02:59 AM.
#2
eBook Enthusiast
Posts: 85,544
Karma: 93383099
Join Date: Nov 2006
Location: UK
Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6
#3
US Navy, Retired
Posts: 9,896
Karma: 13806776
Join Date: Feb 2009
Location: North Carolina
Device: Icarus Illumina XL HD, Kindle PaperWhite SE 11th Gen
#4
Wizard
Posts: 3,720
Karma: 1759970
Join Date: Sep 2010
Device: none
Quote:
The OP could try what I did: go to Preferences, Structure Detection, remove the default expressions, and reconvert to see if that works as a workaround.
PS: as I am now a little geeky-curious, is there a simple tool that allows a peek at the source AZW3 code, to see if there is anything non-standard about how the paragraphs are constructed?
#5
Grand Sorcerer
Posts: 6,251
Karma: 16539642
Join Date: Sep 2009
Location: UK
Device: ClaraHD, Forma, Libra2, Clara2E, LibraCol, PBTouchHD3
#6
eBook Enthusiast
Posts: 85,544
Karma: 93383099
Join Date: Nov 2006
Location: UK
Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6
#7
Wizard
Posts: 3,720
Karma: 1759970
Join Date: Sep 2010
Device: none
Does it not actually convert it to an EPUB on the fly, like the calibre viewer?
Ah, it seems not, and I see the issue: I put a fresh copy of the book into calibre, so there was only the one format, and pressed T. Sure enough, it opened, and I see that "chapter" is being used within the actual chapters as a class for each sentence, like this:
<p class="chapter">She climbed down....
That must be confusing the default XPath structure detection expression, so it's not a "bug" as such; it is a case where the default structure detection needs to be tweaked.
Last edited by cybmole; 01-19-2014 at 10:13 AM.
#8
creator of calibre
Posts: 45,271
Karma: 27111060
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
Neither the calibre viewer nor Edit Book converts to EPUB on the fly. Both of them extract the HTML/CSS from the binary AZW3 wrapper, in exactly the same way.
#9
Wizard
Posts: 3,720
Karma: 1759970
Join Date: Sep 2010
Device: none
Quote:
Am I right in my previous post that the use of "chapter" as a CSS class would confuse structure detection?
#10
eBook Enthusiast
Posts: 85,544
Karma: 93383099
Join Date: Nov 2006
Location: UK
Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6
#11
creator of calibre
Posts: 45,271
Karma: 27111060
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
It doesn't get confused; it makes the very reasonable assumption that if a tag is marked as class="chapter", it corresponds to a chapter. There are some ebooks out there that feel putting class="chapter" on all their tags is a smart thing to do. It takes all kinds...
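To make the effect concrete, here is a minimal Python sketch (not calibre's own code) that applies only the class-based part of the rule to a small, made-up fragment shaped like the book described in this thread. The sample markup and the helper name `class_rule_matches` are assumptions for illustration, and the standard library's ElementTree stands in for calibre's XPath engine.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample: one real heading plus body paragraphs that all carry
# class="chapter", similar to the markup quoted earlier in the thread.
SAMPLE = """
<body>
  <h2>Chapter 1</h2>
  <p class="chapter">She climbed down.</p>
  <p class="chapter">The ladder creaked.</p>
  <p class="chapter">Nobody was waiting below.</p>
</body>
"""


def class_rule_matches(root):
    """Return every element whose class attribute is exactly 'chapter'."""
    return root.findall(".//*[@class='chapter']")


root = ET.fromstring(SAMPLE)
matches = class_rule_matches(root)

# Each match would be treated as the start of a chapter.
for element in matches:
    print(element.tag, "->", element.text)
```

Every body paragraph matches, so if each detected chapter starts on a new page, this kind of book ends up with a page break at every line, which fits the symptom in the first post.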
#12
eBook Enthusiast
Posts: 85,544
Karma: 93383099
Join Date: Nov 2006
Location: UK
Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6
If you do find a book like that, it's extremely easy to fix. On the conversion options screen, go to "Structure Detection" and delete the part of the string that says something like "or class='chapter'".
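As a rough sketch of that edit, the snippet below removes the class clause from an expression string. `DEFAULT_EXPR` is an assumption: it approximates the default chapter expression and may differ between calibre versions, so compare it with what your own Structure Detection box shows. The helper `drop_class_clause` is hypothetical and only illustrates the string change; in practice you would make the same edit by hand in the dialog.

```python
# Assumed approximation of the default chapter-detection expression.
# Check the real string in your own conversion dialog before editing it.
DEFAULT_EXPR = (
    r"//*[((name()='h1' or name()='h2') and "
    r"re:test(., '\s*((chapter|book|section|part)\s+)|"
    r"((prolog|prologue|epilogue)(\s+|$))', 'i')) "
    r"or @class = 'chapter']"
)

# The part of the string that matches on the class name.
CLASS_CLAUSE = " or @class = 'chapter'"


def drop_class_clause(expr):
    """Remove the class-based alternative, leaving the heading test intact."""
    return expr.replace(CLASS_CLAUSE, "")


edited = drop_class_clause(DEFAULT_EXPR)
print(edited)
```

The remaining expression still matches h1/h2 headings by their text, so ordinary paragraphs tagged with the class are no longer treated as chapter starts.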
#13
Junior Member
Posts: 2
Karma: 10
Join Date: Jan 2014
Device: Kobo Aura
Just saw HarryT's reply, gave it a crack, and it works. Awesome!!
Last edited by dt_nz; 01-20-2014 at 04:17 AM.
#14
eBook Enthusiast
Posts: 85,544
Karma: 93383099
Join Date: Nov 2006
Location: UK
Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6
If the chapters are called "Chapter N", then you can entirely remove the class name detection from the xpath; it'll match on the name.
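For anyone curious what matching on the name amounts to, here is a small Python sketch. The pattern and the helper `looks_like_chapter_heading` are assumptions for illustration, not calibre's implementation: it treats an element as a chapter start only if it is an h1 or h2 whose text begins with a keyword such as "Chapter" followed by whitespace.

```python
import re

# Assumed keyword pattern for illustration: a heading word, then whitespace.
HEADING_RE = re.compile(r"\s*(chapter|book|section|part)\s+", re.IGNORECASE)


def looks_like_chapter_heading(tag, text):
    """True for h1/h2 elements whose text starts like 'Chapter N'."""
    return tag in ("h1", "h2") and HEADING_RE.match(text) is not None


# A real heading is detected by its text alone.
print(looks_like_chapter_heading("h2", "Chapter 5"))

# A body paragraph is ignored, whatever class it carries.
print(looks_like_chapter_heading("p", "She climbed down...."))
```

With a test like this, the class attribute plays no part, so a book that puts class="chapter" on every paragraph would still be split only at its real headings.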
#15
eBook Enthusiast
Posts: 85,544
Karma: 93383099
Join Date: Nov 2006
Location: UK
Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6
Great!
The only reason I know this is that I had the same problem myself: not with books being split on each line, but with some books that had chapter headings like:
Chapter 5
Our Hero Takes Action
The chapter number and title appeared on separate pages, for the same reason (a class name called "chapter_something" being used for the chapter title).