![]() |
#1 |
Connoisseur
![]() Posts: 98
Karma: 10
Join Date: Oct 2014
Device: kindle pw3
|
html to epub and chap detection
hello
i use the tool ebook-convert for convert a html file in a epub in my html file i have Code:
<mbp:pagebreak /><mbp:section> Code:
[..] </div> </div><mbp:pagebreak /><mbp:section><header></header> <div class="entry-content"> <div id="chapterContent" class="innerContent fr-view"> <p dir="ltr">Chapter [..] i use : ebook-convert "something.html" "something.mobi" PS the format of the chapter isn't always the same but the pagebreak and section is always at the end of the chapter sometimes the chapter is in a <h2> tag and is detected, but i don't have a standard Last edited by Trigun; 02-07-2017 at 06:33 AM. |
![]() |
![]() |
![]() |
#2 |
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,364
Karma: 27230406
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
<mbp
![]() |
![]() |
![]() |
Advert | |
|
![]() |
#3 |
Connoisseur
![]() Posts: 98
Karma: 10
Join Date: Oct 2014
Device: kindle pw3
|
i know where the chapter start (i merge different urls in a single html and after convert it to mobi)
i'm the one who add that tag for separate the chapters but i don't know the structure of the text inside of the urls sometimes the url is formatted with h2 title sometimes is different and when that happens i don't have the chapter end (i don't care about the title of the chap but atleast i want a division from the previous chapter) how i can do that? i can add a h2 title too instead of the mbp pagebreak but in that case if the text have already a h2 title i have double chapter Last edited by Trigun; 02-07-2017 at 09:19 AM. |
![]() |
![]() |
![]() |
#4 |
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 6,252
Karma: 16544692
Join Date: Sep 2009
Location: UK
Device: ClaraHD, Forma, Libra2, Clara2E, LibraCol, PBTouchHD3
|
If you know the mbp pagebreak tags are all already in the right places, you could try search/replacing all the mbp pagebreak tags with something like <h6></h6> (if your text doesn't contain any other h6 tags). Then use calibre to split on h6 tags. Finally remove the empty h6 pairs. Crude but functional.
|
![]() |
![]() |
![]() |
Thread Tools | Search this Thread |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
azw3 to epub - structure detection bug ? | cybmole | Conversion | 6 | 02-02-2014 07:24 PM |
Underline disappearing for Chap headings | oldghost | Conversion | 4 | 04-22-2012 04:37 PM |
Epub to Mobi Chapter Detection | ice2097 | Calibre | 4 | 12-29-2010 02:14 AM |
epub - force a 2nd pass to improve structure detection ? | cybmole | Calibre | 10 | 10-08-2010 01:00 AM |
To MOBI, Chapter detection fails? Works for EPUB | Fmstrat | Calibre | 7 | 08-29-2010 05:37 PM |