![]() |
#1 | |||
Connoisseur
![]() Posts: 81
Karma: 10
Join Date: Aug 2010
Location: Murcia/Spain
Device: Android 12
|
PDF to HTML page break questions
I'm trying to convert PDF to ePub, this is how I did it:
1- saving PDF as HTML 4.0 with CSS 1.0 format after saving it, I opened in FF and noticed that some paragraphs are broken (there are some blank spaces between char as below): Quote:
This is what I have in my HTML file: Quote:
Quote:
Win XP, Acrobat Pro 8. Thanks for your comment/suggestion Michael |
|||
![]() |
![]() |
![]() |
#2 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,213
Karma: 12890
Join Date: Feb 2009
Location: Amherst, Massachusetts, USA
Device: Sony PRS-505
|
Have you tried using pdfreflow instead?
The only other thing I could suggest would be a reg ex find and replace. The exact syntax would vary by editor, and you'd have to figure out the rule -- would it be, if a paragraph ends with a lowercase letter, and the next one begins with one, merge the paragraphs? |
![]() |
![]() |
Advert | |
|
![]() |
#3 | |
Connoisseur
![]() Posts: 81
Karma: 10
Join Date: Aug 2010
Location: Murcia/Spain
Device: Android 12
|
Quote:
|
|
![]() |
![]() |
![]() |
#4 | |
Connoisseur
![]() Posts: 81
Karma: 10
Join Date: Aug 2010
Location: Murcia/Spain
Device: Android 12
|
Quote:
1) There's a </body> at the top of generated html file, I think this should be an opening instead of closing <body> (further down there's another </body>). 2) Sometimes there's missing </p>. For instances, the chapter sometimes has a closing </p> tag, sometime it's missing. |
|
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
How do I create chapters without a page-break between? | bfollowell | Sigil | 22 | 01-02-2011 12:38 PM |
How Do I Create A HTML jetBook Page Break? | galavanter | Ectaco jetBook | 21 | 10-29-2009 12:05 PM |
Why two separate page break xpaths in 0.6.x? | ldolse | Calibre | 3 | 08-12-2009 01:00 PM |
Page break before h2 question | Amalthia | Calibre | 9 | 04-17-2009 06:33 PM |
Page break before <b> | flowoeB | Calibre | 14 | 04-12-2009 03:05 PM |