I'm trying to convert PDF to ePub, this is how I did it:
1- saving PDF as HTML 4.0 with CSS 1.0 format
after saving it, I opened in FF and noticed that some paragraphs are broken (there are some blank spaces between char as below):
Quote:
...that it was not alive, but her
fear remained...
|
then I opened the HTML file with NotePad++ and noticed that there's a <p> tag around the "fear remained...", further analysis I noticed that the "fear remained..." is in a new page, so I checked other new pages, it seems some time Acrobat puts in <p> on the page break, and some times not. Anyone knows how to fix it without having to edit it one by one?
This is what I have in my HTML file:
Quote:
<P><SPAN>.....</SPAN
></P>
<P style="text-align:center; margin-left:0px">
<SPAN style="font-size:12pt; font-weight:normal; color:#000000"
>fear remained. For it was the first artificial object that she had ever seen. </SPAN
></P>
|
Some of the paragraphs does not break, so I have
Quote:
<P><SPAN>.....</SPAN
>
<SPAN style="font-size:12pt; font-weight:normal; color:#000000"
>fear remained. For it was the first artificial object that she had ever seen. </SPAN
></P>
|
ps: the environment I'm using is:
Win XP, Acrobat Pro 8.
Thanks for your comment/suggestion
Michael