Note that line unwrapping is really particular to each PDF file, not your system itself. PDFs are finicky and you will find that you need to tweak settings for every one you convert. Some like a larger unwrap factor. Some like a smaller unwrap factor. Most need individually customized regexes to remove header/footer bits. Many won't find chapters correctly and thus require a second pass through Sigil or another epub editor.
Also note that epubs don't technically have "pages" because there's no concept of a fixed page size (though there is a concept in the standard for virtual pages, but a single virtual page might span 3 or 4 actual "pages" on your reading device). Epubs are flowable, and they fit to whatever screen you're reading them on. A PDF that converts to 300 "pages" on an iPod-sized screen may only be 100 "pages" on an iPad. Don't judge your output quality by the number of "pages" result from the conversion. That's a meaningless metric. Judge the output quality based on the output quality. Did the lines unwrap correctly? Did the header/footer get removed correctly? Were headings and other formatting correctly preserved? Is the content readable?
|