View Single Post
Old 05-22-2010, 10:05 PM   #226
DoctorOhh
US Navy, Retired
DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.
 
DoctorOhh's Avatar
 
Posts: 9,897
Karma: 13806776
Join Date: Feb 2009
Location: North Carolina
Device: Icarus Illumina XL HD, Kindle PaperWhite SE 11th Gen
Quote:
Originally Posted by Sadhu View Post
I've been trying to convert a book displays properly in Word and Acrobat
What format is it in when you view it in Word?

Quote:
Originally Posted by Sadhu View Post
I began by removing all footers an d page numbers from the original document, and then removed all hard page breaks except for the page before a new chapter (there are four chapters, and three hard page breaks). What surprised me was the when proofing the iBook (ePub), not only didn't the chapters always break properly
This is a good start. Chapter breaks in ePubs seem to be often used/inserted to ensure that no single html file within the epub is larger then the 300k limit. Otherwise many ePub renderers will not be able to read the file.

Quote:
Originally Posted by Sadhu View Post
Very disheartened by this, since Word and Acrobat have no code to make visible to find what is causing all this havoc.
In Word you have to reveal "Show paragraph marks and other hidden formatting symbols" this is done by pressing [cntr-shift-*] to toggle the view. Or to reveal the formatting marks go to the Home tab, in the Paragraph group, click Show/Hide button to the right of Sort.

You will see a bunch of paragraph marks causing the hard line breaks. When I have text like this I use My TXT Cleaner an extension for Open Office.org Writer program.

Quoted from the web page. (English version scroll down the page)

Quote:
Do you have problems with
texts having unwanted
line breaks like
this one?

This happens because there are some unwanted paragraph marks along the text. If we take the text from a PDF, inevitably we will get a paragraph mark at each end of line.

Now, or you delete them one by one with a lot of patience, or you can use the macro MyTXTcleaner that will do the work for you.

How does it work?
My TXT Cleaner removes all of the paragraph marks following lowercase letters, commas and semicolons. This because it is assumed that a paragraph usually don’t ends with a letter …

Last edited by DoctorOhh; 05-22-2010 at 10:14 PM.
DoctorOhh is offline