Old 08-15-2011, 02:19 AM   #1
kamanza
Connoisseur
Posts: 98
Karma: 10
Join Date: Jan 2011
Device: none
Text formatting (newbie questions)

I've been trying to make an EPUB from an RTF file using three methods: directly in calibre, with Writer2EPUB, and by converting RTF to TXT in calibre and then importing the result into Sigil.
In all cases the result was the same: a lot of broken lines and blank lines between paragraphs instead of indents, which makes the book very ugly and difficult to read.
Removing all the </p> <p> pairs manually takes forever, even when using Find & Replace to search for "<p>[a-z]" with Find Next, and being a newbie (my second day with Sigil) I don't know where to start with removing the blank lines.
I would also like to be able to decrease the font size in the resulting ebook. My desktop ADE switches to a three-column display if I do it there, and I like an ebook to be as much like the paper one as possible.
I apologize in advance if my questions are too dumb, but help would be very much appreciated.
Old 08-15-2011, 04:20 AM   #2
weedfreak
Addict
Posts: 302
Karma: 185297
Join Date: Sep 2009
Location: Ankh Morpork
Device: calibre
It sounds as if the file may have hardcoded line feeds in it. Luca (the creator of Writer2EPUB) also has a Writer plugin called Text Cleaner, which can help remove blank lines. Give that a try, then run the file through Writer2EPUB; the result should be a lot better, and Sigil can do the tidying up afterwards.
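
For a plain-text export, the effect of hardcoded line feeds can also be undone with a small script. The following is only a rough Python sketch, not a description of what Text Cleaner does internally. It assumes that paragraphs are separated by one or more blank lines and that every other line break is a hard wrap to be joined with a space; the function name `unwrap_text` and the sample text are made up for this illustration.

```python
import re


def unwrap_text(text: str) -> str:
    """Join hard-wrapped lines and keep one blank line between paragraphs.

    Assumption: a run of one or more blank lines marks a paragraph break,
    and any other line break is a hard wrap inside a paragraph.
    """
    # Normalise Windows and old Mac line endings first.
    text = text.replace("\r\n", "\n").replace("\r", "\n")
    # Split on runs of blank (or whitespace-only) lines.
    blocks = re.split(r"\n\s*\n", text.strip())
    paragraphs = []
    for block in blocks:
        # Collapse remaining line breaks and repeated spaces into one space.
        joined = " ".join(block.split())
        if joined:
            paragraphs.append(joined)
    return "\n\n".join(paragraphs)


sample = "It was a dark\nand stormy night.\n\n\n\nThe rain fell\nin torrents."
print(unwrap_text(sample))
```

If the source separates paragraphs with single line breaks only, this assumption does not hold and a different heuristic is needed.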

By setting a smaller font in Writer you should get a smaller font after running the file through Writer2EPUB. Alternatively, you can modify the CSS in Sigil to get whatever size you want: experiment with "font-size: Xem", where X is a scale factor relative to the inherited font size. Values less than 1 are smaller, values greater than 1 are larger.
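
As a rough illustration of the CSS route, the Python sketch below rewrites the `font-size` of one rule in a stylesheet string. The selector `body`, the sample stylesheet and the helper name `set_font_size` are assumptions made for this example; the stylesheet in a real book will use its own selectors, and the same change can simply be typed by hand in Sigil.

```python
import re


def set_font_size(css: str, selector: str, size_em: float) -> str:
    """Set font-size (in em) inside the first rule for `selector`.

    Simplified sketch: handles flat rules such as `body { ... }` only,
    not grouped selectors, comments or nested at-rules.
    """
    pattern = re.compile(
        r"(?P<head>(?:^|\})\s*" + re.escape(selector) + r"\s*\{)"
        r"(?P<body>[^}]*)\}"
    )
    match = pattern.search(css)
    if match is None:
        return css  # selector not found; leave the stylesheet unchanged
    declaration = f"font-size: {size_em}em"
    body = match.group("body")
    if re.search(r"font-size\s*:", body):
        # Replace the existing value, up to (not including) the semicolon.
        body = re.sub(r"font-size\s*:[^;]*", declaration, body, count=1)
    else:
        # Append a new declaration, adding a missing semicolon if needed.
        stripped = body.rstrip()
        if stripped and not stripped.endswith(";"):
            stripped += ";"
        body = stripped + f"\n  {declaration};\n"
    return css[:match.start()] + match.group("head") + body + "}" + css[match.end():]


css_in = "body {\n  font-family: serif;\n  font-size: 1em;\n}\n"
print(set_font_size(css_in, "body", 0.9))
```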
Old 08-15-2011, 06:00 AM   #3
kamanza
Connoisseur
Posts: 98
Karma: 10
Join Date: Jan 2011
Device: none
Thanks a lot, going to try right now.
Old 08-15-2011, 06:11 AM   #4
Toxaris
Wizard
Posts: 3,183
Karma: 7180223
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-300, PRS-T1
What is the quality of your RTF? Usually it will be garbage in, garbage out.

There are also other RTF-to-HTML converters out there. You can import the resulting HTML directly into Sigil.
Old 08-15-2011, 12:20 PM   #5
theducks
Grand Sorcerer
Posts: 15,270
Karma: 6022733
Join Date: Aug 2009
Location: (The original) Silicon Valley, USA
Device: Galaxy Tab 2, Astak Pocket Pro, K4NT
Once you solve the other issues, font size is usually a piece of cake (except when it is not; see @Toxaris's post).

In the style sheet:
find the class used in the <body> tag and change its font-size value
(test the results by looking at many different kinds of pages in the book).

If only a few areas have issues, spot-fix the font-size of their classes. If not, you are going to have to work backwards through the nested 'box model' to find the culprit.

IMHO a 300+ line CSS file for a simple book is a GIGO (garbage in, garbage out) issue.
Old 08-16-2011, 07:03 AM   #6
kamanza
Connoisseur
Posts: 98
Karma: 10
Join Date: Jan 2011
Device: none
Quote:
Originally Posted by weedfreak
It sounds as if the file may have hardcoded line feeds in it. Luca (the creator of Writer2EPUB) also has a Writer plugin called Text Cleaner, which can help remove blank lines. Give that a try, then run the file through Writer2EPUB; the result should be a lot better, and Sigil can do the tidying up afterwards.

Text Cleaner did an admirable job of connecting most of the broken lines, but at the same time it replaced all the apostrophes, quotes and dashes with question marks, which in turn disappeared completely when the file was imported into Sigil.
Is there a way around this problem? If at least the ?'s stayed in Sigil, I could try to fix them with Find & Replace.

The other question, about the font size, really is as simple as you said.

Thanks, and if you have any further suggestions I would very much welcome them.
Old 08-16-2011, 10:16 AM   #7
Toxaris
Wizard
Posts: 3,183
Karma: 7180223
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-300, PRS-T1
I still suspect your source file is not good. Convert your file first to HTML and try to repair it there.
Old 08-16-2011, 12:32 PM   #8
theducks
Grand Sorcerer
Posts: 15,270
Karma: 6022733
Join Date: Aug 2009
Location: (The original) Silicon Valley, USA
Device: Galaxy Tab 2, Astak Pocket Pro, K4NT
Quote:
Originally Posted by kamanza
Text Cleaner did an admirable job of connecting most of the broken lines, but at the same time it replaced all the apostrophes, quotes and dashes with question marks, which in turn disappeared completely when the file was imported into Sigil.
Is there a way around this problem? If at least the ?'s stayed in Sigil, I could try to fix them with Find & Replace.

The other question, about the font size, really is as simple as you said.

Thanks, and if you have any further suggestions I would very much welcome them.

That sounds like a wrong or missing character encoding declaration in the document. Does Text Cleaner have a setting to force a particular encoding?
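
One way such question marks can arise is shown below as a small Python sketch; it is an illustration, not a diagnosis of this particular file. Curly quotes and dashes have no ASCII equivalent, so a tool that saves text in a character set lacking them may substitute "?". The sample string and the choice of ASCII, Windows-1252 and UTF-8 are assumptions made for the example.

```python
# Text containing curly quotes, a typographic apostrophe and a long dash,
# written with escapes so the script itself is plain ASCII.
original = "\u201cDon\u2019t go\u201d \u2014 she said."

# Encoding to a character set that lacks these characters, with
# errors="replace", turns each of them into a literal question mark.
lossy = original.encode("ascii", errors="replace").decode("ascii")
print(lossy)  # ?Don?t go? ? she said.

# Windows-1252 and UTF-8 can both represent them, so nothing is lost as
# long as the bytes are later decoded with the same encoding.
for encoding in ("cp1252", "utf-8"):
    round_trip = original.encode(encoding).decode(encoding)
    assert round_trip == original

# Decoding with the wrong encoding garbles the text instead.
garbled = original.encode("utf-8").decode("cp1252", errors="replace")
print(garbled)
```

Once the characters have been replaced by "?", the original punctuation cannot be recovered from the output alone, which is why fixing the encoding at the source matters.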
Old 08-17-2011, 11:03 AM   #9
weedfreak
Addict
Posts: 302
Karma: 185297
Join Date: Sep 2009
Location: Ankh Morpork
Device: calibre
I have come across this character encoding problem a couple of times in the last year. As far as I can tell, it is not caused by Text Cleaner but by something in the original file. It seems that some Microsoft software does not apply the correct encoding, and the error propagates when other programs get confused and try to re-encode the text. The main problem seems to be LibreOffice (or OpenOffice) trying to correct errors from Word. Without wishing to be biased, it seems that many US Windows installs are not internationally aware.

Have you tried the other suggestion of using calibre to convert to HTML first? This should solve the encoding issue, and if the line feeds are still there, Text Cleaner should just remove them.
Old 08-17-2011, 07:50 PM   #10
ldolse
Wizard
Posts: 1,337
Karma: 123455
Join Date: Apr 2009
Location: Malaysia
Device: PRS-650, iPhone
Calibre also has a line unwrap feature under heuristics, and its non-ASCII handling for RTF was improved a while back as well.

If neither calibre nor Writer2EPUB can handle the non-ASCII characters correctly, use OpenOffice or Word to convert to HTML, then use the HTML source in calibre.
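
For HTML that still contains broken paragraphs, a crude unwrap heuristic can be sketched in Python. This is not calibre's algorithm, just an illustration under simple assumptions: paragraphs are plain `<p>...</p>` elements without attributes, and a paragraph break is treated as spurious when the text before it does not end in sentence-final punctuation and the text after it starts with a lowercase letter. The name `merge_broken_paragraphs` is hypothetical.

```python
import re

# Characters that plausibly end a real paragraph (normal string, so Python
# resolves the \u escapes for the curly closing quotes).
SENTENCE_END = ".!?:\"'\u201d\u2019"

# A </p><p> pair counts as a broken line when the character before it is
# not whitespace, not the end of an inline tag (">"), and not one of the
# SENTENCE_END characters, and the next paragraph starts in lowercase.
BROKEN_BREAK = re.compile(
    r"(?<=[^\s>" + re.escape(SENTENCE_END) + r"])\s*</p>\s*<p>\s*(?=[a-z])"
)


def merge_broken_paragraphs(html: str) -> str:
    """Replace spurious paragraph breaks with a single space."""
    return BROKEN_BREAK.sub(" ", html)


sample_html = (
    "<p>It was a dark</p>\n"
    "<p>and stormy night.</p>\n"
    "<p>The rain fell.</p>"
)
print(merge_broken_paragraphs(sample_html))
```

A heuristic like this will miss some breaks and may join a few lines that were meant to stay separate, so the result should be checked by eye in Sigil.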
All times are GMT -4.