|
![]() |
|
Thread Tools | Search this Thread |
![]() |
#1 |
.~^пиратка^~.
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 238
Karma: 14000
Join Date: Sep 2009
Location: Ask NSA...
Device: Onyx Boox M92
|
Too big gaps between paragraphs, sentences split with break in between....
Some files off the internet have lost a lot of their formatting and are in a bad shape - for example paragraphs with several lines between them (instead of just one), sentences split in the middle with several empty lines between etc.
Or for exampke a diamond shaped question mark replacing characters like apostrophes. What are some tricks to clean up files in such a bad shape? |
![]() |
![]() |
![]() |
#2 |
GuteBook/Mobi2IMP Creator
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 2,958
Karma: 2530691
Join Date: Dec 2007
Location: Toronto, Canada
Device: REB1200 EBW1150 Device: T1 NSTG iLiad_v2 NC Device: Asus_TF Next1 WPDN
|
Without any example files it's hard to say...
![]() You can try to cleanse the file by converting the empty lines using search and replace constructs and/or converting to html each paragraph block with <p> which inherently ignores whitespace. The strange characters you see in place of apostrophes is a character encoding problem i.e. UTF-8 vs ANSI vs dos text. I try to always work in html and try to avoid literal characters for extended dos characters and use their equivalent html codes i.e. © for © Your best tool would be a powerful text editor like textplus or notepad+ and some knowledge of regex's (regular expression pattern matching)! |
![]() |
![]() |
Advert | |
|
![]() |
#3 |
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 11,470
Karma: 13095790
Join Date: Aug 2007
Location: Grass Valley, CA
Device: EB 1150, EZ Reader, Literati, iPad 2 & Air 2, iPhone 7
|
Or start with HTML tidy. It can fix a lot of things.
|
![]() |
![]() |
![]() |
#4 |
.~^пиратка^~.
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 238
Karma: 14000
Join Date: Sep 2009
Location: Ask NSA...
Device: Onyx Boox M92
|
Thanks for the advice so far!
I started a more specific thread about this in the Sigil section. |
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
ebook has words running together with no gaps between them likethis | DarkRoast | General Discussions | 19 | 01-06-2011 01:05 AM |
Immense gaps between paragraphs | astra | ePub | 7 | 12-10-2010 10:21 AM |
big book--how to break up | monsieurms | Workshop | 8 | 02-03-2010 11:36 PM |
Filling in gaps in a PDF scan | Sparrow | Workshop | 0 | 08-10-2009 02:50 PM |
Large gaps before Chapters | PieOPah | Calibre | 13 | 01-27-2009 12:02 PM |