| 
	|||||||
![]()  | 
            
        
    
| 
             | 
        Thread Tools | Search this Thread | 
| 
			
			 | 
		#1 | 
| 
			
			
			
			 .~^пиратка^~. 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 238 
				Karma: 14000 
				Join Date: Sep 2009 
				Location: Ask NSA... 
				
				
				Device: Onyx Boox M92 
				
				
				 | 
	
	
	
		
		
			
			 
				
				Too big gaps between paragraphs, sentences split with break in between....
			 
			
			
			Some files off the internet have lost a lot of their formatting and are in a bad shape - for example paragraphs with several lines between them (instead of just one), sentences split in the middle with several empty lines between etc.  
		
	
		
		
		
		
		
		
		
		
		
		
	
	Or for exampke a diamond shaped question mark replacing characters like apostrophes. What are some tricks to clean up files in such a bad shape?  | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#2 | 
| 
			
			
			
			 GuteBook/Mobi2IMP Creator 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 2,958 
				Karma: 2530691 
				Join Date: Dec 2007 
				Location: Toronto, Canada 
				
				
				Device: REB1200 EBW1150 Device: T1 NSTG iLiad_v2 NC Device: Asus_TF Next1 WPDN 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			Without any example files it's hard to say...  
		
	
		
		
		
		
		
		
		
		
		
		
	
	![]() You can try to cleanse the file by converting the empty lines using search and replace constructs and/or converting to html each paragraph block with <p> which inherently ignores whitespace. The strange characters you see in place of apostrophes is a character encoding problem i.e. UTF-8 vs ANSI vs dos text. I try to always work in html and try to avoid literal characters for extended dos characters and use their equivalent html codes i.e. © for © Your best tool would be a powerful text editor like textplus or notepad+ and some knowledge of regex's (regular expression pattern matching)!  | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| Advert | |
| 
         | 
    
| 
			
			 | 
		#3 | 
| 
			
			
			
			 Grand Sorcerer 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 11,470 
				Karma: 13095790 
				Join Date: Aug 2007 
				Location: Grass Valley, CA 
				
				
				Device: EB 1150, EZ Reader, Literati, iPad 2 & Air 2, iPhone 7 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			Or start with HTML tidy. It can fix a lot of things.
		 
		
	
		
		
		
		
		
		
		
		
		
		
	
	 | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#4 | 
| 
			
			
			
			 .~^пиратка^~. 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 238 
				Karma: 14000 
				Join Date: Sep 2009 
				Location: Ask NSA... 
				
				
				Device: Onyx Boox M92 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			Thanks for the advice so far!  
		
	
		
		
		
		
		
		
		
		
		
		
	
	I started a more specific thread about this in the Sigil section.  | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
![]()  | 
            
        
    
| Thread Tools | Search this Thread | 
            
  | 
    
			 
			Similar Threads
		 | 
	||||
| Thread | Thread Starter | Forum | Replies | Last Post | 
| ebook has words running together with no gaps between them likethis | DarkRoast | General Discussions | 19 | 01-06-2011 02:05 AM | 
| Immense gaps between paragraphs | astra | ePub | 7 | 12-10-2010 11:21 AM | 
| big book--how to break up | monsieurms | Workshop | 8 | 02-04-2010 12:36 AM | 
| Filling in gaps in a PDF scan | Sparrow | Workshop | 0 | 08-10-2009 03:50 PM | 
| Large gaps before Chapters | PieOPah | Calibre | 13 | 01-27-2009 01:02 PM |