|  09-06-2025, 06:56 PM | #1 | 
| Junior Member  Posts: 8 Karma: 10 Join Date: Sep 2025 Device: Kindle 10th Generation Paperwhite | 
				
				Calibre tidy line breaks
			 
			
			I have an annoying problem: even after converting books and opening them in the Calibre editor, some are riddled with random line breaks. Can someone advise me on how to fix this without having to go through the HTML manually? Is there some code, a regex, or a plugin? TIA. Apologies if this should be in another thread.
		 | 
|   |   | 
|  09-06-2025, 08:41 PM | #2 | |
| Resident Curmudgeon            Posts: 80,677 Karma: 150249619 Join Date: Nov 2006 Location: Roslindale, Massachusetts Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3 | Quote: 
 You could use regex to combine lines that do not end in a punctuation mark. That would help a lot. Read this thread; it gives a lot of good information, including link(s) to regex to use. https://www.mobileread.com/forums/sh...d.php?t=357635 | |
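A minimal sketch of the "combine lines that do not end in a punctuation mark" idea, written as standalone Python rather than as a calibre feature. It assumes each stray break shows up in the HTML as a `</p>` followed directly by a new `<p ...>`, and it only joins when the next paragraph starts with a lowercase letter, which is a deliberately cautious extra condition. The names `SENTENCE_END`, `BROKEN_PARA` and `join_broken_paragraphs` are made up for illustration.

```python
import re

# Characters that plausibly end a sentence or clause. A paragraph ending in one
# of these is left alone. (Assumed list; adjust it for the book in question.)
SENTENCE_END = r""".!?:;"'”’)\]"""

# A paragraph boundary where the text before </p> does not end in one of the
# characters above and the next paragraph starts with a lowercase letter.
BROKEN_PARA = re.compile(
    r"(?<![" + SENTENCE_END + r"\s])"   # last visible character is not punctuation
    r"\s*</p>\s*<p\b[^>]*>\s*"          # the spurious paragraph boundary
    r"(?=[a-z])"                        # continuation starts in lowercase
)

def join_broken_paragraphs(html: str) -> str:
    """Replace each suspected spurious paragraph break with a single space."""
    return BROKEN_PARA.sub(" ", html)

if __name__ == "__main__":
    sample = (
        '<p>The quick brown</p>\n'
        '<p class="calibre1">fox jumped.</p>\n'
        '<p>New paragraph.</p>'
    )
    print(join_broken_paragraphs(sample))
```

Breaks that fall before a capitalised word (a proper noun, say) are skipped by this version, so a read-through is still needed afterwards. Work on a copy of the book.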
|   |   | 
|  09-06-2025, 08:53 PM | #3 | 
| Grand Sorcerer            Posts: 13,969 Karma: 243829945 Join Date: Jan 2014 Location: Estonia Device: Kobo Sage & Libra 2 | 
			
			A crappy conversion from PDF will always be a crappy conversion, unless you fix everything by hand, line by line. PDF is not a suitable format for conversion. TL;DR Don't convert from PDF if you can do it in any other way. | 
|   |   | 
|  09-07-2025, 06:50 PM | #4 | |
| Junior Member  Posts: 8 Karma: 10 Join Date: Sep 2025 Device: Kindle 10th Generation Paperwhite | 
				
				Thanks
			 Quote: 
 | |
|   |   | 
|  09-07-2025, 06:52 PM | #5 | 
| Junior Member  Posts: 8 Karma: 10 Join Date: Sep 2025 Device: Kindle 10th Generation Paperwhite | 
			
			Thanks. As I said above, I just checked and it was an EPUB, definitely not a PDF conversion.
		 | 
|   |   | 
|  09-07-2025, 06:55 PM | #6 | 
| Resident Curmudgeon            Posts: 80,677 Karma: 150249619 Join Date: Nov 2006 Location: Roslindale, Massachusetts Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3 | |
|   |   | 
|  09-07-2025, 08:00 PM | #7 | 
| Wizard            Posts: 1,683 Karma: 9500498 Join Date: Sep 2021 Location: Australia Device: Kobo Libra 2 | 
			
			@Globe Trotsky Careful how you answer the question "Where did the epub come from". It has no bearing on your original question of how to fix the problem. There be wolves among the sheep. | 
|   |   | 
|  09-07-2025, 08:15 PM | #8 | 
| Grand Sorcerer            Posts: 13,969 Karma: 243829945 Join Date: Jan 2014 Location: Estonia Device: Kobo Sage & Libra 2 | 
			
			If it's a bad PDF conversion (and it almost certainly is, by the looks of it), there's no fixing it.
		 | 
|   |   | 
|  09-07-2025, 09:08 PM | #9 | |
| Wizard            Posts: 1,683 Karma: 9500498 Join Date: Sep 2021 Location: Australia Device: Kobo Libra 2 | Quote: 
 If you are prepared to read and fix at the same time, it can be done. Even easier if the original book is available as a reference. A few regex to catch the split sentences are easy enough. It's all the other annoying missing italics, replaced characters, broken words and spurious code that are more difficult/impossible to bulk fix. (no idea why an edit to my post caused a second post to appear) | |
|   |   | 
|  09-07-2025, 09:14 PM | #10 | |
| Bibliophagist            Posts: 47,992 Karma: 174315100 Join Date: Jul 2010 Location: Vancouver Device: Kobo Sage, Libra Colour, Lenovo M8 FHD, Paperwhite 4, Tolino epos | Quote: 
 | |
|   |   | 
|  09-07-2025, 09:19 PM | #11 | 
| null operator (he/him)            Posts: 22,006 Karma: 30277294 Join Date: Mar 2012 Location: Sydney Australia Device: none | 
			
			It's almost certainly a low-quality OCR image scan of ink on paper (probably with an ancient version of ABBYY - probably pirated), that was saved as a PDF.  Professionally created PDFs using Acrobat, DTP (InDesign, even Quark), or WP (MS Word, LO Writer, WordPerfect) don't break paragraphs like that. One way to deal with it is to convert to txt, open with a decent text editor (e.g. Vim, TextPad, Notepad++) and correct using regex. Then open the corrected text file in one of the WP apps mentioned above, style front-matter, headings, bibliography etc. as appropriate, save as DOCX and get calibre to convert that to EPUB. If you use WordPerfect you can save as EPUB directly, which does a better job of mapping its styling to the EPUB CSS. There are also useful add-ins for Word - EPUB Tools (it is in the MR Workshop forum), and Transtools, which has an excellent Unbreaker tool. BR | 
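For the convert-to-txt route, the regex step might look something like the sketch below. It assumes the text file uses blank lines between real paragraphs and that a spurious break is a single newline after a line that does not end in punctuation, followed by a line starting in lowercase. The names `SENTENCE_END`, `BROKEN_LINE` and `unwrap_text` are hypothetical, not taken from any of the tools mentioned.

```python
import re

# Characters treated as a legitimate end of line. (Assumed list; adjust as needed.)
SENTENCE_END = r""".!?:;"'”’)\]"""

# A single line break after a non-punctuation character, followed by a
# lowercase letter. Blank lines are left alone because the character before
# the second newline is itself whitespace.
BROKEN_LINE = re.compile(
    r"(?<![" + SENTENCE_END + r"\s])"   # line does not end in punctuation
    r"[ \t]*\n[ \t]*"                   # the break plus any stray spaces or tabs
    r"(?=[a-z])"                        # next line continues in lowercase
)

def unwrap_text(text: str) -> str:
    """Join lines that look like they were split mid-sentence."""
    return BROKEN_LINE.sub(" ", text)

if __name__ == "__main__":
    sample = "It was a dark and\nstormy night.\n\nthe next paragraph starts here."
    print(unwrap_text(sample))
```

The same pattern can be pasted into the regex search of an editor such as Notepad++ or Vim after adapting it to that editor's syntax. Words hyphenated across a break are not handled here, since rejoining them blindly would also damage genuine hyphenated compounds.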
|   |   | 
|  09-09-2025, 10:05 AM | #12 | 
| Resident Curmudgeon            Posts: 80,677 Karma: 150249619 Join Date: Nov 2006 Location: Roslindale, Massachusetts Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3 | 
			
			When I did a search for this book, this is what I found. The Color of Christ The Son of God and the Saga of Race in America Edward J. Blum, Paul Harvey https://www.perlego.com/book/538115/...in-america-pdf It says it's an ePub eBook. But if it's really from a PDF, then that would explain why it's so messed up. Last edited by JSWolf; 09-09-2025 at 10:09 AM. | 
|   |   | 
|  09-09-2025, 02:58 PM | #13 | 
| Junior Member  Posts: 8 Karma: 10 Join Date: Sep 2025 Device: Kindle 10th Generation Paperwhite | 
				
				EPUB
			 
			
			It's definitely an EPUB, which was downloaded from Z-Library. Screenshot attached.
		 | 
|   |   | 
|  09-09-2025, 03:04 PM | #14 | |
| Junior Member  Posts: 8 Karma: 10 Join Date: Sep 2025 Device: Kindle 10th Generation Paperwhite | Quote: 
 | |
|   |   | 
|  09-09-2025, 03:10 PM | #15 | 
| Bibliophagist            Posts: 47,992 Karma: 174315100 Join Date: Jul 2010 Location: Vancouver Device: Kobo Sage, Libra Colour, Lenovo M8 FHD, Paperwhite 4, Tolino epos | 
			
			For what it may be worth, I found the book on Amazon and Kobo and downloaded a sample from each. The samples had neither the page numbers nor the extraneous line breaks. Kobo CA: https://www.kobo.com/ca/en/ebook/the-color-of-christ-1 Amazon CA: https://www.amazon.ca/Color-Christ-S.../dp/B009DH7YR8 That the screenshots posted by the OP were from a PDF to ePub conversion seems an inescapable conclusion. Since the source mentioned by the OP in a later message was Z-Library, a rather well-known pirate site, I'm out of this discussion. Am I a nasty, suspicious person? Very likely. Decades spent in IT do that to a person. Last edited by DNSB; 09-09-2025 at 03:31 PM. Reason: Added links to the book on Amazon and Kobo, added jolly roger to images | 
|   |   | 