08-19-2010, 03:21 PM   #1
Wintersdark
Removing unnecessary line breaks in books.

I have a great many books that were converted from .lit to .epub to be Stanza-friendly, since I read on my iPhone.

My problem is that a very large number of my books (hundreds) have line breaks scattered throughout paragraphs. Right now I convert these epubs to another format and manually edit them to remove the line breaks, but this is very impractical given the number of books.

Is there a way to have Calibre apply a regex search/replace to remove these line breaks on conversion so I could bulk-convert everything at once?

I figure that searching for "^13([a-z])" (a line break followed by a lowercase letter) and replacing with " \1" (a space plus that same letter) will work often enough to make the text readable at least. It will miss some cases, but it's really good enough, though I'm open to better ways to do it of course.
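
To show what I mean, here is a rough sketch of that search/replace in Python. This is not Calibre's own mechanism, just an illustration: the function name `join_broken_lines` is made up, and it assumes the breaks are plain newline characters in the text rather than HTML tags.

```python
import re

# Hypothetical helper, not part of Calibre. It assumes a line break
# followed by a lowercase letter falls in the middle of a paragraph,
# since a real paragraph would normally start with a capital letter.
def join_broken_lines(text):
    # Swallow any trailing spaces/tabs plus the line break (LF or CRLF),
    # then put back a single space and the captured lowercase letter.
    return re.sub(r"[ \t]*\r?\n([a-z])", r" \1", text)

sample = "The quick brown\nfox jumped.\nThen it slept."
print(join_broken_lines(sample))
```

Breaks followed by a capital letter are left alone, so a sentence that happens to wrap right before a name or "I" would still be missed.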

Could I use the header/footer removal settings in calibre's Convert Books regexes to achieve this? Or is that removal only, not replacing?

Thanks!!

Derrick