View Single Post
Old 11-17-2017, 05:30 PM   #2537
mehetabelo
e-Bibliophile
mehetabelo began at the beginning.
 
mehetabelo's Avatar
 
Posts: 60
Karma: 10
Join Date: Jun 2009
Location: California
Device: Paperwhite 1-3, Kobo AuraHD, Boox Afterglow2
I know that this is probably not an easy thing to do - but I have a suggestion for adding an option into the interface.

Somewhere, it would be *really* helpful to have a regex option for find/replace that directly applies to the html. This suggestion is based on spending several hours (maybe even days) of time working through some of the newer sites and fixing the html particularly the paragraph setup so that they show properly on my ereaders. It's not the fault of FFF, but rather the shitty HTML on the sites.

I've been running into having multiple <br> lines, or an extra <p> </p> between, or even a paragraph with <p> </p> as the separator (but not actually showing identifying the paragraph) an example of the last is:
<p> </p>
Paragraph X.
<p> </p>
Paragraph Y.
<p> </p>
Which in a kindle leads to a massive cluster because it doesn't read these as it is intended (correct or not). There are other things like &quote appears fairly commonly, instead of " symbols on some older html from webnovel. But it's weird, as it only does it sometimes and I've literally had a chapter that's good, one bad and the next good again.

Different sites do different things and I've worked out some regex replacements that I use in the convert dialog, or (mostly used) in the editor dialog.

Generally these regex are specific enough they should not effect each other, or screw up good text. My problem is anytime I get an update I have to fix the book again with the new chapters. That's why I'm asking for this option. I'm not sure exactly

Since the needed regex would likely vary a little from site to site, I'm Not sure exactly how it should be done, maybe within the preferences (probably best) or maybe separately, but it'd be good to have it available by site.

This is just a suggestion, but I believe it's a good one.

Last edited by mehetabelo; 11-17-2017 at 05:35 PM.
mehetabelo is offline   Reply With Quote