Quote:
Originally Posted by kovidgoyal
You should be able to replace all those regexes with a single regex that simply strips the head section. Why is there the regext that strips code before the opening <head>?
|
Because if not I get a garbage html, with incorrectly processed html5 header content. That is the real reason I did all that stuff. For some reason calibre fails to properly cleanup html5 as generated by la repubblica economic section.