Quote:
Originally Posted by Feynor
...
I know where it fails; I just need to figure out how best to fix it, since it is really something AO3 is doing. However, others might be doing the same thing.
Heuristics are hard!
|
Are you ready for a test version of your heuristics code to be posted for others to try?
Quote:
Originally Posted by Feynor
P.S.: Jimm, I pushed a modified hpfanficarchive adapter. I found that it left HTML comments in the story body, which would then become a garbled mess later. I don't know if that was something my br_to_p function did, but the result was that, for instance, Google Books wouldn't parse the output. So I simply remove all HTML comments; they serve no purpose in a story anyway.
|
Please post a story URL or two that showed the problem. I've seen comment issues with other sites and came up with a different solution (using the BS4 parser). I also fear your comment replacement code is too aggressive. Won't it remove all of "<!-- comment --> story <!-- comment -->", not just the comments?
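
To make the concern concrete, here is a minimal Python sketch. It assumes the replacement is a regex using a greedy `.*`; the actual pattern in the adapter isn't shown in this thread, so both patterns and both function names below are hypothetical.

```python
import re

html = "<!-- comment --> story <!-- comment -->"

# Hypothetical greedy pattern: '.*' runs on to the LAST '-->' in the text,
# so everything between the first '<!--' and the last '-->' is matched.
GREEDY = re.compile(r"<!--.*-->", re.DOTALL)

# Non-greedy variant: '.*?' stops at the FIRST '-->' after each '<!--'.
LAZY = re.compile(r"<!--.*?-->", re.DOTALL)


def strip_comments_greedy(text):
    """Remove comments with the greedy pattern (also eats the story text)."""
    return GREEDY.sub("", text)


def strip_comments_lazy(text):
    """Remove each comment individually, leaving the text between them."""
    return LAZY.sub("", text)


print(repr(strip_comments_greedy(html)))  # ''
print(repr(strip_comments_lazy(html)))    # ' story '
```

If the adapter's pattern is already non-greedy, this isn't an issue. A parser-based route avoids the regex question altogether: BeautifulSoup 4 exposes comments as `bs4.Comment` nodes, which can be found and removed with `extract()`.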