Quote:
Originally Posted by Ser4nb2LUY6e
webnovels.com hat another updated and the author parsing will need to be fixed.
Also, there are still escaping issues. As an example, in Soaring of Galaxia, Chapter 236, looking in the browser you can find the sentence "This sword move was the first move from the <Three Sword Moves to Extinguish the Sky> that he had passed down to Wei Yi." in the epub it shows up as "This sword move was the first move from the that he had passed down to Wei Yi."
|
That's the site's fault, not mine. The content is marked as "rich format", but whoever uploaded the actual chapter content didn't properly escape '<' and '>' within the HTML. To give you an idea:
PHP Code:
<p>This sword move was the first move from the <Three Sword Moves to Extinguish the Sky> that he had passed down to Wei Yi. It incorporated the use of the natural power directly to the sword light and would bring out the offensive power of metal elements. The destructive power was indeed, as powerful as swallowing the sky and earth and invincible!</p>
As you can see there are proper HTML tags in the raw chapter content, however the less-than- and greater-than-signs are not properly escaped; actually it should look like this: "<Three Sword Moves to Extinguish the Sky>". There is nothing I can do to fix this, except rely on heuristics and check potentially wrong tags against a list of all known HTML5 tags, and then replace the given tag with a properly escaped string. I'm not sure when I'll have time to do this, but I bet this is one of the very few stories (possibly even one of the only chapters) on the site, that has this issue.
EDIT: There's no easy way to apply the heuristic talked above, because after parsing the HTML chapter content using our parser the "tag" looks like this:
PHP Code:
<three extinguish="" moves="" sky="" sword="" the="" to=""> that he had passed down to Wei Yi. It incorporated the use of the natural power directly to the sword light and would bring out the offensive power of metal elements. The destructive power was indeed, as powerful as swallowing the sky and earth and invincible!</three>
As you can see, the original order of the words and their casing is lost. I also can't find a way to find the original tokens from which this node was created. So yeah, you probably won't see this problem often: in fact if you search for "<Coagulated Sword>" you can see these and probably various other instances escaped properly.