Quote:
Originally Posted by Tanjamuse
|
These two have the same problem--the description field is empty and the adapter leaves <td></td> tags around it. Calibre's sanitize_html doesn't like that combination. Change <td> to <div> and it's happy again.
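For illustration only, the swap amounts to something like the following. This is a minimal sketch, not the actual adapter code: td_to_div is a made-up helper name, and it only handles bare <td> tags without attributes.

```python
def td_to_div(fragment):
    """Swap bare <td>...</td> wrappers for <div>...</div> (hypothetical helper)."""
    return fragment.replace("<td>", "<div>").replace("</td>", "</div>")


# An empty description wrapped in table-cell tags, as described above.
empty_description = "<td></td>"
cleaned = td_to_div(empty_description)
```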
Quote:
Originally Posted by Tanjamuse
http://www.storiesofarda.com/chapterlistview.asp?SID=4342
|
This one is a different issue. The author entered bad HTML (an unmatched <a> tag) in the story description. This breaks the metadata parsing--and the page formatting as well.
I can add code to catch the exception and go on, but rating, status and/or datePublished will not be collected then. There will be a 'warning' level message in the background job log, but you will not be notified any more prominently than that.
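The catch-and-continue idea can be sketched roughly like this. It is a sketch under assumptions, not the real adapter code: collect_optional_metadata and strict_parser are hypothetical names, the parser is a toy stand-in that only counts <a> tags, and the metadata values are placeholders.

```python
import logging

logger = logging.getLogger("adapter_sketch")  # hypothetical logger name


def collect_optional_metadata(parse_details, description_html):
    """Return the optional fields, or an empty dict if parsing fails."""
    try:
        return parse_details(description_html)
    except Exception as exc:
        # Only a warning is logged; the download itself carries on.
        logger.warning("Skipping rating/status/datePublished: %s", exc)
        return {}


def strict_parser(html):
    """Toy stand-in for the real parsing: rejects an unmatched <a> tag."""
    if html.count("<a") != html.count("</a>"):
        raise ValueError("unmatched <a> tag")
    # Placeholder values, purely illustrative.
    return {"rating": "G", "status": "Completed", "datePublished": "2005-01-01"}


broken = collect_optional_metadata(strict_parser, '<a href="x">broken')
intact = collect_optional_metadata(strict_parser, '<a href="x">ok</a>')
```

With the broken description the story would still download, just without those three fields.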
Quote:
Originally Posted by Tanjamuse
Also when I try to use the webpage option for http://lcfanfic.com/index.html I get the following error:
Code:
ERROR: Unhandled exception: <b>InvalidStoryURL</b>:Bad Story URL: (http://lcfanfic.com/stories/(Year|_earliest)/html/[storyId]) for site: (lcfanfic.com) Example: (http://lcfanfic.com/stories/(Year|_earliest)/html/[storyId])
|
This is an interesting class of problem that was affecting 10 different adapters. Each adapter's getSiteExampleURLs() value has to be able to pass its own getSiteURLPattern() for geturls.get_urls_from_page() to work correctly. (Because reasons.) I didn't remember that when GComyn started putting [storyId] in examples in adapters.
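To show why a placeholder example trips this up, here is a minimal sketch. The regex is an assumed stand-in for what getSiteURLPattern() might return for this site, not the adapter's real pattern, and the concrete URL is made up for the comparison.

```python
import re

# Assumed stand-in for an adapter's getSiteURLPattern() result.
SITE_URL_PATTERN = (
    r"^https?://lcfanfic\.com/stories/(?P<year>\d{4}|_earliest)/html/(?P<id>\w+)$"
)


def example_passes_pattern(example_url, pattern=SITE_URL_PATTERN):
    """True if the example URL is accepted by the site's own URL pattern."""
    return re.match(pattern, example_url) is not None


# Example text with placeholders, as in the error message above.
placeholder_example = "http://lcfanfic.com/stories/(Year|_earliest)/html/[storyId]"
# Made-up concrete URL of the same shape.
concrete_example = "http://lcfanfic.com/stories/2002/html/somestory"

placeholder_ok = example_passes_pattern(placeholder_example)
concrete_ok = example_passes_pattern(concrete_example)
```

The placeholder text reads fine to a person, but "(Year|_earliest)" and "[storyId]" are literal characters to the matcher, so the example fails the very pattern it is meant to illustrate.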