View Single Post
Old 05-25-2017, 04:44 PM   #2123
JimmXinu
Plugin Developer
JimmXinu ought to be getting tired of karma fortunes by now.JimmXinu ought to be getting tired of karma fortunes by now.JimmXinu ought to be getting tired of karma fortunes by now.JimmXinu ought to be getting tired of karma fortunes by now.JimmXinu ought to be getting tired of karma fortunes by now.JimmXinu ought to be getting tired of karma fortunes by now.JimmXinu ought to be getting tired of karma fortunes by now.JimmXinu ought to be getting tired of karma fortunes by now.JimmXinu ought to be getting tired of karma fortunes by now.JimmXinu ought to be getting tired of karma fortunes by now.JimmXinu ought to be getting tired of karma fortunes by now.
 
JimmXinu's Avatar
 
Posts: 7,008
Karma: 4604635
Join Date: Dec 2011
Location: Midwest USA
Device: Kobo Clara Colour running KOReader
Quote:
Originally Posted by Tanjamuse View Post
I have come across 3 stories I can't download from Stories of Arda, here's a list of the stories and errors:

http://www.storiesofarda.com/chapter...w.asp?SID=4538
http://www.storiesofarda.com/chapter...ew.asp?SID=275
These two have the same problem--the description field is empty and the adapter leaving <td></td> tags around it. Calibre's sanitize_html doesn't like that combination. Change <td> to <div> and it's happy again.

Quote:
Originally Posted by Tanjamuse View Post
http://www.storiesofarda.com/chapterlistview.asp?SID=4342
This one is a different issue. The author entered bad HTML (an unmatched <a> tag) in the story description. This breaks the metadata parsing--and page format:
Click image for larger version

Name:	Capture.PNG
Views:	160
Size:	217.9 KB
ID:	156999

I can add code to catch the exception and go on, but rating, status and/or datePublished will not be collected then. There will be a 'warning' level message in the background job log, but you will not be notified anymore prominently.

Quote:
Originally Posted by Tanjamuse View Post
Also when I try to use the webpage option for http://lcfanfic.com/index.html I get the following error:

Code:
ERROR: Unhandled exception: <b>InvalidStoryURL</b>:Bad Story URL: (http://lcfanfic.com/stories/(Year|_earliest)/html/[storyId]) for site: (lcfanfic.com) Example: (http://lcfanfic.com/stories/(Year|_earliest)/html/[storyId])
This is an interesting class of problem that was effecting 10 different adapters. Adapters' getSiteExampleURLs() value has to be able to pass their own getSiteURLPattern() for geturls.get_urls_from_page() to work correctly. (Because reasons.) I didn't remember that when GComyn started putting [storyId] in examples in adapters.
JimmXinu is online now   Reply With Quote