|
|
#1 |
|
Junior Member
![]() Posts: 4
Karma: 10
Join Date: Apr 2012
Device: Kindle
|
Article not added when specifying content string
Hello,
I am writing a new recipe and the newspaper site has the full content of ten articles on one HTML page. I iterate through the page and append each article with an empty url but with full content, but these articles are silently skipped, leaving an empty section in the e-book. Here is the code: Code:
for post in ts.findAll('h1'):
title = self.tag_to_string(post)
self.log(title)
url = ''
date = ''
content = self.tag_to_string(post.findNextSibling('p'))
desc = content
articles.append({'title':title, 'url':url, 'date':date, 'description':desc,
'content':content})
|
|
|
|
|
|
#2 |
|
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,608
Karma: 28549044
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
content no longer works (it refers to an obsoleteted API). Instead save your html into temporary files and pass a file:///path/to/temp/file.html as the url.
|
|
|
|
|
|
#3 |
|
Junior Member
![]() Posts: 4
Karma: 10
Join Date: Apr 2012
Device: Kindle
|
Thank you! Will do exactly that.
|
|
|
|
|
|
#4 |
|
Junior Member
![]() Posts: 4
Karma: 10
Join Date: Apr 2012
Device: Kindle
|
That worked perfectly. Now the final barrier to sharing my new recipe is that one image, whose name has a space in it, is not being retrieved, and is showing a broken image in the resulting e-book. Here is some debug output:
Code:
Processing images... Fetching http://www.southernstar.ie/scripts/imgsize.php?w=300&img=../images/news/1312c41.jpg Processing images... Fetching http://www.southernstar.ie/scripts/imgsize.php?w=300&img=../images/news/Rachel MCCarthya.jpg Traceback (most recent call last): File "site-packages/calibre/web/fetch/simple.py", line 369, in process_images File "site-packages/PIL/Image.py", line 1980, in open IOError: cannot identify image file Is there anything I can do about this? |
|
|
|
|
|
#5 |
|
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,608
Karma: 28549044
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
url escape the space
|
|
|
|
|
|
#6 |
|
Junior Member
![]() Posts: 4
Karma: 10
Join Date: Apr 2012
Device: Kindle
|
Recipe for The Southern Star
Thanks; done; works. Attached please find a completed and working recipe for The Southern Star, a regional weekly newspaper since 1889 from County Cork, Ireland. Tested on Mac and Windows Vista.
|
|
|
|
![]() |
| Thread Tools | Search this Thread |
|
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Mathch a string while ignoring some character in that string? | ElMiko | Sigil | 12 | 12-01-2011 11:05 PM |
| I hate added content to pbook versions | jhempel24 | General Discussions | 1 | 09-12-2011 02:57 AM |
| Tip: Article Date needs to be Unicode String | spedinfargo | Recipes | 0 | 02-19-2011 08:08 PM |
| PDF -> MOBI: a string is added to the bottom of each page | falconfoxxx | Calibre | 3 | 09-14-2010 02:28 AM |
| Search for files by content string? | nekokami | iRex | 4 | 12-01-2006 01:14 PM |