#1 |
Zealot
Posts: 131
Karma: 150390
Join Date: Nov 2011
Location: Pacific NorthWest
Device: Kindle Fire
|
Setting actual content?
I am writing a recipe for a somewhat complex website. For one set of "articles", I wish to handle the parsing myself or give it much closer attention than the others.
My recipe generates a list of articles inside parse_index(); most of these have empty content elements and appropriate URLs. But the URLs do not point to print editions (as documented here), so I want to do additional clean-up and munging, and then set the content on some of them. I have to dive into their contents anyhow to correctly extract useful titles, so getting down to a relevant table or div isn't much extra effort, and should eliminate unwanted ads.

Initially I thought I could set the content element of the articles that are returned by parse_index(), but that doesn't work; it looks like it's only used for the nebulous FullContentProfile, which isn't referenced anywhere else.

I'm probably missing a pretty key concept. How can I use the parse_index() processing for most of the "feeds" and yet provide article text for some? (Alternatively, how can I know which tuple Title I'm looking at in preprocess_html(), if that's really the appropriate solution? It seems less obvious to soup it and then wait for it to be processed again.)

Thanks!

Last edited by TechnoCat; 01-06-2012 at 08:20 PM.
#2 |
creator of calibre
Posts: 45,304
Karma: 27111242
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Use any of the methods: preprocess_html, preprocess_raw_html, postprocess_html
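
As a rough illustration of the preprocess_raw_html route: that hook receives both the page source and the URL it was downloaded from, so the URL can serve as the key for deciding which articles get special treatment. The sketch below is only one way this might look, written with the standard library so it runs outside calibre. The names ContainerExtractor, trim_article, special_urls and the container id "story" are hypothetical, and it assumes the URL handed to the hook matches the one returned from parse_index() (a redirect could break that).

```python
from html.parser import HTMLParser

# Void elements never get a closing tag, so they must not change nesting depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class ContainerExtractor(HTMLParser):
    """Collect the markup of the first element whose id equals target_id."""

    def __init__(self, target_id):
        super().__init__(convert_charrefs=False)
        self.target_id = target_id
        self.depth = 0       # nesting depth inside the target element
        self.done = False    # True once the target element has been closed
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if self.done:
            return
        if self.depth == 0 and dict(attrs).get("id") != self.target_id:
            return
        self.parts.append(self.get_starttag_text())
        if tag not in VOID_TAGS:
            self.depth += 1

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags such as <br/> are copied but do not affect depth.
        if self.depth > 0:
            self.parts.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        if self.depth == 0 or tag in VOID_TAGS:
            return
        self.parts.append("</%s>" % tag)
        self.depth -= 1
        if self.depth == 0:
            self.done = True

    def handle_data(self, data):
        if self.depth > 0:
            self.parts.append(data)

    def handle_entityref(self, name):
        if self.depth > 0:
            self.parts.append("&%s;" % name)

    def handle_charref(self, name):
        if self.depth > 0:
            self.parts.append("&#%s;" % name)


def trim_article(raw_html, url, special_urls, container_id="story"):
    """Keep only the chosen container for special URLs; pass others through."""
    if url not in special_urls:
        return raw_html
    parser = ContainerExtractor(container_id)
    parser.feed(raw_html)
    parser.close()
    if not parser.done:
        return raw_html  # container missing or unclosed: leave the page alone
    return "<html><body>%s</body></html>" % "".join(parser.parts)


# Inside a recipe this might be wired up roughly as follows (untested sketch):
#
#   def parse_index(self):
#       ...  # build the feeds; add URLs needing special care to self.special_urls
#
#   def preprocess_raw_html(self, raw_html, url):
#       return trim_article(raw_html, url, self.special_urls)
```

Inside preprocess_html the same trimming could be done on the soup directly; the point here is only that articles from parse_index() keep their empty content, and the per-article clean-up happens when each page is downloaded.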