01-17-2008, 03:50 AM | #136 |
Member
Posts: 16
Karma: 10
Join Date: Sep 2007
Device: PRS-500
|
the problem is that they are being converted, so they produce unknown tags like:
Code:
<ThatsMyXMLTag> text inside my tag </ThatsMyXMLTag> Code:
<ThatsMyXMLTag> text inside my tag </ThatsMyXMLTag> I saw in demo.html file that you put this into <code> tags, that's why I was trying this regex... thanks! |
01-17-2008, 10:58 AM | #137 |
creator of calibre
Posts: 43,967
Karma: 22669822
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Unknown tags in an HTML file are ignored, i.e. html2lrf treats <unknown>some text</unknown> as some text. So I don't think that is the problem. Are the < entities in the HTML or the RSS feed itself?
|
Advert | |
|
01-18-2008, 06:04 AM | #138 |
Member
Posts: 16
Karma: 10
Join Date: Sep 2007
Device: PRS-500
|
in RSS feed itself see this feed for example:
http://feeds.feedburner.com/netslave Add or remove the www sub domain post contains lots of source code all sections like: Code:
<httpModules> <add type="WwwSubDomainModule" name="WwwSubDomainModule" /> </httpModules> even the Code:
/// <summary> /// Handles the BeginRequest event of the context control. /// </summary> /// <param name="sender">The source of the event.</param> Code:
/// Handles the BeginRequest event of the context control. |
01-18-2008, 11:05 AM | #139 |
creator of calibre
Posts: 43,967
Karma: 22669822
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
preprocess_regexps only acts n the downloaded HTML files. not on the RSS file iteslf. If you want to change the handling on the <description> tag in the RSS file do two things
set Code:
html_description = True |
01-18-2008, 11:43 AM | #140 |
Member
Posts: 16
Karma: 10
Join Date: Sep 2007
Device: PRS-500
|
I'm not concerned about description, my only problem is that some lines are missing from output LRF file, but as you say I'll try to override process_html and see how it goes.
Thanks! |
Advert | |
|
01-18-2008, 11:47 AM | #141 |
creator of calibre
Posts: 43,967
Karma: 22669822
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
The contents of the LRF file are taken from the <description> tag, so you should be concerned about it
|
01-21-2008, 02:19 PM | #142 |
Junior Member
Posts: 3
Karma: 10
Join Date: Jan 2008
Location: Hamburg, Germany
Device: PRS-505
|
Full Feed
Hi Kovid,
is it possible to use web2lrf with a full feed? For example, all Feedburner feeds have <content:encoded>-tags containing the whole article. Therefore, it is unnecessary to look for a print version of the article and preprocess the HTML. How can I get web2lrf to use the <content:encoded> instead of the article's URL? I tried to set the "html_description" property to true and reimplement the parse_feed function to use the <content:encoded>-tag instead of <description>. This worked, but it's complicated and it's impossible to look over the articles quickly because there is no table of contents with links to the full article. Dominik |
01-21-2008, 02:21 PM | #143 |
creator of calibre
Posts: 43,967
Karma: 22669822
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Support for content-embedded feeds is on my TODO list. It's now added in svn, will be in the next release.
Last edited by kovidgoyal; 01-22-2008 at 12:13 PM. |
01-24-2008, 05:18 AM | #144 | |
Member
Posts: 16
Karma: 10
Join Date: Sep 2007
Device: PRS-500
|
Quote:
btw. I've seen new version of DefaultProfile in svn - is there a way to force existing version of web2lrf to use it? |
|
01-24-2008, 12:23 PM | #145 |
creator of calibre
Posts: 43,967
Karma: 22669822
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Not easily. The new release should be out soon.
|
01-24-2008, 01:08 PM | #146 |
Member
Posts: 16
Karma: 10
Join Date: Sep 2007
Device: PRS-500
|
thanks, I'll probably wait then
|
01-24-2008, 03:10 PM | #147 |
creator of calibre
Posts: 43,967
Karma: 22669822
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Released v0.4.34 with a GUI for adding custom profiles and support for content embedded profiles via the class FullContentProfile
|
01-24-2008, 04:28 PM | #148 |
Member
Posts: 16
Karma: 10
Join Date: Sep 2007
Device: PRS-500
|
|
01-24-2008, 04:39 PM | #149 | |
Junior Member
Posts: 3
Karma: 10
Join Date: Jan 2008
Device: Sony 505
|
Hello everyone...
New to the forums. Just downloaded the software... gonna give it a go now. Will update soon. Thank you for this! -- AJ / EDIT UPDATE::: Ok, installed it. 1- Thanks! This thing is nifty! 2- Wow! I just found out the Reader has 200MB internal. =\ 3- FINALLY I can edit the meta data of the files... w00t w00t!! ! 4- I went to add a directory on another hdd (non C:\>), and it error'd me: Quote:
(no programing skills) Last edited by AJ@PR; 01-24-2008 at 04:54 PM. |
|
01-24-2008, 05:07 PM | #150 |
Junior Member
Posts: 3
Karma: 10
Join Date: Jan 2008
Device: Sony 505
|
^^^ Ok, no worries!
Everything works fine... well, "fine" being a relative term. I love the anti-gravity sand. Again, THANKS! |
Tags |
libprs500, web2lrf |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
web2lrf to capture blog archive? | Deputy-Dawg | Sony Reader Dev Corner | 1 | 02-14-2008 11:41 PM |
web2lrf: La Repubblica | alexxxm | Sony Reader | 1 | 11-13-2007 12:27 PM |