
Go Back   MobileRead Forums > E-Book Formats > Other formats > LRF

Old 01-17-2008, 04:50 AM   #136
slav
Member
slav began at the beginning.
 
Posts: 16
Karma: 10
Join Date: Sep 2007
Device: PRS-500
the problem is that they are being converted, so they produce unknown tags like:

Code:
<ThatsMyXMLTag>  text inside my tag  </ThatsMyXMLTag>
produces in output temp html:

Code:
<ThatsMyXMLTag>   text inside my tag  </ThatsMyXMLTag>
and then web2lrf tries to convert that to lrf and nothing is displayed (at least that's what I think)

I saw in demo.html file that you put this into <code> tags, that's why I was trying this regex...

thanks!
Old 01-17-2008, 11:58 AM   #137
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.
 
Posts: 26,323
Karma: 5382313
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
Unknown tags in an HTML file are ignored, i.e. html2lrf treats <unknown>some text</unknown> as some text. So I don't think that is the problem. Are the &lt; entities in the HTML or the RSS feed itself?
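To illustrate the point about unknown tags, here is a minimal sketch using Python's standard html.parser. It is not html2lrf's own parser (whose internals may differ), and the names TextOnly and visible_text are made up for the illustration. It keeps text and skips every tag, so an element without any text content leaves nothing behind.

```python
from html.parser import HTMLParser


class TextOnly(HTMLParser):
    """Collect character data and silently skip every tag (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        # Called for the text between tags; the tags themselves are ignored.
        self.chunks.append(data)


def visible_text(markup):
    """Return only the text a tag-ignoring parser would keep."""
    parser = TextOnly()
    parser.feed(markup)
    parser.close()
    return "".join(parser.chunks)


print(visible_text("<unknown>some text</unknown>"))
print(repr(visible_text('<empty attr="x" />')))
```

The first call prints the enclosed text; the second prints an empty string, because a tag with attributes but no text has nothing to show once the tag is dropped.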
Old 01-18-2008, 07:04 AM   #138
slav
Member
slav began at the beginning.
 
Posts: 16
Karma: 10
Join Date: Sep 2007
Device: PRS-500
In the RSS feed itself; see this feed for example:
http://feeds.feedburner.com/netslave

The "Add or remove the www sub domain" post contains lots of source code.
All sections like:
Code:
<httpModules>
  <add type="WwwSubDomainModule" name="WwwSubDomainModule" />
</httpModules>
are not in the output LRF file.

even the
Code:
/// <summary> 
/// Handles the BeginRequest event of the context control. 
/// </summary> 
/// <param name="sender">The source of the event.</param>
in output LRF appears as:
Code:
/// Handles the BeginRequest event of the context control.
Note that it happens even if I don't have preprocess_regexps defined at all.
Old 01-18-2008, 12:05 PM   #139
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.
 
Posts: 26,323
Karma: 5382313
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
preprocess_regexps only acts on the downloaded HTML files, not on the RSS file itself. If you want to change the handling of the <description> tag in the RSS file, do two things:

set

Code:
html_description = True
If you still don't like the handling, override the process_html method in your subclass.
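As a rough idea of what such an override might do, the sketch below re-escapes markup-like source code so that a parser shows it as literal text instead of treating it as tags. It is plain Python, not the libprs500 API: the function name escape_code_sample is hypothetical, and how it would be called from process_html (whose exact signature is not shown in this thread) is left open.

```python
import html


def escape_code_sample(text):
    """Turn &, < and > into entities so they survive as literal text (hypothetical helper)."""
    return html.escape(text, quote=False)


sample = (
    "/// <summary>\n"
    "/// Handles the BeginRequest event of the context control.\n"
    "/// </summary>"
)

escaped = escape_code_sample(sample)
print(escaped)

# Unescaping gives back the original source, so nothing is lost.
print(html.unescape(escaped) == sample)
```

Which parts of the description count as source code (for example, everything inside <pre> or <code>) is feed-specific and would have to be decided in the profile.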
Old 01-18-2008, 12:43 PM   #140
slav
Member
slav began at the beginning.
 
Posts: 16
Karma: 10
Join Date: Sep 2007
Device: PRS-500
I'm not concerned about the description. My only problem is that some lines are missing from the output LRF file, but as you say, I'll try to override process_html and see how it goes.

Thanks!
Old 01-18-2008, 12:47 PM   #141
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.
 
Posts: 26,323
Karma: 5382313
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
The contents of the LRF file are taken from the <description> tag, so you should be concerned about it.
Old 01-21-2008, 03:19 PM   #142
Dominik
Junior Member
Dominik began at the beginning.
 
Posts: 3
Karma: 10
Join Date: Jan 2008
Location: Hamburg, Germany
Device: PRS-505
Full Feed

Hi Kovid,

is it possible to use web2lrf with a full feed? For example, all Feedburner feeds have <content:encoded>-tags containing the whole article. Therefore, it is unnecessary to look for a print version of the article and preprocess the HTML.

How can I get web2lrf to use the <content:encoded> instead of the article's URL?

I tried to set the "html_description" property to true and reimplement the parse_feed function to use the <content:encoded>-tag instead of <description>. This worked, but it's complicated and it's impossible to look over the articles quickly because there is no table of contents with links to the full article.
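For reference, pulling the full article body out of such a feed can be sketched with the standard library alone. This is not web2lrf's parse_feed: the function name full_articles and the inline sample feed are made up for the illustration. The namespace URI is the one used by the RSS content module that defines <content:encoded>.

```python
import xml.etree.ElementTree as ET

# Namespace of the RSS "content" module that defines <content:encoded>.
CONTENT_NS = {"content": "http://purl.org/rss/1.0/modules/content/"}

# Made-up feed, only here so the sketch runs without network access.
SAMPLE_FEED = """<?xml version="1.0"?>
<rss version="2.0" xmlns:content="http://purl.org/rss/1.0/modules/content/">
  <channel>
    <title>Example feed</title>
    <item>
      <title>First post</title>
      <link>http://example.com/first</link>
      <description>Short teaser</description>
      <content:encoded><![CDATA[<p>The whole article.</p>]]></content:encoded>
    </item>
  </channel>
</rss>"""


def full_articles(feed_xml):
    """Return (title, html) pairs, preferring <content:encoded> over <description>."""
    root = ET.fromstring(feed_xml)
    articles = []
    for item in root.iter("item"):
        title = item.findtext("title", default="")
        body = item.findtext("content:encoded", namespaces=CONTENT_NS)
        if body is None:
            # Fall back to the short description when no full content is embedded.
            body = item.findtext("description", default="")
        articles.append((title, body))
    return articles


for title, body in full_articles(SAMPLE_FEED):
    print(title, "->", body)
```

The collected titles could also serve as the basis for a table of contents, which is the part that was missing from the workaround described above.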

Dominik
Old 01-21-2008, 03:21 PM   #143
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.
 
Posts: 26,323
Karma: 5382313
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
Support for content-embedded feeds is on my TODO list. It's now added in svn, will be in the next release.

Last edited by kovidgoyal; 01-22-2008 at 01:13 PM.
Old 01-24-2008, 06:18 AM   #144
slav
Member
slav began at the beginning.
 
Posts: 16
Karma: 10
Join Date: Sep 2007
Device: PRS-500
Quote:
Originally Posted by kovidgoyal
Support for content-embedded feeds is on my TODO list. It's now added in svn, will be in the next release.
Any idea when this new release will be ready? I'd love to get my hands on the new web2lrf.

Btw, I've seen a new version of DefaultProfile in svn. Is there a way to force the existing version of web2lrf to use it?
Old 01-24-2008, 01:23 PM   #145
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.
 
Posts: 26,323
Karma: 5382313
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
Not easily. The new release should be out soon.
Old 01-24-2008, 02:08 PM   #146
slav
Member
slav began at the beginning.
 
Posts: 16
Karma: 10
Join Date: Sep 2007
Device: PRS-500
Thanks, I'll probably wait then.
Old 01-24-2008, 04:10 PM   #147
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.
 
Posts: 26,323
Karma: 5382313
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
Released v0.4.34 with a GUI for adding custom profiles and support for content-embedded profiles via the class FullContentProfile.
Old 01-24-2008, 05:28 PM   #148
slav
Member
slav began at the beginning.
 
Posts: 16
Karma: 10
Join Date: Sep 2007
Device: PRS-500
Old 01-24-2008, 05:39 PM   #149
AJ@PR
Junior Member
AJ@PR began at the beginning.
 
Posts: 3
Karma: 10
Join Date: Jan 2008
Device: Sony 505
Hello everyone...

New to the forums.

Just downloaded the software... gonna give it a go now.
Will update soon.
Thank you for this!

-- AJ
/



EDIT UPDATE:::
Ok, installed it.
1- Thanks! This thing is nifty!
2- Wow! I just found out the Reader has 200MB internal. =\
3- FINALLY I can edit the meta data of the files... w00t w00t!! !
4- I went to add a directory on another hdd (not C:\), and it gave me an error:
Quote:
directories
Detailed traceback:
Traceback (most recent call last):
File "main.py", line 723, in do_config
AttributeError: directories
Let me know if I can help in any way!
(no programming skills)

Last edited by AJ@PR; 01-24-2008 at 05:54 PM.
Old 01-24-2008, 06:07 PM   #150
AJ@PR
Junior Member
AJ@PR began at the beginning.
 
Posts: 3
Karma: 10
Join Date: Jan 2008
Device: Sony 505
^^^ Ok, no worries!

Everything works fine... well, "fine" being a relative term.
I love the anti-gravity sand.

Again, THANKS!

Tags
libprs500, web2lrf
