Quote:
Originally Posted by Tommy
pretty cool, your hack downloads the pages behind the feeds, mine does no longer. I removed this feature, as I failed to nicely de-htmlise the pages. I could strip off the tags, but I could find a means to extract the "content of the article", and so all the nav-bars, ads etc were still present. The output - especially the LaTeX - was just ugly
But if there's someone interested in this feature, just let me know...
|
Well... Thats the reason why I avoided Latex as the intermediate file. I use htmldoc to produce a temporary pdf and then glue the pdfs together and add links to each page. Its an evolution of my perl script which does the same thing, but outputs in html.
When I get a moment I'll post up the php file that does this...