08-27-2011, 05:23 AM | #1 |
Junior Member
Posts: 1
Karma: 10
Join Date: Aug 2011
Device: pocket edge
|
Converting scholarly articles
I spend an inordinate amount of my time reading scholarly articles on my ereader.
PDFs on eink are OK-ish, but not that great (no or bad reflow, formatting intended for printing, sometimes terrible choice of font, etc). I'd need to convert my article collection to epubs. Converting from PDF to epub is notoriously finicky, so I'd like to convert the html available from the publisher's website, using a recipe to trim down the fat (headers, footers, generally useless crap), convert tables, and use a decent-resolution image for figures. From the GUI, I tried to use the newsreader interface to download and convert an article ; however (unsurprisingly, since it's how it was designed) it only accepts RSS feeds as an input. (or, am I missing something?) I nonetheless managed to download the page in question, but I ended up with raw HTML in my epub. Apparently, not the right approach... I could use wget to download the page (and maybe the css files?), but I would need to do some additional processing to find which image files I need to download and incorporate - thus negating a big advantage of recipes. Another option might be to use web2disk to download the webpage + recursion to 1 or 2 levels, and then convert from html to epub? So, to make a long story short: is there a way to use a recipe on a webpage that is not a RSS feed? Or another way to address my problem? |
08-28-2011, 05:51 PM | #2 |
Enthusiast
Posts: 25
Karma: 1896
Join Date: Aug 2011
Device: Kindle 3
|
Hello. Yes, there is, using the parse_index feature. See the recipe forTHE ATLANTIC as an example...
|
Advert | |
|
08-29-2011, 05:40 AM | #3 |
Wizard
Posts: 1,669
Karma: 2300001
Join Date: Mar 2011
Location: Türkiye
Device: Kindle 5.3.7
|
or you can make your own rss with http://open.dapper.net/dapp-factory.jsp , https://feedburner.google.com/ ,...etc
|
09-02-2011, 02:57 PM | #5 | |
Enthusiast
Posts: 25
Karma: 1896
Join Date: Aug 2011
Device: Kindle 3
|
Quote:
Another (similar) option is to install a Firefox Extension called "Read it later", and use the recipe for it, that's built-in in Calibre. |
|
Advert | |
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Converting academic PDF journal articles for Kindle | zarbiya123 | Calibre | 12 | 12-13-2011 10:06 PM |
Scholarly Papers - tips? | fcoulter | Calibre | 8 | 06-16-2011 12:14 PM |
Device for reading scholarly journals | Cheez-It | Which one should I buy? | 35 | 10-12-2009 05:29 AM |
Best eBook store for scholarly texts | thibaulthalpern | News | 3 | 03-17-2009 10:53 PM |
Scholarly publications better in electronic form | Bob Russell | News | 29 | 11-09-2007 02:14 PM |