View Single Post
Old 06-20-2008, 02:31 AM   #3
OrcaBlue
Groupie
OrcaBlue knows what time it isOrcaBlue knows what time it isOrcaBlue knows what time it isOrcaBlue knows what time it isOrcaBlue knows what time it isOrcaBlue knows what time it isOrcaBlue knows what time it isOrcaBlue knows what time it isOrcaBlue knows what time it isOrcaBlue knows what time it isOrcaBlue knows what time it is
 
Posts: 189
Karma: 2190
Join Date: Aug 2007
Device: Sony PRS-500
Thanks for the suggestion, edembowski.

Yes, I thought about using sed/awk or Perl to accomplish this. I will need to spend sometimes learning how to use these tools first. In order to access ProQuest, I have to login via my library website. You're are right about it using cookies. When I directly fetch the site via RSS Feed which is an available options, I was prompt for a login and username which I don't have. Hence, the only to access the document is via my library's redirection.

For now, I am abled to get all the articles and create a table of content using Calibre. This include removing excess pages elements that I don't want manually. Then download a local copy and combine them using ScrapBook (Mozilla's Plugins). Also, add a keyword so calibre can search at the beginning of each article and create TOC. It is a clumsy hack. Also, due to more than 150 items in the TOC; it takes at least a few minutes to jump from document to TOC and then back. This is a some what clumsy hack.

In the future attempt, I might reduce the number of articles that I want to download dued to the fact that I can't really read all of them in one day.

In any case, thanks for the tips. I will post any additional progress later.
OrcaBlue is offline   Reply With Quote