Thanks for the suggestion, edembowski.
Yes, I thought about using sed/awk or Perl to accomplish this. I will need to spend sometimes learning how to use these tools first. In order to access ProQuest, I have to login via my library website. You're are right about it using cookies. When I directly fetch the site via RSS Feed which is an available options, I was prompt for a login and username which I don't have. Hence, the only to access the document is via my library's redirection.
For now, I am abled to get all the articles and create a table of content using Calibre. This include removing excess pages elements that I don't want manually. Then download a local copy and combine them using ScrapBook (Mozilla's Plugins). Also, add a keyword so calibre can search at the beginning of each article and create TOC. It is a clumsy hack. Also, due to more than 150 items in the TOC; it takes at least a few minutes to jump from document to TOC and then back. This is a some what clumsy hack.
In the future attempt, I might reduce the number of articles that I want to download dued to the fact that I can't really read all of them in one day.
In any case, thanks for the tips. I will post any additional progress later.
|