Quote:
Originally Posted by meghane_e
These are downloaded unprocessed files (using Firefox>File>Save), which means login request info is still on it, along with wrappers, comments, etc. If I use BeautifulSoup to strip out the chapter name (<article>.<h3>) and story content (<article>.<p>) in each file and stuff those in a simple HTML page, could it be used to then build a book?
|
I guess my question might boil down to, do I need to write an adapter for my local files from each site? There are dozens of chapters, each in a separate file, per book, which is why I ask if an adapter is what I would want.
Also, the file name format in my first post is slightly off
file:///D:/Scripts/Python/test%20dirs/Dark%20Earth%201%20-%20Dark%20Champion/Dark%20Champion%20Chapter%201.html
so ...
file:///D:/scripts/python/test%20dirs/<series - book name>/<book name>%20Chapter%20<#>
Thanks!