MobileRead Forums - View Single Post

meghane_e · 08-20-2018, 09:43 PM

Quote:

Originally Posted by meghane_e

These are unprocessed files saved straight from the browser (Firefox > File > Save), so each one still carries the login request info, wrappers, comments, and so on. If I use BeautifulSoup to pull the chapter name (<article>.<h3>) and the story content (<article>.<p>) out of each file and put them in a simple HTML page, could that page then be used to build a book?
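To make the idea concrete, here is a rough sketch of that extraction step. It assumes each saved page has a single <article> element holding one <h3> chapter title and the story text in <p> tags, as described above. The function names `extract_chapter` and `build_simple_page` are made up for illustration and are not part of any existing tool.

```python
from html import escape

from bs4 import BeautifulSoup


def extract_chapter(raw_html):
    """Return (title, paragraphs) from one saved chapter page.

    Assumes the layout described in the post: <article> > <h3> for the
    chapter name and <article> > <p> for the story text.
    """
    soup = BeautifulSoup(raw_html, "html.parser")
    article = soup.find("article")
    if article is None:
        raise ValueError("no <article> element found")
    heading = article.find("h3")
    title = heading.get_text(strip=True) if heading else "Untitled"
    # Collapse runs of whitespace so each paragraph is one clean line.
    paragraphs = [" ".join(p.get_text().split()) for p in article.find_all("p")]
    return title, paragraphs


def build_simple_page(title, paragraphs):
    """Wrap the extracted text in a bare-bones HTML page."""
    body = "\n".join(f"<p>{escape(p)}</p>" for p in paragraphs)
    return (
        "<!DOCTYPE html>\n"
        f'<html><head><meta charset="utf-8"><title>{escape(title)}</title></head>\n'
        f"<body>\n<h3>{escape(title)}</h3>\n{body}\n</body></html>\n"
    )
```

Note that `get_text()` throws away inline markup such as italics, so this version keeps plain text only. Everything outside <article> (login info, wrappers, comments) is simply never copied over.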

I guess my question boils down to this: do I need to write an adapter for my local files from each site? Each book has dozens of chapters, each in a separate file, which is why I'm asking whether an adapter is what I want.

Also, the file name format I gave in my first post is slightly off. A real example:

file:///D:/Scripts/Python/test%20dirs/Dark%20Earth%201%20-%20Dark%20Champion/Dark%20Champion%20Chapter%201.html

so ...

file:///D:/scripts/python/test%20dirs/<series - book name>/<book name>%20Chapter%20<#>
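With that naming pattern, the chapter files for one book could be put in reading order along these lines. This is only a sketch: it assumes every chapter file name contains "Chapter <#>" and ends in .html, as in the example above, and `chapter_number` and `chapter_files` are hypothetical helper names.

```python
import re
from pathlib import Path
from urllib.parse import unquote

# Matches "Chapter 1", "Chapter 12", etc. in a decoded file name.
CHAPTER_RE = re.compile(r"Chapter\s+(\d+)", re.IGNORECASE)


def chapter_number(name):
    """Pull the chapter number out of a file name or file:// URL."""
    # unquote() turns %20 back into a space, so URLs and plain names both work.
    match = CHAPTER_RE.search(unquote(name))
    if match is None:
        raise ValueError(f"no chapter number in {name!r}")
    return int(match.group(1))


def chapter_files(book_dir):
    """List a book folder's saved chapter files in reading order."""
    files = [p for p in Path(book_dir).glob("*.html") if CHAPTER_RE.search(p.name)]
    # Sort numerically so Chapter 10 comes after Chapter 2, not before it.
    return sorted(files, key=lambda p: chapter_number(p.name))
```

Sorting on the number rather than the raw name matters because a plain alphabetical sort would put "Chapter 10" ahead of "Chapter 2".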

Thanks!