Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Formats > ePub

Notices

Reply
 
Thread Tools Search this Thread
Old 05-31-2022, 04:47 AM   #1
Shohreh
Zealot
Shohreh can program the VCR without an owner's manual.Shohreh can program the VCR without an owner's manual.Shohreh can program the VCR without an owner's manual.Shohreh can program the VCR without an owner's manual.Shohreh can program the VCR without an owner's manual.Shohreh can program the VCR without an owner's manual.Shohreh can program the VCR without an owner's manual.Shohreh can program the VCR without an owner's manual.Shohreh can program the VCR without an owner's manual.Shohreh can program the VCR without an owner's manual.Shohreh can program the VCR without an owner's manual.
 
Posts: 148
Karma: 192898
Join Date: Jan 2016
Device: none
Question [pandoc] How to locate HTML file from wget?

Hello,

I like to read long web pages on my e-reader.

I use the following commands to create an ePUB file:

Code:
wget -E -H -k -K -p -e robots=off https://www.acme.com/blah.html

OPTIONAL iconv -f iso-8859-1 -t utf-8 blah.html> blah.UTF8.html

pandoc -t epub2 -o blah.epub blah.UTF8.html
The problem is finding where wget downloads the main HTML file and how it's named, lost somewhere in all those directories that contain the different files needed for offline reading. The goal is to automate the process through a batch file.

Do you know of a solution?

Thank you.
Shohreh is offline   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
HTML input plugin stripping text within toc tags in child html file nimblebooks Conversion 3 02-21-2012 03:24 PM
How to use wget to download an online HTML book amoroso Lounge 11 04-25-2011 05:10 AM
Converting pandoc generated HTML to ePUB with Calibre Wintermute Conversion 2 04-15-2011 01:25 PM
Convert HTML to MOBI (HTML recognized as ZIP file) pdubois Conversion 1 01-25-2011 12:55 PM
html tree via wget -> epub (or other format) maynard Workshop 4 05-13-2009 06:05 PM


All times are GMT -4. The time now is 01:21 PM.


MobileRead.com is a privately owned, operated and funded community.