02-18-2023, 04:52 PM | #1 |
Junior Member
Posts: 3
Karma: 10
Join Date: Feb 2023
Location: New York City
Device: Kobo Clara HD
|
How to convert a webpage article into an EPUB file
Hello from New York City!
Is it possible to create an EPUB file from a website's article with Sigil? Reading long-form articles on an e-book reader is easier on my eyes than reading them on a computer monitor. I have tried Calibre and a browser extension without success so far, so I need a new solution. Since the website does not have an RSS feed, Calibre's news feed feature cannot be used. The browser extension dotEPUB does not work well because the website has too many images. In my humble experience, it is only good for text-heavy articles where the images are not needed, but in my case the article images are needed. See the dotEPUB error message at the top center of the page: "This page has too many images .... We cannot include them all, sorry ...." This is the article being used. How can Sigil be used to convert an HTML web page from this particular website to EPUB? Any clues appreciated. Thank you. |
02-18-2023, 06:34 PM | #2 |
Wizard
Posts: 1,165
Karma: 4917718
Join Date: Sep 2021
Location: Australia
Device: Kobo Libra 2
|
You mean something like this in the image...
Sorry, this is in Calibre, not Sigil. |
02-18-2023, 06:56 PM | #3 |
Sigil Developer
Posts: 7,736
Karma: 5446592
Join Date: Nov 2009
Device: many
|
Although Sigil can add existing html/xhtml files to an empty epub, these are typically files located on your local machine. Have you tried saving the web page as an offline archive with your desktop browser? Alternatively, command-line programs like curl can be used to grab web pages as well.
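As a rough sketch of the curl approach, the same thing can be done with Python's standard library. The helper name `save_page`, its parameters, and the example URL further down are placeholders invented for illustration, not anything specific to the site in question. Note that this saves only the HTML itself; images and stylesheets would still have to be fetched separately.

```python
import urllib.request
from pathlib import Path


def save_page(url: str, out_path: str, user_agent: str = "Mozilla/5.0") -> Path:
    """Download the raw HTML at `url` and write it to `out_path`.

    Hypothetical helper for illustration only.
    """
    # Some sites reject requests that lack a browser-like User-Agent header.
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    with urllib.request.urlopen(request, timeout=30) as response:
        data = response.read()
    target = Path(out_path)
    # Write the bytes unchanged so the page's original encoding is preserved.
    target.write_bytes(data)
    return target
```

Calling `save_page("https://example.com/article.html", "article.html")` would leave a local `article.html` that can then be added to an empty epub in Sigil.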
The problem is that most websites are loaded with active page elements driven by javascript, which can be a pain to unravel and clean up, not to mention crap like remote advertising and tracking. Properly cleaning this up would be a headache for something that is only going to be read once. Have you checked whether your e-reading device has a browser? If so, that may be a better solution for short-term reading. |
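To give an idea of what the cleanup involves, here is a minimal sketch that drops scripts and a few other embedded elements from a saved page using only Python's standard library. The tag list and the names `ScriptStripper` and `strip_active_content` are assumptions made for illustration. It does nothing about ad markup that lives in ordinary `div` elements, and it does not produce the well-formed XHTML an epub needs, so treat it as a first pass only.

```python
from html.parser import HTMLParser

# Tags dropped together with everything inside them (an assumed list; extend as needed).
DROP_WITH_CONTENT = {"script", "style", "noscript", "iframe"}


class ScriptStripper(HTMLParser):
    """Re-emit HTML while skipping the tags in DROP_WITH_CONTENT.

    HTML comments are dropped as well, because handle_comment is not overridden.
    """

    def __init__(self):
        # Keep entities such as &amp; as written instead of decoding them.
        super().__init__(convert_charrefs=False)
        self.out = []
        self.skip_depth = 0  # > 0 while inside a dropped element

    def handle_starttag(self, tag, attrs):
        if tag in DROP_WITH_CONTENT:
            self.skip_depth += 1
        elif self.skip_depth == 0:
            self.out.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        if tag in DROP_WITH_CONTENT:
            self.skip_depth = max(0, self.skip_depth - 1)
        elif self.skip_depth == 0:
            self.out.append(f"</{tag}>")

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags such as <br/> are copied through unless dropped.
        if self.skip_depth == 0 and tag not in DROP_WITH_CONTENT:
            self.out.append(self.get_starttag_text())

    def handle_data(self, data):
        if self.skip_depth == 0:
            self.out.append(data)

    def handle_entityref(self, name):
        if self.skip_depth == 0:
            self.out.append(f"&{name};")

    def handle_charref(self, name):
        if self.skip_depth == 0:
            self.out.append(f"&#{name};")

    def handle_decl(self, decl):
        # Preserve the doctype declaration.
        self.out.append(f"<!{decl}>")


def strip_active_content(html: str) -> str:
    """Return `html` with scripts, styles, noscript blocks and iframes removed."""
    parser = ScriptStripper()
    parser.feed(html)
    parser.close()
    return "".join(parser.out)
```

An unclosed `iframe` or `noscript` tag in the source would cause everything after it to be skipped, so it is worth checking the result by eye before importing it into Sigil.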
02-18-2023, 09:07 PM | #4 |
null operator (he/him)
Posts: 20,678
Karma: 26966376
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
A couple of dotEPUB alternatives:
https://github.com/alexadam/save-as-ebook
https://epub.press/
About a decade ago I used another one named something like "GrabMyPage". It was good at ignoring the garbage, although javascript wouldn't have been as common then. IIRC, an MR member recommended it.
BR |
02-19-2023, 03:19 AM | #5 |
Guru
Posts: 661
Karma: 4568205
Join Date: Jan 2010
Location: Sweden
Device: Kobo Forma
|
One trick for getting around ads and javascript stuff is to use Pocket. Have Pocket fetch the article, and then save the article from your Pocket archive. You get a much cleaner version.
(Or get a Kobo and use Pocket automatically.) |