| Doitsu |
01-22-2017 06:26 PM |
Quote:
Originally Posted by Teom@n
(Post 3462662)
Could you share the script/commands that you were using for converting HTML to text? I have an HTML file (124 MB) which I took from a mobi file. I installed Python and BeautifulSoup.
|
Unfortunately, I can't help you with that, because scripts and commands will vary depending on the exact input and output formats. However, BS4 is well documented. For example, its get_text() method will strip all tags from an HTML file.
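A minimal sketch of that kind of tag stripping, assuming BS4 is installed as the bs4 package and the built-in "html.parser" backend is good enough for your file. The function names and the file names mentioned afterwards are made up for illustration; this is not the script I used myself.

```python
from bs4 import BeautifulSoup


def html_to_text(html: str) -> str:
    """Return the text content of an HTML string with all tags removed."""
    # "html.parser" ships with Python; lxml may be faster on very large
    # files, but it is a separate install (assumption: default is fine).
    soup = BeautifulSoup(html, "html.parser")
    return soup.get_text()


def convert_file(src_path: str, dst_path: str) -> None:
    """Read an HTML file, strip the tags and write the plain text out."""
    # UTF-8 is an assumption; adjust if your source file uses another encoding.
    with open(src_path, encoding="utf-8") as src:
        text = html_to_text(src.read())
    with open(dst_path, "w", encoding="utf-8") as dst:
        dst.write(text)
```

You would call it with something like convert_file("book.html", "book.txt"). Note that the whole file is parsed in memory, so a very large file may take a while and need a fair amount of RAM.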
If you're not a Python programmer, you could also use a text editor with regular expressions support, e.g. Notepad++, to remove unwanted tags or convert them to a different format.
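To give a rough idea of the regular expression approach, here is a small sketch in Python. The same patterns should also work in the Notepad++ Replace dialog with the "Regular expression" search mode selected. The patterns and function names are only illustrative assumptions; regular expressions don't really understand HTML, so they can trip over comments, scripts or a ">" inside an attribute value.

```python
import re

# Matches any single tag, e.g. <p>, </b> or <a href="...">.
TAG_PATTERN = re.compile(r"<[^>]+>")

# Matches an <i>...</i> pair and captures the text in between.
ITALIC_PATTERN = re.compile(r"<i>(.*?)</i>", re.DOTALL)


def strip_tags(html: str) -> str:
    """Remove every tag and keep only the text between tags."""
    return TAG_PATTERN.sub("", html)


def italics_to_underscores(html: str) -> str:
    """Convert <i>text</i> to _text_ as an example of changing the format."""
    return ITALIC_PATTERN.sub(r"_\1_", html)
```

In Notepad++ the first one would be: find <[^>]+> and replace with nothing. The second would be: find <i>(.*?)</i> and replace with _\1_.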
|