Quote:
Originally Posted by Teom@n
Could you share the script/commands that you were using for converting HTML to text? I have an HTML file (124 MB) which I took from a MOBI file. I installed Python and BeautifulSoup.
Unfortunately, I can't help you with that, because the scripts and commands vary depending on the exact input and output formats. However, BS4 is
well documented. For example, calling get_text() on the parsed document
will strip all tags from an HTML file.
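
As a rough illustration of stripping all tags with BS4, here is a minimal sketch. It assumes the built-in "html.parser" backend and UTF-8 encoded files; html_to_text and convert_file are hypothetical helper names, and dropping script/style blocks is an extra step you may not need.

```python
from bs4 import BeautifulSoup


def html_to_text(html: str) -> str:
    """Return the text of an HTML string with all tags removed."""
    soup = BeautifulSoup(html, "html.parser")
    # Drop <script> and <style> elements so their contents
    # do not end up in the extracted text (optional step).
    for tag in soup(["script", "style"]):
        tag.decompose()
    # Join the text nodes with newlines and trim whitespace around each one.
    return soup.get_text(separator="\n", strip=True)


def convert_file(src_path: str, dst_path: str) -> None:
    """Read an HTML file and write its plain text to dst_path (assumes UTF-8)."""
    with open(src_path, encoding="utf-8") as src:
        text = html_to_text(src.read())
    with open(dst_path, "w", encoding="utf-8") as dst:
        dst.write(text)
```

With placeholder file names, this would be called as convert_file("book.html", "book.txt"). For a file well over 100 MB, expect the parse to take a while and to use a good deal of memory.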
If you're not a Python programmer, you could also use a text editor with
regular expression support, e.g.
Notepad++, to remove unwanted tags or convert them to a different format.
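
To give an idea of what such regular expressions might look like, here is a sketch in Python. The patterns are only examples (italics become *...*, paragraph ends become blank lines, everything else is dropped) and convert_tags is a hypothetical name; similar search patterns should work in a regex-capable editor's Replace dialog, though the replacement syntax can differ.

```python
import re


def convert_tags(html: str) -> str:
    """Convert a few tags to plain-text markers, then remove the rest."""
    # Convert <i>...</i> to *...* (non-greedy, may span line breaks).
    text = re.sub(r"<i>(.*?)</i>", r"*\1*", html, flags=re.IGNORECASE | re.DOTALL)
    # Turn each closing </p> into a blank line between paragraphs.
    text = re.sub(r"</p\s*>", "\n\n", text, flags=re.IGNORECASE)
    # Remove every remaining tag.
    text = re.sub(r"<[^>]+>", "", text)
    return text.strip()
```

Regular expressions do not really parse HTML, so this works best on simple, regular markup; comments, scripts, or attributes containing ">" can trip it up.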