View Single Post
Old 01-23-2017, 04:51 AM   #172
Teom@n
Enthusiast
Teom@n began at the beginning.
 
Posts: 47
Karma: 10
Join Date: Dec 2014
Location: Lyon
Device: Kindle PW3, Kobo Libra H2O
Quote:
Originally Posted by Doitsu View Post
Unfortunately, I can't help you with that, because scripts and commands will vary depending on the exact input and output formats. However, BS4 is well documented. For example:

Code:
soup.get_text()
will strip all tags from an HTML file.

If you're not a Python programmer, you could also use a text editor with regular expressions support, e.g. Notepad++, to remove unwanted tags or convert them to a different format.
Thanks mate.
Teom@n is offline   Reply With Quote