01-11-2013, 09:01 AM | #1 |
Enthusiast
Posts: 40
Karma: 10
Join Date: Oct 2012
Device: Pocket Book Touch HD3
|
python regex: delete text in preprocessing
Hi all,
I am using a calibre recipe (Weltonline; german daily newspaper) to fetch news daily. Everything works fine, but in the final epub file after every news article there is plenty of (web-) rubbish I want to get rid of. Therefore I use Sigil and work on the epub I have downloaded from my calibre server. I delete everything between two string groups: Code:
( <div class="calibre7"> © Axel Springer AG 2013. Alle Rechte vorbehalten)([\s\S ]*?)(Weitere Hinweise</a></li>) Could someone tell me how to use the above regex in the forementioned recipe in a way to use this preprocessing? Thanks, Sebastian |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Regex problem: Trying to replace surrounding text without effecting the middle | ghostyjack | Workshop | 3 | 10-09-2012 04:26 PM |
Kobo Read - Can't delete Text file - txt | fglaysher | Calibre | 0 | 08-15-2010 06:08 PM |
Does de-DRMing delete text to speech restrictions? | Sydney's Mom | Amazon Kindle | 4 | 03-06-2010 11:46 PM |
Preprocessing to PRS-505 from Calibre | jeff363 | Calibre | 7 | 06-02-2008 07:20 AM |
Python Gutenberg E-text Project: PyGE | ignatz | Deals and Resources (No Self-Promotion or Affiliate Links) | 2 | 09-17-2004 01:18 PM |