Quote:
Originally Posted by Sabardeyn
I've lost the exact topic location, but read a MobileRead topic named "Tyrannosaurus Regex". I know I posted something last year in there about doing some automated input.
However, ultimately, nothing will work for you completely. You've got files which appear to be named using many different formulas. So you'll need many different filters and import selectively.
Well, the Python code in the old topic does a pretty darn good job. It even copes with filenames that don't include a series.
The only thing I really want out of it now is for it to ignore the crap after the title in the file names, stuff like "(.html.jpg v1.0)".
Is there a way to have that expression specifically ignore anything in parentheses when it's picking up the title?
For reference, here is the old post.
Edit: and here is the expression:
PHP Code:
(?P<author>((?!\s-\s).)*)\s-(?:\s((?P<series>.+) (?P<series_index>\d+)((?!\s-\s).)*)\s-)?\s(?P<title>.*)
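One possible tweak, as a rough sketch rather than a tested fix: make the title group lazy and let an optional trailing "(...)" chunk match outside of it, so it never ends up in the title. The snippet below tries that with Python's re module. The sample file names are made up, and it assumes the extension has already been stripped from the name before matching.

```python
import re

# Same expression as above, with only the title part changed:
# the title is matched lazily, and anything from the first "(" to the
# end of the name is consumed outside the named group.
PATTERN = re.compile(
    r"(?P<author>((?!\s-\s).)*)\s-"
    r"(?:\s((?P<series>.+) (?P<series_index>\d+)((?!\s-\s).)*)\s-)?"
    r"\s(?P<title>.+?)(?:\s*\(.*)?$"
)


def parse(name):
    """Return the named groups for a file name, or None if it doesn't match."""
    m = PATTERN.match(name)
    return m.groupdict() if m else None


# Hypothetical file names, extension already removed.
samples = [
    "Jane Doe - Some Series 03 - Book Title (.html.jpg v1.0)",
    "Jane Doe - Book Title (v1.0)",
    "Jane Doe - Book Title",
]
for name in samples:
    print(parse(name))
```

The catch is that this cuts the title at the first opening parenthesis, so a title that legitimately contains parentheses would get truncated as well.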