12-08-2009, 01:30 AM   #3
Nitrousoxide
Quote:
Originally Posted by Sabardeyn
I've lost the exact topic location, but read a MobileRead topic named "Tyrannosaurus Regex". I know I posted something last year in there about doing some automated input.

However, ultimately, nothing will work for you completely. You've got files which appear to be named using many different formulas. So you'll need many different filters and import selectively.

Well, the Python code in the old topic does a pretty darn good job. It even copes with file names that don't have a series in them.

The only thing I really want out of it now is for it to ignore the crap after the title in the file names, stuff like "(.html.jpg v1.0)".

Is there a way to have the expression specifically ignore anything in parentheses when it's capturing the title?

For reference, here is the old post.

Edit: and here is the expression:
Code:
(?P<author>((?!\s-\s).)*)\s-(?:\s((?P<series>.+) (?P<series_index>\d+)((?!\s-\s).)*)\s-)?\s(?P<title>.*) 
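
One possible tweak, offered as a sketch rather than a tested fix: make the title group lazy and follow it with an optional, uncaptured "everything from the first opening parenthesis onward" group anchored at the end of the name. The TRIMMED pattern, the parse helper, and the sample file name below are made up for illustration. It assumes the expression is applied with Python's re module from the start of the file name (I have not checked how the import tool itself applies it), and it will also cut short any title that legitimately contains a parenthesis.

```python
import re

# The expression from the post, unchanged (split across lines for readability).
ORIGINAL = (
    r"(?P<author>((?!\s-\s).)*)\s-"
    r"(?:\s((?P<series>.+) (?P<series_index>\d+)((?!\s-\s).)*)\s-)?"
    r"\s(?P<title>.*)"
)

# Hypothetical variant: only the tail differs.
#   (?P<title>.*?)   lazy, so the title stops as early as it can
#   (?:\s*\(.*)?     optionally swallow " (...anything" without capturing it
#   $                force the match to run to the end of the name
TRIMMED = (
    r"(?P<author>((?!\s-\s).)*)\s-"
    r"(?:\s((?P<series>.+) (?P<series_index>\d+)((?!\s-\s).)*)\s-)?"
    r"\s(?P<title>.*?)(?:\s*\(.*)?$"
)


def parse(pattern, filename):
    """Return the named groups as a dict, or None if the name does not match."""
    match = re.match(pattern, filename)
    return match.groupdict() if match else None


# Made-up file name in the "Author - Series NN - Title (junk)" shape.
sample = "Isaac Asimov - Foundation 01 - Foundation (.html.jpg v1.0)"

print(parse(ORIGINAL, sample)["title"])  # Foundation (.html.jpg v1.0)
print(parse(TRIMMED, sample)["title"])   # Foundation
```

Names without a series, or without any trailing parentheses, should still go through the same pattern, since both the series part and the parenthesis part are optional.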

Last edited by Nitrousoxide; 12-08-2009 at 01:45 AM.