MobileRead Forums - View Single Post

Starson17 · 11-04-2010, 10:38 AM

Quote:

Originally Posted by grandin

Thanks, Starson.

I figured that the good ISBN would allow me to recupe the relevant title, author, and publisher data, whether or not it was already in the filename. I'm willing to try for that brute force method before going so far as to parse the other elements by a regex.

Here's a couple of filenames:
0262083558.The.MIT.Press.Ham.Radios.Technical.Cult ure.Dec.2006.pdf
0520233085.University.of.California.Press.The.Hors e.and.Jockey.from.Artemision.A.Bronze.Equestrian.M onument.of.the.Hellenistic.Period.Jul.2004.pdf
041530329X.Routledge.Politics.The.Basics.Jul.2004. pdf

Mostly academic titles, all from the same source.

Many thanks to whoever can lend a hand.

Try this:

Code:

(?P<isbn>.+?)\.(?P<title>.+)

You can't easily get a correct title or publisher given the format in the filenames. Just let it overwrite during a bulk metadata fetch.