Old 11-04-2010, 10:38 AM   #5
Starson17
Wizard
Quote:
Originally Posted by grandin View Post
Thanks, Starson.

I figured that a good ISBN would let me recover the relevant title, author, and publisher data, whether or not it was already in the filename. I'm willing to try that brute-force method before going so far as to parse the other elements with a regex.

Here's a couple of filenames:
0262083558.The.MIT.Press.Ham.Radios.Technical.Culture.Dec.2006.pdf
0520233085.University.of.California.Press.The.Horse.and.Jockey.from.Artemision.A.Bronze.Equestrian.Monument.of.the.Hellenistic.Period.Jul.2004.pdf
041530329X.Routledge.Politics.The.Basics.Jul.2004.pdf

Mostly academic titles, all from the same source.

Many thanks to whoever can lend a hand.
Try this:
Code:
(?P<isbn>.+?)\.(?P<title>.+)
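You can't easily get a correct title or publisher from filenames in this format, because the publisher, title, and date are all separated by the same dots. The title group just grabs everything after the ISBN as a placeholder. Let the bulk metadata fetch overwrite it using the ISBN.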
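If you want to check what the pattern above captures before importing, here is a minimal Python sketch. It assumes the regex is matched against the filename with the extension already stripped; `split_filename` is a hypothetical helper written for illustration, not part of any tool.

```python
import os
import re

# Pattern from the post: the non-greedy isbn group stops at the first dot,
# and the title group takes everything after that dot.
FILENAME_RE = re.compile(r"(?P<isbn>.+?)\.(?P<title>.+)")

def split_filename(filename):
    """Return (isbn, title) for a filename, or None if the pattern does not match.

    Hypothetical helper: strips the extension first, which is an assumption
    about how the importing tool hands the name to the regex.
    """
    stem, _ext = os.path.splitext(filename)
    match = FILENAME_RE.match(stem)
    if match is None:
        return None
    return match.group("isbn"), match.group("title")

samples = [
    "0262083558.The.MIT.Press.Ham.Radios.Technical.Culture.Dec.2006.pdf",
    "041530329X.Routledge.Politics.The.Basics.Jul.2004.pdf",
]

for name in samples:
    isbn, title = split_filename(name)
    print(isbn, "|", title)
```

The isbn group comes out clean, while the title group still holds the publisher, title, and date run together, which is why the fetched metadata should replace it.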