05-17-2011, 11:16 AM   #10
leehach
Device: Kindle DX
Quote:
Originally Posted by Manichean
I haven't run Calibre on Linux, but I believe you only need SU privileges when installing a new version.
Great, just installed 0.8.1. I'll test it and see how it goes.

Quote:
Originally Posted by Manichean
Try something like
Code:
(?P<author>[^_]+)(,? et al)? (?P<published>\d{4})? - (?P<title>.+)
That should at least give you the possible fields from all the files except the last two. I don't know what can be done about those without getting all horribly complicated. It'd probably be easier to just accept that you'll have to do a certain amount of manual work on your metadata.
The et al group had no effect. This gave me an idea, though. Both of the following work. Does one or the other seem "safer" to you?
Code:
(?P<author>[^_]+)(,? ?e?t? ?a?l?) (?P<published>\d{4}) - (?P<title>[^_]+)
  -or-
(?P<author>[^_]+)(, et al|) (?P<published>\d{4}) - (?P<title>[^_]+)
These both successfully strip out ", et al", while not interfering when ", et al" was not present.
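
For anyone who wants to check patterns like these outside Calibre, here is a minimal Python sketch using the standard `re` module. The sample filenames are made up for illustration (they are not the real files from this thread), and the non-greedy `[^_]+?` variant is an assumption added for comparison, not one of the two patterns above. In plain `re`, a greedy author group can absorb ", et al" by itself when the following group is allowed to match nothing, so it is worth looking at what each group captures and not only at whether the match succeeds. Results inside Calibre may differ.

```python
import re

# Hypothetical filenames (extension already removed), for illustration only.
SAMPLES = [
    "Smith, et al 2009 - Some Title",
    "Jones 2010 - Another Title",
]

# The second pattern from the post, unchanged.
GREEDY = re.compile(
    r"(?P<author>[^_]+)(, et al|) (?P<published>\d{4}) - (?P<title>[^_]+)"
)

# Assumed variant: identical except that the author group is non-greedy.
LAZY = re.compile(
    r"(?P<author>[^_]+?)(, et al|) (?P<published>\d{4}) - (?P<title>[^_]+)"
)


def fields(pattern, name):
    """Return the named groups for name, or None if the pattern does not match."""
    m = pattern.search(name)
    return m.groupdict() if m else None


if __name__ == "__main__":
    for name in SAMPLES:
        print(name)
        print("  greedy:", fields(GREEDY, name))
        print("  lazy:  ", fields(LAZY, name))
```

If the greedy and non-greedy versions report different authors for your own filenames, the non-greedy one is probably the safer choice for stripping ", et al".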

Meanwhile, I understand that some manual work will be necessary. I only threw in the last two filenames so you could see what I was working with. The above expression brings those in as author=Unknown and title=the complete filename, which is as good as I can expect.
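
The fallback I'm seeing can be imitated with a few lines of Python. This is only a rough sketch of the observed behavior, not Calibre's actual code: `parse_filename` is a hypothetical helper, and the sample names in the comments are invented.

```python
import re

# Second pattern from this post, unchanged.
PATTERN = re.compile(
    r"(?P<author>[^_]+)(, et al|) (?P<published>\d{4}) - (?P<title>[^_]+)"
)


def parse_filename(name):
    """Return author/published/title for name.

    When the pattern does not match, fall back to author "Unknown" and
    use the complete filename as the title (assumed to mirror what the
    import shows for non-matching files).
    """
    m = PATTERN.search(name)
    if m is None:
        return {"author": "Unknown", "published": None, "title": name}
    return m.groupdict()


if __name__ == "__main__":
    print(parse_filename("Jones 2010 - Another Title"))  # matches the pattern
    print(parse_filename("notes_final_v2"))              # falls back
```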

Thanks,
--Lee