|
|
#1 |
|
Junior Member
![]() Posts: 3
Karma: 10
Join Date: Jun 2011
Device: none
|
Help with regular expressions
Hi,
I have some books that have been named in the following manner. title [author, published year]{some text}.pdf e.g. Rethink ~ Cut Costs Boost Innovation [Ric Merrifield, 2009]{Summary}.pdf I want to add these correctly in calibre i.e. get title, author and published year information, while ignoring the string {Summary}. What would be the regular expression for this? I am a complete newbie here so please help. |
|
|
|
|
|
#2 |
|
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 3,130
Karma: 91256
Join Date: Feb 2008
Location: Germany
Device: Cybook Gen3
|
There's a tutorial for regular expressions available. I'm a little short on time right now, so I can't be more helpful, but I should be able to write a little more by tomorrow, if noone else has done it by then.
|
|
|
|
|
|
#3 |
|
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
|
|
|
|
|
|
|
#4 |
|
Junior Member
![]() Posts: 3
Karma: 10
Join Date: Jun 2011
Device: none
|
Thanks, that worked! Is this standard python? Asking since I want to start doing this myself at some point in time.
|
|
|
|
|
|
#5 |
|
Junior Member
![]() Posts: 3
Karma: 10
Join Date: Jun 2011
Device: none
|
Actually, I have a followup question. I have many books currently named such that the tags are included in their names. The tags are always included in curly braces. e.g. consider the previous book I had mentioned.
Rethink ~ Cut Costs Boost Innovation [Ric Merrifield, 2009].pdf If there were tags for this book, the name of the book would be as follows: Rethink ~ Cut Costs Boost Innovation [Ric Merrifield, 2009]{Innovation, Cost Control}.pdf How would the regular expression be changed to import the tags as well? This would save me a huge amount of work to classify articles and books. Thanks in advance. BTW, I tried the following: (?P<title>[^_-]+) \[(?P<author>[^_0-9-]*), (?P<published>[0-9]*)] \{(?P<tags>[^_0-9-]*)} but that just imports the entire filename as title, which does not help me at all! I also used <tag> for tag. Any pointers? Last edited by jevonbrady; 06-20-2011 at 09:04 PM. Reason: adding details |
|
|
|
|
|
#6 |
|
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
|
|
|
|
|
|
|
#7 | |
|
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
|
Quote:
|
|
|
|
|
![]() |
| Tags |
| regular expressions |
|
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Problem with regular expressions | Manichean | Conversion | 10 | 02-03-2011 03:27 PM |
| An introduction to regular expressions | Manichean | Conversion | 0 | 01-26-2011 06:05 PM |
| Help with Regular Expressions | ghostyjack | Workshop | 2 | 01-08-2010 12:04 PM |
| Regular Expressions help needed | Phil_C | Workshop | 20 | 10-03-2009 01:14 AM |
| BookDesigner v5 and regular expressions | ShineOn | Sony Reader | 11 | 08-25-2008 05:06 PM |