Register Guidelines E-Books Search Today's Posts Mark Forums Read

Go Back   MobileRead Forums > E-Book Software > Calibre > Conversion

Notices

Reply
 
Thread Tools Search this Thread
Old 08-04-2011, 12:13 PM   #1
Josieb1
Wizard
Josieb1 ought to be getting tired of karma fortunes by now.Josieb1 ought to be getting tired of karma fortunes by now.Josieb1 ought to be getting tired of karma fortunes by now.Josieb1 ought to be getting tired of karma fortunes by now.Josieb1 ought to be getting tired of karma fortunes by now.Josieb1 ought to be getting tired of karma fortunes by now.Josieb1 ought to be getting tired of karma fortunes by now.Josieb1 ought to be getting tired of karma fortunes by now.Josieb1 ought to be getting tired of karma fortunes by now.Josieb1 ought to be getting tired of karma fortunes by now.Josieb1 ought to be getting tired of karma fortunes by now.
 
Josieb1's Avatar
 
Posts: 1,586
Karma: 2563284
Join Date: Nov 2009
Location: UK
Device: KPW2, Kobo Aura, Nook Glow, Ipad Air, Ipad Mini 2, IPhone 5
Regular Expressions help needed

Hi I was wondering if someone can help me?

I have over 100 PDF books to convert to mobi files but every one of those has the author, book title and page number on each page of the PDF.

My current process is to manually convert each book to a RTF then do Find/Replace to remove the erroneous details and then manually 'lift' up the text where the removal of those details leaves a gap. I have been told I can use Regular Expressions in Calibre to do this, at least get rid of the author name, book title and page number, but I have no idea how to do it.

I have read the tutorial a few times now but its total gibberish to me, i just don't understand it.

Is it possible for someone to write an expression for me? I would learn much easier with a written example I could understand and copy.

Thanks
Josieb1 is offline   Reply With Quote
Old 08-04-2011, 01:00 PM   #2
theducks
Grand Sorcerer
theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.
 
theducks's Avatar
 
Posts: 14,646
Karma: 5629001
Join Date: Aug 2009
Location: (The original) Silicon Valley, USA
Device: Galaxy Tab 2, Astak Pocket Pro, K4NT
Quote:
Originally Posted by Josieb1 View Post
Hi I was wondering if someone can help me?

I have over 100 PDF books to convert to mobi files but every one of those has the author, book title and page number on each page of the PDF.

My current process is to manually convert each book to a RTF then do Find/Replace to remove the erroneous details and then manually 'lift' up the text where the removal of those details leaves a gap. I have been told I can use Regular Expressions in Calibre to do this, at least get rid of the author name, book title and page number, but I have no idea how to do it.

I have read the tutorial a few times now but its total gibberish to me, i just don't understand it.

Is it possible for someone to write an expression for me? I would learn much easier with a written example I could understand and copy.

Thanks
We will assume you also read this: http://www.mobileread.com/forums/sho...d.php?t=118605
REGEX is not a ONE SIZE FITS ALL, it needs to be crafted to exactly fit your conditions or it can also remove good stuff, remove a portion, now making a easy job very difficult because a key part of the exact pattern has been flushed. (another way of saying that doing all the right matches in the wrong order can hurt you)
I prefer to use Sigil, where I get to see the found and decide if I want to replace that occurance (and did my replace work as expected )
Pages can have a Right and a Left version (2 patterns needed)
theducks is offline   Reply With Quote
 
Enthusiast
Old 08-04-2011, 01:22 PM   #3
Josieb1
Wizard
Josieb1 ought to be getting tired of karma fortunes by now.Josieb1 ought to be getting tired of karma fortunes by now.Josieb1 ought to be getting tired of karma fortunes by now.Josieb1 ought to be getting tired of karma fortunes by now.Josieb1 ought to be getting tired of karma fortunes by now.Josieb1 ought to be getting tired of karma fortunes by now.Josieb1 ought to be getting tired of karma fortunes by now.Josieb1 ought to be getting tired of karma fortunes by now.Josieb1 ought to be getting tired of karma fortunes by now.Josieb1 ought to be getting tired of karma fortunes by now.Josieb1 ought to be getting tired of karma fortunes by now.
 
Josieb1's Avatar
 
Posts: 1,586
Karma: 2563284
Join Date: Nov 2009
Location: UK
Device: KPW2, Kobo Aura, Nook Glow, Ipad Air, Ipad Mini 2, IPhone 5
thank you for your reply, yes I had read that page and even downloaded mobipocket creator but I didn't find that program very useful. Looks like I'll just stick to converting them manually for now. Thanks
Josieb1 is offline   Reply With Quote
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Regular Expressions geormes Calibre 4 08-04-2011 07:09 AM
Another help with regular expressions encapuchado Library Management 6 06-21-2011 03:14 PM
Help with regular expressions jevonbrady Library Management 6 06-21-2011 10:16 AM
Help with Regular Expressions ghostyjack Workshop 2 01-08-2010 11:04 AM
Regular Expressions help needed Phil_C Workshop 20 10-03-2009 12:14 AM


All times are GMT -4. The time now is 04:31 PM.


MobileRead.com is a privately owned, operated and funded community.