![]() |
#1 |
Junior Member
![]() Posts: 5
Karma: 10
Join Date: Nov 2010
Device: iPad
|
![]()
I have done a search but can't see anything mentioning this (sorry if I missed it).
From reading through I understand that converting from PDF's isn't ideal but unfortunately this is what I have. Generally I have got the conversion to work OK (or it seems to at first glance) but it isn't putting page breaks in. So page numbers, headers and footers are put throughout the text. I hope this is making sense I am not the best at describing. ![]() Can anyone point me in the right direction in basic terms that my basic brain can follow e.g. step 1...? (I don't understand programming language) Thanks. |
![]() |
![]() |
![]() |
#2 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 3,130
Karma: 91256
Join Date: Feb 2008
Location: Germany
Device: Cybook Gen3
|
Page breaks generally make little sense in reflowable formats like ePub, which is why they are (mostly) removed.
Are you trying to preserve the pagebreaks as they are in the PDF file, or do you want to get rid of headers/footers and page numbers? If it's the latter, you might want to take a look at the relevant manual pages. |
![]() |
![]() |
Advert | |
|
![]() |
#3 |
Junior Member
![]() Posts: 5
Karma: 10
Join Date: Nov 2010
Device: iPad
|
Thanks Manichean!
Now to get it to work, so far I am not having much luck. Would it be possible to give me the correct expression to copy and paste? It is to remove the title on the even pages, the author on the odd pages and the page number (just as the number: 1, 2 etc.). I don't know if it makes any difference but I am using Calibre version 0.7.25. I am also wondering if I am putting it the correct place: Structure Detection, header regular expresion, with a tick in remove header. Sorry to be a pain! |
![]() |
![]() |
![]() |
#4 | |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 3,130
Karma: 91256
Join Date: Feb 2008
Location: Germany
Device: Cybook Gen3
|
Quote:
|
|
![]() |
![]() |
![]() |
#5 |
Junior Member
![]() Posts: 5
Karma: 10
Join Date: Nov 2010
Device: iPad
|
Thanks for replying.
I have read the info on both links you gave and done a further search and thought I understood but when I try it out it doesn't do anything. Here is a snippet of the book from the wizard: The sprawling Eller-Stapleton Inn, a coaching stop for <br> travelers on the way north, was miles from the nearest town <br> <hr> <A name=8></a><i>8 </i><br> <i>Highland Fling </i><br> and constable. Ordinarily she and her staff took care of their <br> own problems. Her capable innkeeper, Mr. Carson, main*<br> and “Who are they? Did they not give names?” she asked, <br> hoping they had refused. By law, an inn’s patrons had to <br> identify themselves and sign a register to obtain lodgings. <br> <hr> <A name=9></a><i>Betina Krahn </i><br> <i>9 </i><br> “They give names, all right.” Carson glowered, reaching <br> for his big leather register and opening it to the current page. <br> These being the parts I am trying to remove: <hr> <A name=8></a><i>8 </i><br> <i>Highland Fling </i><br> and <hr> <A name=9></a><i>Betina Krahn </i><br> <i>9 </i><br> I hope that after I see how the expression should be written I will understand for the future. I generally do after seeing a working example. Thanks again for your help with this, I really do appreciate it! |
![]() |
![]() |
Advert | |
|
![]() |
#6 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 3,130
Karma: 91256
Join Date: Feb 2008
Location: Germany
Device: Cybook Gen3
|
Generally, you need to identify the whitespaces and the variable parts. Thus, the page number
Code:
<hr> <A name=8></a><i>8 </i><br> <i>Highland Fling </i><br> Code:
<hr>\s+<A\s+name=\d+></a><i>\d+\s+</i><br>\s+<i>Highland\s+Fling\s+</i><br> Code:
<hr> <A name=9></a><i>Betina Krahn </i><br> <i>9 </i><br> Code:
<hr>\s+<A\s+name=\d+></a><i>Betina\s+Krahn\s+</i><br>\s+<i>\d+\s+</i><br> |
![]() |
![]() |
![]() |
#7 |
Junior Member
![]() Posts: 5
Karma: 10
Join Date: Nov 2010
Device: iPad
|
Thank you!
I understand where I went wrong now and have managed to do another one on my own. ![]() Thanks again. Have a great weekend! |
![]() |
![]() |
![]() |
#8 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 3,130
Karma: 91256
Join Date: Feb 2008
Location: Germany
Device: Cybook Gen3
|
Glad it worked.
Out of interest, where did you go wrong? |
![]() |
![]() |
![]() |
#9 |
Junior Member
![]() Posts: 5
Karma: 10
Join Date: Nov 2010
Device: iPad
|
I forgot to put the white space between the words in the title and author. Daft I know, I was having a bad day.
![]() |
![]() |
![]() |
![]() |
#10 |
Zealot
![]() ![]() ![]() ![]() Posts: 143
Karma: 387
Join Date: Sep 2010
Device: Kindle 3
|
I apologize, regular expressions are just beyond me (but I did try!). Could a kind soul tell me, how to eliminate these very simple footer lines with a page number in them? E.g. the break from page 7 to 8 looks like this:
Code:
text of last line on previous page <br> <br> 7<br> <hr> <A name=8></a>first line of new page Thanxx, Mixx |
![]() |
![]() |
![]() |
Thread Tools | Search this Thread |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
PDF to ePub in Calibre - input somewhat scrambled | Seanette | ePub | 2 | 11-04-2010 07:34 AM |
PDF output - page size/orientation problems | kurokaze | Calibre | 1 | 09-26-2010 06:08 PM |
PDF to EPUB - spurious paragraph breaks | RichieTheK | Calibre | 2 | 09-08-2010 11:27 AM |
Any way to force page breaks when converting HTML to EPUB | Bierkonig | Calibre | 23 | 10-31-2009 01:51 PM |
PDF to LRF with page breaks | jupinator | Calibre | 0 | 07-27-2009 03:57 PM |