05-24-2010, 09:29 AM   #357
Starson17
Wizard
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
Quote:
Originally Posted by Yeti View Post
Interesting to notice how some things - like the offending text we are talking about here - don't show up in the PDF before conversion and then suddenly appear in the MOBI afterwards ...
I believe that means it's been hidden in the PDF. The text is there, but it isn't displayed until conversion makes it reappear.

Quote:
I just noticed also that neither the PDF before conversion, nor the MOBI afterwards have any italic print. I have the paper version of this book and, like all books it uses italics for emphasis, to indicate someone's train of thought, for foreign language and so on. This is quite important for a better understanding of the story, and would be nice to correct if possible too. But quite likely it was lost in creating the original PDF version?
Yes, it was probably stripped when the PDF was created. I don't know why, as a good conversion wouldn't have done that.

Quote:
Now, trying to get rid of the extra page breaks:
I tried using the expression <br>Generated by.*abclit.html<br> , but it does not highlight anything in the wizard.
I didn't think it would. Without seeing the text you want removed, and the codes around it, that was just a guess.

Quote:
I also tried leaving off the <br> , first at the start, then at the end - no luck, it does not highlight anything.
That's also not surprising - you don't have any <br> codes in that text.
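
To illustrate, here is a rough Python sketch of that check, run against the snippet quoted further down in this post. It assumes the wizard treats expressions the way Python's re module does; the names sample, attempts and results are mine and purely illustrative.

```python
import re

# Text copied from the wizard output quoted later in this post.
sample = (
    'Central Intelligence Agency. He <b>Generated by ABC Amber LIT Conv'
    '<a href="http://www.processtext.com/abclit.html">erter, '
    'http://www.processtext.com/abclit.html</a></b></p><p>\n'
    'was also at this moment'
)

# The expressions tried so far: <br> at both ends, at one end only, and at neither.
attempts = [
    r"<br>Generated by.*abclit.html<br>",
    r"<br>Generated by.*abclit.html",
    r"Generated by.*abclit.html<br>",
    r"Generated by.*abclit.html",
]

# Record whether each expression finds anything in the sample.
results = {pattern: re.search(pattern, sample) is not None for pattern in attempts}

for pattern, found in results.items():
    print(f"{pattern!r}: {'match' if found else 'no match'}")
```

Only the last expression finds anything, because the literal <br> never appears in that text.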

Quote:
Here is a copy-and-paste of a section of the text from the wizard after using the expression Generated by.*abclit.html :

... Central Intelligence Agency. He <b>Generated by ABC Amber LIT Conv<a href="http://www.processtext.com/abclit.html">erter, http://www.processtext.com/abclit.html</a></b></p><p>
was also at this moment ...
Try this:

Code:
<b>Generated by.*abclit.*<p>
That may not do it, as I don't see the part causing the break. I think that will just remove the bold "Generated by" block and the paragraph break after it, which accounts for the extra line. The part causing the page break may be in a part of the text you didn't post. If it's not bothering you, you don't need to go any further, but learning a bit of basic regex can be helpful if you are going to use Calibre over the long term.
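
If you want to experiment outside the wizard, here is a minimal Python sketch of what the expression above picks up. It assumes Python-style regex behavior, where . does not cross a line break; the sample is the snippet you posted, and sample, removed and cleaned are just illustrative names.

```python
import re

# The snippet posted above, with the line break after <p> preserved.
sample = (
    'Central Intelligence Agency. He <b>Generated by ABC Amber LIT Conv'
    '<a href="http://www.processtext.com/abclit.html">erter, '
    'http://www.processtext.com/abclit.html</a></b></p><p>\n'
    'was also at this moment'
)

pattern = r"<b>Generated by.*abclit.*<p>"

# Show exactly which text the expression would remove.
match = re.search(pattern, sample)
removed = match.group(0) if match else ""
print("Removed:", removed)

# Delete the matched text and look at what remains.
cleaned = re.sub(pattern, "", sample)
print("Left over:", repr(cleaned))
```

Because .* is greedy, the match runs to the last <p> on the same line. If several paragraphs share one line in your file, the non-greedy form .*? is the safer choice.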