Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Calibre

Notices

Reply
 
Thread Tools Search this Thread
Old 03-25-2010, 02:32 PM   #1
ereader123
Junior Member
ereader123 began at the beginning.
 
Posts: 4
Karma: 10
Join Date: Mar 2010
Device: Kindle
Converting PDFs from Packt Publishing

I used calibre 0.6.45 (Linux) to convert a pdf ebook "Python Testing" from Packt and it is about 99% OK. There are a couple of problems:

1. Every page in the pdf has a footer with a Packt logo and the sentence "This material is copyright and is licensed for the sole use by (me) on 19th March 2010 (then my address). The image and sentence appear in various locations on each ebook page that I read in my Kindle 2. The logo is about 1" tall and 2" wide.
2. The header on each page (the chapter title) is repeated throughout the pages as a line of text.
Is there a way to tell Calibre to ignore the headers and footers in the pdf so I can remove these rather annoying bits of text and images?

3. Bulleted lists are missing the bullet and each line has an extra line break. Is there a way to help Calibre do a better job of formatting the bullets?

4. There are many areas in the book where the author lists a sequence of steps, and then displays an image of the screen output from completing the steps. For some reason, the image of the screen output is placed before the sequence of steps in the converted file. Is this normal?

I thought of looking in the recipes, but there are too many to manually search and the forum search did not turn up anything useful for me.

I am very new to calibre and ebooks in general, so please forgive me if I have asked some dumb questions.

Thanks!
ereader123 is offline   Reply With Quote
Old 03-25-2010, 03:10 PM   #2
Starson17
Wizard
Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.
 
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
Quote:
Originally Posted by ereader123 View Post
I used calibre 0.6.45 (Linux) to convert a pdf ebook "Python Testing" from Packt and it is about 99% OK. There are a couple of problems:
PDF conversions always have problems. PDF is a tough format to convert to an ebook. A new convererter is being written, but until then ....

Quote:
1. Every page in the pdf has a footer with a Packt logo and the sentence "This material is copyright and is licensed for the sole use by (me) on 19th March 2010 (then my address). The image and sentence appear in various locations on each ebook page that I read in my Kindle 2. The logo is about 1" tall and 2" wide.
2. The header on each page (the chapter title) is repeated throughout the pages as a line of text.
Is there a way to tell Calibre to ignore the headers and footers in the pdf so I can remove these rather annoying bits of text and images?
Yes, look at the conversion options. I'm not the person to tell you how to do this, but there is regex control for removing header/footers and misc text. Searching on pdf and header or footer should help. Also regular expressions or regex and PDF.

Quote:
3. Bulleted lists are missing the bullet and each line has an extra line break. Is there a way to help Calibre do a better job of formatting the bullets?
I've seen this, too. Anyone who answers you will being helping me (actually, my wife who converts more than I do), too.

Quote:
I thought of looking in the recipes, but there are too many to manually search and the forum search did not turn up anything useful for me.
This isn't a recipe issue. It's a converter plugin issue. Someone with experience in PDF conversion should be along shortly
Starson17 is offline   Reply With Quote
Advert
Old 03-26-2010, 12:57 AM   #3
animedude01
Addict
animedude01 can extract oil from cheeseanimedude01 can extract oil from cheeseanimedude01 can extract oil from cheeseanimedude01 can extract oil from cheeseanimedude01 can extract oil from cheeseanimedude01 can extract oil from cheeseanimedude01 can extract oil from cheeseanimedude01 can extract oil from cheeseanimedude01 can extract oil from cheese
 
animedude01's Avatar
 
Posts: 254
Karma: 1200
Join Date: Jul 2009
Location: Los Angeles
Device: DR1000S, ILIAD2, Nokia n900, Kindle for PC, Astak EZReader Pro
PDF with specific formating likely won't convert well without a lot of cleanup. Maybe you can bug them about adding another format like epub or think about getting a larger screen reader.
animedude01 is offline   Reply With Quote
Old 03-26-2010, 08:23 PM   #4
ereader123
Junior Member
ereader123 began at the beginning.
 
Posts: 4
Karma: 10
Join Date: Mar 2010
Device: Kindle
Thanks for the info. I sent Packt a note asking for other ebook formats, and I have been experimenting with the regex for removing footers. I will start a new thread with my regex questions.
ereader123 is offline   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Converting PDFs macrotor PDF 62 08-14-2011 07:10 PM
Converting PDFs JoshLessard Amazon Kindle 12 10-07-2010 06:40 AM
Converting Layered? PDFs kerrware Calibre 2 06-30-2010 03:31 PM
reader for PDFs without converting? kuck Which one should I buy? 24 06-30-2010 02:55 AM
Numbers in pdfs not converting kilgoretrout Workshop 9 06-25-2010 05:18 PM


All times are GMT -4. The time now is 09:58 AM.


MobileRead.com is a privately owned, operated and funded community.