Old 12-14-2010, 04:26 PM   #1
fivebells
Junior Member
Posts: 8
Karma: 10
Join Date: Dec 2010
Device: Kindle
PDF conversion which respects images/tables?

I would like to convert PDF documents to the native Kindle format, preferably using Linux tools, but Windows tools would be OK too. Both the Amazon email-based conversion service and Calibre mangle tables and images pretty badly. I've heard that converting from HTML often works better. pdftohtml does a beautiful job of converting PDFs to HTML when I pass it the "-c" option, but it generates a large number of HTML and image files, which the Amazon conversion service doesn't recognize as a single document. Calibre can work with the output of pdftohtml if I provide it as a zip file, but the MOBI files it generates still have mangled tables and no images. Is there something else I should try?
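For anyone who wants to reproduce the zip step, here is a minimal sketch in Python (standard library only) that packs a folder of pdftohtml output into a single zip file. The function name and the folder and file names are made up for illustration, and running pdftohtml itself is assumed to have happened already.

```python
import zipfile
from pathlib import Path


def zip_html_output(src_dir, zip_path):
    """Pack every file under src_dir into one zip, keeping relative paths.

    Returns the list of archive names that were written.
    """
    src = Path(src_dir)
    zip_path = Path(zip_path)
    names = []
    with zipfile.ZipFile(zip_path, "w", zipfile.ZIP_DEFLATED) as zf:
        for path in sorted(src.rglob("*")):
            # Skip directories, and skip the archive itself in case it
            # was created inside src_dir.
            if path.is_file() and path.resolve() != zip_path.resolve():
                arcname = path.relative_to(src).as_posix()
                zf.write(path, arcname)
                names.append(arcname)
    return names


# Hypothetical usage: "out" holds the HTML and image files from pdftohtml.
# zip_html_output("out", "book.zip")
```

Keeping the relative paths matters because the HTML pages refer to the images by file name, so the images have to sit in the same place inside the archive.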
Old 12-14-2010, 04:37 PM   #2
susan_cassidy
Wizard
Posts: 2,251
Karma: 3720310
Join Date: Jan 2009
Location: USA
Device: Kindle, iPad (not used much for reading)
If it tries to create HTML tables, I don't think Kindle supports them well, or possibly not at all. If you save the table as a graphic, and embed that via an <img> tag in the html, it might help some. I know if you use Mobipocket Creator, you have to specify all the individual files separately as input, so it's kind of a pain if there are a lot of them. You might ask on the Calibre forum for more Calibre-specific questions.
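A rough sketch of the "table as a graphic" idea, in Python: it swaps each table block in an HTML string for an <img> tag and hands back the removed tables so they can be rendered to images separately (for example by taking screenshots). The function name and the image file names are assumptions for illustration, the regular expression does not handle nested tables, and producing the actual image files is not covered here.

```python
import re

# Matches one <table>...</table> block (non-greedy, so nested tables
# are NOT handled correctly by this simple pattern).
TABLE_RE = re.compile(r"<table\b.*?</table\s*>", re.IGNORECASE | re.DOTALL)


def tables_to_img_tags(html, name_pattern="table{n}.png"):
    """Replace each table with an <img> tag.

    Returns the rewritten HTML and the list of removed table markup,
    in document order, so each one can be rendered to the matching image.
    """
    removed = []

    def swap(match):
        removed.append(match.group(0))
        n = len(removed)
        src = name_pattern.format(n=n)
        return '<img src="{}" alt="Table {}" />'.format(src, n)

    return TABLE_RE.sub(swap, html), removed
```

The image files (table1.png, table2.png, and so on in this sketch) would then need to be placed next to the HTML file before conversion.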
Old 12-14-2010, 05:51 PM   #3
jocampo
Layback feline
Posts: 3,034
Karma: 6980745
Join Date: Nov 2010
Location: USA
Device: Oasis 2nd gen, Sony DPTS1, iPad Pro 10.5"
Hi there,

Converting PDFs, even to ePub, is a "hit or miss" affair. The result depends more on the document layout than on the tool itself.

In Calibre, these are the best source formats to convert from, in order of preference: LIT, MOBI, EPUB, HTML, PRC, RTF, PDB, TXT, PDF.

As you can see, PDF comes last, because of its vector images and its own technical specs.
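If a book is available in more than one format, that ordering can be applied mechanically. A small Python sketch, using only the list given in this post (the function name is made up for illustration):

```python
# Preference order as listed in this post, best source format first.
PREFERENCE = ["LIT", "MOBI", "EPUB", "HTML", "PRC", "RTF", "PDB", "TXT", "PDF"]


def best_source_format(available):
    """Return the most preferred format among those available, or None."""
    rank = {fmt: i for i, fmt in enumerate(PREFERENCE)}
    # Normalize entries such as ".pdf" or "html" to the upper-case names above.
    known = [f.upper().lstrip(".") for f in available]
    known = [f for f in known if f in rank]
    if not known:
        return None
    return min(known, key=rank.__getitem__)
```

For example, with PDF, HTML, and TXT copies on hand, this picks HTML as the one to feed to the converter.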
Old 12-15-2010, 03:43 PM   #4
Blog Kindle
Addict
Posts: 224
Karma: 25122
Join Date: Mar 2009
Device: Kindle 1/2/3/4/Touch/DX/Fire|PRS-600/350|Nook(color)|iPad|iPad2|EVO 4G
PDF conversion is a mess because PDF was not meant to be converted and reflowed. It was originally intended as electronic paper that contains immutable documents.

I'd just use Kindle native PDF support and not convert things.
Old 12-16-2010, 01:46 PM   #5
Bjorn2Read
Consigliere
Posts: 4
Karma: 10
Join Date: Dec 2010
Location: Alpha Centauri
Device: Kindle 2
If you are willing to go beyond free tools, consider purchasing Nitro PDF Professional (around $60 online). It is a robust PDF conversion tool that converts PDFs to Word, RTF, or TXT with very good results. It can also grab images from the PDF file and save them to a folder (great if you want to reference them in an HTML file).
Old 12-17-2010, 09:07 AM   #6
fivebells
Junior Member
Posts: 8
Karma: 10
Join Date: Dec 2010
Device: Kindle
Thanks, Bjorn. The promised features are exactly what I need, so I will check out the free trial.
Old 12-17-2010, 12:07 PM   #7
charonme
Zealot
Posts: 103
Karma: 11904
Join Date: Nov 2010
Device: K3
How does the Kindle display your original PDFs? Are the tables mangled too, or do they look exactly as they would if printed?
Old 12-18-2010, 10:48 AM   #8
jswinden
Nameless Being
 
Quote:
Originally Posted by susan_cassidy
If it tries to create HTML tables, I don't think Kindle supports them well, or possibly not at all. If you save the table as a graphic, and embed that via an <img> tag in the html, it might help some. I know if you use Mobipocket Creator, you have to specify all the individual files separately as input, so it's kind of a pain if there are a lot of them. You might ask on the Calibre forum for more Calibre-specific questions.
I think MOBI has very limited support for HTML tables. It also has limited support for CSS. It is certainly not as robust as ePub.

@fivebells, regardless of format, tables are going to be problematic on any small-screen device. There just isn't enough room to place more than a few columns on the screen. Even with, say, only three columns, the text has to be really small and/or it wraps so often that the table looks ugly.

If possible, I suggest rewriting the tabled information into a non-tabled format.
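As one possible way to do that rewriting, here is a minimal Python sketch that turns each table row into a short block of "Column: value" lines, which reflows cleanly on a small screen. The function name and the sample data in the usage note are made up for illustration.

```python
def table_to_paragraphs(header, rows):
    """Turn each table row into a titled block of 'Column: value' lines.

    The first cell of each row is used as the block title; the remaining
    cells are paired with the matching column names from the header.
    """
    blocks = []
    for row in rows:
        title = row[0]
        details = [
            "  {}: {}".format(col, val)
            for col, val in zip(header[1:], row[1:])
        ]
        blocks.append("\n".join([title] + details))
    return "\n\n".join(blocks)
```

With a hypothetical header of Model, Screen, Weight and a row of Model A, 6 in, 240 g, this gives "Model A" on one line followed by indented "Screen: 6 in" and "Weight: 240 g" lines, with a blank line between rows.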
Old 12-18-2010, 04:58 PM   #9
GlassX
Connoisseur
Posts: 79
Karma: 24830
Join Date: Aug 2010
Device: Kindle 3, K4NT
Quote:
Originally Posted by charonme
How does the Kindle display your original PDFs? Are the tables mangled too, or do they look exactly as they would if printed?
It looks exactly the same. However, in PDFs with two columns (e.g. scientific papers), the text is either too small to read or so large that it needs a lot of scrolling.
Old 12-19-2010, 11:18 AM   #10
baccilus
Enthusiast
Posts: 47
Karma: 10
Join Date: Sep 2010
Location: Chandigarh, India
Device: Kindle 3 WiFi
Quote:
Originally Posted by GlassX
It looks exactly the same. However, in PDFs with two columns (e.g. scientific papers), the text is either too small to read or so large that it needs a lot of scrolling.
I have observed that two-column science papers are easier to read because we can use a zoom level of 200% or more. However, some full-page PDFs can only be read in landscape mode, which I hate.