Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Readers > Amazon Kindle

Notices

Reply
 
Thread Tools Search this Thread
Old 10-15-2011, 10:23 AM   #16
DiapDealer
Grand Sorcerer
DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.
 
DiapDealer's Avatar
 
Posts: 28,591
Karma: 204624552
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
Quote:
Originally Posted by HarryT View Post
You are wrong. Most PDF documents do NOT contain text. Some have a text "layer" in them, which DOES contain searchable text - this is generally added by OCR.
And that text layer (if it exists at all) can be of very dubious quality.
DiapDealer is online now   Reply With Quote
Old 10-15-2011, 10:42 AM   #17
HarryT
eBook Enthusiast
HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.
 
HarryT's Avatar
 
Posts: 85,548
Karma: 93383099
Join Date: Nov 2006
Location: UK
Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6
Quote:
Originally Posted by DiapDealer View Post
And that text layer (if it exists at all) can be of very dubious quality.
Exactly - it's generally "raw" OCR, added so that you can search the document.
HarryT is offline   Reply With Quote
Advert
Old 10-15-2011, 01:07 PM   #18
physics
Enthusiast
physics will become famous soon enoughphysics will become famous soon enoughphysics will become famous soon enoughphysics will become famous soon enoughphysics will become famous soon enoughphysics will become famous soon enoughphysics will become famous soon enough
 
Posts: 41
Karma: 729
Join Date: Dec 2010
Device: Kindle DX
There's a lot of negativity for pdfs here (which is understandable), but for those of us who need long and difficult equations, really there's no other formant available besides latex to pdf or postscript. Epub and mobi just don't support complicated equations.

Anyway, back to the question at hand, do you really need to convert your pdf to ebook format? I've managed to get along quite well by cropping pages, adding bookmarks, etc. There are a lot of tools out there for manipulating pdfs. I think learning to live with pdfs on your ereader might be less frustrating than trying to convert it.
physics is offline   Reply With Quote
Old 10-15-2011, 01:32 PM   #19
Daveoc64
Groupie
Daveoc64 has become one with the cosmosDaveoc64 has become one with the cosmosDaveoc64 has become one with the cosmosDaveoc64 has become one with the cosmosDaveoc64 has become one with the cosmosDaveoc64 has become one with the cosmosDaveoc64 has become one with the cosmosDaveoc64 has become one with the cosmosDaveoc64 has become one with the cosmosDaveoc64 has become one with the cosmosDaveoc64 has become one with the cosmos
 
Posts: 170
Karma: 21142
Join Date: Feb 2011
Location: Bristol, UK
Device: Kindle Oasis 3 (LTE)
I'd like to note that I'm not anti-PDF, far from it in fact!

It's just not an ebook format!
Daveoc64 is offline   Reply With Quote
Old 10-15-2011, 01:45 PM   #20
shinew
Addict
shinew ought to be getting tired of karma fortunes by now.shinew ought to be getting tired of karma fortunes by now.shinew ought to be getting tired of karma fortunes by now.shinew ought to be getting tired of karma fortunes by now.shinew ought to be getting tired of karma fortunes by now.shinew ought to be getting tired of karma fortunes by now.shinew ought to be getting tired of karma fortunes by now.shinew ought to be getting tired of karma fortunes by now.shinew ought to be getting tired of karma fortunes by now.shinew ought to be getting tired of karma fortunes by now.shinew ought to be getting tired of karma fortunes by now.
 
Posts: 309
Karma: 1008082
Join Date: Feb 2009
Location: NYC
Device: Kindle PW, K4 Touch, iPad2, Samsung Galaxy S II
iPad+GoodReader = PDF Win
shinew is offline   Reply With Quote
Advert
Old 10-15-2011, 02:35 PM   #21
HarryT
eBook Enthusiast
HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.
 
HarryT's Avatar
 
Posts: 85,548
Karma: 93383099
Join Date: Nov 2006
Location: UK
Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6
Quote:
Originally Posted by shinew View Post
iPad+GoodReader = PDF Win
True, but entirely irrelevant to a Kindle discussion.
HarryT is offline   Reply With Quote
Old 10-15-2011, 02:44 PM   #22
shinew
Addict
shinew ought to be getting tired of karma fortunes by now.shinew ought to be getting tired of karma fortunes by now.shinew ought to be getting tired of karma fortunes by now.shinew ought to be getting tired of karma fortunes by now.shinew ought to be getting tired of karma fortunes by now.shinew ought to be getting tired of karma fortunes by now.shinew ought to be getting tired of karma fortunes by now.shinew ought to be getting tired of karma fortunes by now.shinew ought to be getting tired of karma fortunes by now.shinew ought to be getting tired of karma fortunes by now.shinew ought to be getting tired of karma fortunes by now.
 
Posts: 309
Karma: 1008082
Join Date: Feb 2009
Location: NYC
Device: Kindle PW, K4 Touch, iPad2, Samsung Galaxy S II
Quote:
Originally Posted by HarryT View Post
True, but entirely irrelevant to a Kindle discussion.
it means give it up on the Kindle & get something more suitable for general PDF reading. iPad currently probably has the best apps for it.
shinew is offline   Reply With Quote
Old 10-15-2011, 03:13 PM   #23
amoroso
Groupie
amoroso ought to be getting tired of karma fortunes by now.amoroso ought to be getting tired of karma fortunes by now.amoroso ought to be getting tired of karma fortunes by now.amoroso ought to be getting tired of karma fortunes by now.amoroso ought to be getting tired of karma fortunes by now.amoroso ought to be getting tired of karma fortunes by now.amoroso ought to be getting tired of karma fortunes by now.amoroso ought to be getting tired of karma fortunes by now.amoroso ought to be getting tired of karma fortunes by now.amoroso ought to be getting tired of karma fortunes by now.amoroso ought to be getting tired of karma fortunes by now.
 
amoroso's Avatar
 
Posts: 185
Karma: 1004070
Join Date: Jul 2010
Location: Italy
Device: Kindle for Android, Google Play Books
Quote:
Originally Posted by tentimes View Post
I am confused about why there is no program out there that can take the textual information in a pdf book, plus the index (bookmarks) and turn it into a an indexed book.
A PDF document is a software program containing instructions written in a restricted subset of the PostScript document description language, which is a full blown stack-based programming language. Extracting text from a PDF document is difficult because it is not stored in specific sections of the file, but scattered in difficult to predict ways among the instructions that generate the document layout. Consider for example the following pseudocode fragments that print the same "book" string:
Quote:
PRINT "book"

PRINT "b" + "o" + "o" + "k"

PRINT "bo" + "o" + "k"

string = { "b", "o", "o", "k" }
FOR i IN string
PRINT string($i)
There are countless more equivalent code fragments, all different, that generate the same text. Each of the fragments includes in a different way the text or part of it. Extracting the text, and even locating it, is difficult. Something similar happens with instructions in a PDF file. A PDF conversion utility is actually a program-analyzing tool.
amoroso is offline   Reply With Quote
Old 10-15-2011, 04:04 PM   #24
DiapDealer
Grand Sorcerer
DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.
 
DiapDealer's Avatar
 
Posts: 28,591
Karma: 204624552
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
Quote:
There's a lot of negativity for pdfs here (which is understandable), but for those of us who need long and difficult equations, really there's no other formant available besides latex to pdf or postscript. Epub and mobi just don't support complicated equations.
Actually most people are fine with PDF's. I know I am (except on 6 inch screens). Most of the negativity comes into play when someone wants to turn a PDF into something else. A PDF just doesn't convert well... easily. I doubt it ever will.
DiapDealer is online now   Reply With Quote
Old 10-18-2011, 04:20 PM   #25
tentimes
Junior Member
tentimes began at the beginning.
 
Posts: 6
Karma: 10
Join Date: Oct 2011
Device: Kindle 4
Well I tried pdfmasher and it is doing a good job of extracting the text - I would love to know how it does it.

It's just that I have spent so much money on pdf books and I am determined that I am not paying for them again.

My best solution to date now is to use pdfmasher or bliss and send to amazon for convert.

I am sure that someone will complete the next step and write a program to do a proper conversion, with a bit of AI logic. The sooner the better for all of us who bought technical books on pdf

Last edited by tentimes; 10-18-2011 at 04:27 PM.
tentimes is offline   Reply With Quote
Old 10-18-2011, 04:26 PM   #26
DiapDealer
Grand Sorcerer
DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.
 
DiapDealer's Avatar
 
Posts: 28,591
Karma: 204624552
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
Quote:
I am sure that someone will complete the next step and write a program to do a proper conversion, with a bit of AI logic. The sooner the better for all of us who bought technical books on pdf
I seriously doubt that will ever happen. PDF formatting varies wildly from document to document. So much so, that some sort of manual intervention and/or manipulation will almost certainly be required for any quality PDF conversion. Automatic quality conversions is pretty much out of the question.
DiapDealer is online now   Reply With Quote
Old 10-18-2011, 05:17 PM   #27
alsaan
Enthusiast
alsaan knows what time it isalsaan knows what time it isalsaan knows what time it isalsaan knows what time it isalsaan knows what time it isalsaan knows what time it isalsaan knows what time it isalsaan knows what time it isalsaan knows what time it isalsaan knows what time it isalsaan knows what time it is
 
alsaan's Avatar
 
Posts: 27
Karma: 2000
Join Date: Aug 2010
Device: Kindle 2 & Kindle 3
The PDF format just doesn't preserve enough information about the structure of the document to allow further conversions. I like to think of it as a "lossy" text format, of sorts.

Trying to extract a properly formatted document from a PDF is akin to hoping to recover a full-sized image by "enhancing" a small thumbnail.
alsaan is offline   Reply With Quote
Old 10-18-2011, 05:41 PM   #28
Zeebra
Evangelist
Zeebra ought to be getting tired of karma fortunes by now.Zeebra ought to be getting tired of karma fortunes by now.Zeebra ought to be getting tired of karma fortunes by now.Zeebra ought to be getting tired of karma fortunes by now.Zeebra ought to be getting tired of karma fortunes by now.Zeebra ought to be getting tired of karma fortunes by now.Zeebra ought to be getting tired of karma fortunes by now.Zeebra ought to be getting tired of karma fortunes by now.Zeebra ought to be getting tired of karma fortunes by now.Zeebra ought to be getting tired of karma fortunes by now.Zeebra ought to be getting tired of karma fortunes by now.
 
Zeebra's Avatar
 
Posts: 463
Karma: 956567
Join Date: Oct 2010
Location: Toronto, Canada
Device: Kindle Oasis 3
Quote:
Originally Posted by tentimes View Post
Well I tried pdfmasher and it is doing a good job of extracting the text - I would love to know how it does it.

It's just that I have spent so much money on pdf books and I am determined that I am not paying for them again.

My best solution to date now is to use pdfmasher or bliss and send to amazon for convert.

I am sure that someone will complete the next step and write a program to do a proper conversion, with a bit of AI logic. The sooner the better for all of us who bought technical books on pdf
I was playing around with PDfmasher as well and it's fairly cool. Very easy to remove headers and footers but I was having issues getting footnotes to work.
Zeebra is offline   Reply With Quote
Old 10-18-2011, 06:03 PM   #29
DiapDealer
Grand Sorcerer
DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.
 
DiapDealer's Avatar
 
Posts: 28,591
Karma: 204624552
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
Quote:
I was playing around with PDfmasher as well and it's fairly cool. Very easy to remove headers and footers but I was having issues getting footnotes to work.
I've found the footnotes function to be a tad flaky with PDFMasher, but I've gotten pretty close a few time with documents that had buckets of footnotes. Close enough that I didn't mind fixing up the results by hand. And it seems to be getting better all the time. The different sorting abilities makes it pretty powerful and it's by far my favorite for very simply formatted novels, but losing italics really annoys me when converting PDF's (not PDFMasher's fault, I know).
DiapDealer is online now   Reply With Quote
Old 10-18-2011, 06:39 PM   #30
Blossom
Treasure Seeker
Blossom ought to be getting tired of karma fortunes by now.Blossom ought to be getting tired of karma fortunes by now.Blossom ought to be getting tired of karma fortunes by now.Blossom ought to be getting tired of karma fortunes by now.Blossom ought to be getting tired of karma fortunes by now.Blossom ought to be getting tired of karma fortunes by now.Blossom ought to be getting tired of karma fortunes by now.Blossom ought to be getting tired of karma fortunes by now.Blossom ought to be getting tired of karma fortunes by now.Blossom ought to be getting tired of karma fortunes by now.Blossom ought to be getting tired of karma fortunes by now.
 
Blossom's Avatar
 
Posts: 18,708
Karma: 26026435
Join Date: Mar 2010
Device: Kobo HD Glo, Kindles, Kindle Fires, Andriod Devices
Quote:
Originally Posted by DiapDealer View Post
I've found the footnotes function to be a tad flaky with PDFMasher, but I've gotten pretty close a few time with documents that had buckets of footnotes. Close enough that I didn't mind fixing up the results by hand. And it seems to be getting better all the time. The different sorting abilities makes it pretty powerful and it's by far my favorite for very simply formatted novels, but losing italics really annoys me when converting PDF's (not PDFMasher's fault, I know).
So it doesn't convert italics? I was going to give a shot but I do get good results with Acrobat Pro on Novel PDFs. It pulls the styles from the PDF just fine as long as the PDF is tagged.
Blossom is offline   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
KINDLE DEAL: The Holy Bible: NKJV ($3.36 CANADA) gospelebooks Deals and Resources (No Self-Promotion or Affiliate Links) 2 04-09-2011 12:07 PM
Free Book (Kindle / Nook) - The Holy Bible koland Deals and Resources (No Self-Promotion or Affiliate Links) 21 11-14-2010 01:51 PM
Free Book (Kindle) - The Holy Bible koland Deals and Resources (No Self-Promotion or Affiliate Links) 21 10-09-2010 10:31 AM
Free Book (Kindle) - Holy Bible (GW) koland Deals and Resources (No Self-Promotion or Affiliate Links) 0 10-04-2010 03:29 AM
The search for the Holy Grail of reading lights continues Bob Russell News 19 04-01-2009 01:24 PM


All times are GMT -4. The time now is 03:14 PM.


MobileRead.com is a privately owned, operated and funded community.