08-31-2010, 03:01 PM   #1
_Rodrigo_
Happy Calibering :D
Posts: 2
Karma: 10
Join Date: Aug 2010
Location: Rio de Janeiro, Brasil
Device: Nook
Conversion Issues in Nook


Hi, everybody!

First of all, I want to thank you for this amazing software! It is of stupendous quality, full of features, and highly customizable. I've been reading ebooks since 2005 and have never seen anything so good!

I recently bought a Nook and started trying to convert some old PDFs I have. I tried the Nook's built-in PDF support, but it does not work very well.

So, I am using Calibre to convert them to EPUB. Unfortunately, with some books, some letters were exchanged for others, much like OCR sometimes does. For example, the word "find" once became "fmd" (that is, "in" became "m"). In other cases, words are split apart or joined together.

Although this is a minor issue, it is sometimes a real problem for me because, as you may have guessed from my English, I am not a native speaker and sometimes cannot guess the original word. Worse yet, I like to read sci-fi and fantasy, two genres in which strange words are a certainty! I have already seen several new ways to write the word "illithid".

Besides this, the converted document sometimes has spaces and paragraph breaks added or removed.

All this preamble is to kindly ask whether there is anything I can do to improve the quality of the PDFs I feed to Calibre.

So far I have been using a case-by-case approach, sometimes first converting the PDF to a plain TXT or HTML file, but besides being a lengthy process, it isn't a universal one.

Does anyone have any advice for me? Guides, tips, and the like are welcome! I read the Nook documentation, FAQ included, but it did not help me. If I am missing something in the official documentation, please point it out to me.

Kind regards,

Rodrigo

08-31-2010, 03:21 PM   #2
Starson17
Wizard
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
Quote:
Originally Posted by _Rodrigo_ View Post
So, I am using Calibre to convert them to EPUB. Unfortunately, with some books, some letters were exchanged for others, much like OCR sometimes does.
Calibre doesn't do OCR. If you have a PDF that shows a word like "find" and, after converting it, you see "fmd," then "fmd" was in the original PDF. What happened is this: the original PDF was a scanned image, and someone later ran it through OCR software (Adobe Acrobat has that capability). When you read the book as a PDF, you saw the scanned image, so the words looked correct. If you had searched for the word, however, you would have found only "fmd," not "find." When Calibre converts, it uses the OCR text, not the images.
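One way to check this on a given file is to dump the PDF's text layer and search it. Below is a rough sketch, assuming Calibre's `ebook-convert` command-line tool is installed and on the PATH. The helper names `count_word` and `dump_text_layer` and the sample sentence are made up for illustration.

```python
import re
import shutil
import subprocess
import tempfile
from pathlib import Path


def count_word(text, word):
    """Count whole-word, case-insensitive occurrences of `word` in `text`."""
    pattern = r"\b" + re.escape(word) + r"\b"
    return len(re.findall(pattern, text, flags=re.IGNORECASE))


def dump_text_layer(pdf_path):
    """Convert a PDF to plain text and return that text.

    Assumption: Calibre's `ebook-convert` is installed and on the PATH.
    An image-only PDF (no OCR layer) will yield little or no text.
    """
    if shutil.which("ebook-convert") is None:
        raise RuntimeError("ebook-convert not found; is Calibre installed?")
    with tempfile.TemporaryDirectory() as tmp:
        out = Path(tmp) / "dump.txt"
        subprocess.run(["ebook-convert", pdf_path, str(out)], check=True)
        return out.read_text(encoding="utf-8", errors="replace")


if __name__ == "__main__":
    # Stand-in for dump_text_layer("book.pdf"), so the demo runs without Calibre.
    sample = "I could not fmd the door. Did you find it?"
    for word in ("find", "fmd"):
        print(word, count_word(sample, word))
```

If the misspelling turns up in the dumped text while the page looks fine in a PDF viewer, the error lives in the hidden OCR layer, not in the conversion.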

Quote:
All this preamble is to kindly ask whether there is anything I can do to improve the quality of the PDFs I feed to Calibre.
You need better OCR of the page images, or you need to proofread the OCR text first.
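As a starting point for proofing, here is a minimal sketch that flags words missing from a word list and suggests fixes by undoing a few typical OCR character confusions. The confusion table, the function names, and the tiny vocabulary are illustrative assumptions, not part of Calibre or any OCR package.

```python
import re

# A few character confusions often seen in OCR output (illustrative, not exhaustive).
# Each pair is (what the OCR produced, what was probably printed).
CONFUSIONS = [("m", "in"), ("m", "rn"), ("rn", "m"), ("l", "i"), ("1", "l"), ("0", "o"), ("vv", "w")]


def suggest(word, vocabulary):
    """Return vocabulary words reachable from `word` by undoing one confusion."""
    found = set()
    for wrong, right in CONFUSIONS:
        start = word.find(wrong)
        while start != -1:
            candidate = word[:start] + right + word[start + len(wrong):]
            if candidate in vocabulary:
                found.add(candidate)
            start = word.find(wrong, start + 1)
    return sorted(found)


def proof(text, vocabulary):
    """Map each word not in `vocabulary` to its suggested corrections."""
    report = {}
    for word in re.findall(r"[A-Za-z0-9]+", text.lower()):
        if word not in vocabulary and word not in report:
            report[word] = suggest(word, vocabulary)
    return report


if __name__ == "__main__":
    vocab = {"i", "could", "not", "find", "the"}
    print(proof("I could not fmd the illithid", vocab))
```

Invented names such as "illithid" will be flagged with no suggestion, so for sci-fi and fantasy it helps to add each book's own names to the vocabulary before proofing.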
08-31-2010, 03:24 PM   #3
JSWolf
Resident Curmudgeon
Posts: 37,007
Karma: 18129756
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Sony Reader PRS-650, iPad, nook STR
The problem is that PDFs cannot be converted without errors. PDF is not a text format. So when you convert, expect errors. There are no programs out there that will do a perfect conversion. Even Adobe Acrobat cannot convert without errors. Calibre is no different.
08-31-2010, 03:36 PM   #4
Starson17
Quote:
Originally Posted by JSWolf View Post
The problem is that PDFs cannot be converted without errors. PDF is not a text format. So when you convert, expect errors. There are no programs out there that will do a perfect conversion. Even Adobe Acrobat cannot convert without errors. Calibre is no different.
Just to be clear: Calibre is different in that it can't do OCR, while Acrobat can. His PDF book had previously been run through an OCR conversion. There are lots of PDF books out there that are scanned images of pages only. Other books are like his: they have images of the pages plus hidden OCR text that can be searched. The latter can be converted by Calibre, but the result is terrible.
All times are GMT -4.


MobileRead.com is a privately owned, operated and funded community.