Register Guidelines E-Books Search Today's Posts Mark Forums Read

Go Back   MobileRead Forums > E-Book Readers > Amazon Kindle

Notices

Reply
 
Thread Tools Search this Thread
Old 08-17-2010, 12:33 AM   #1
beacher
Junior Member
beacher began at the beginning.
 
Posts: 3
Karma: 10
Join Date: Aug 2010
Device: Kindle
PDF -> AZW conversion, weird character spacing

I'm sure this has been asked, probably a million times, but for the life of me I could not find it, maybe I'm using the wrong search terms.

Any how, I've tried converting PDF with calibre, and with amazon's service, and in both scenarios, I get weird space in a lot of the text (with 2 books now), something like:

T h i s is a n ex a m p le.

Then when I try to have the kindle do text-to-speech it reads the letters to me rather then the words.

I'm hoping there are some things I can try to rectify this?
beacher is offline   Reply With Quote
Old 08-17-2010, 01:37 PM   #2
dmin7th
Member
dmin7th began at the beginning.
 
Posts: 14
Karma: 10
Join Date: Dec 2009
Device: Kindle 2, 3 & DX
Although I've only tried converting a few PDFs to mobi for my Kindles so far, I've had much better luck with Mobipocket Creator than with Calibre. I haven't come across the issue you're having yet, though.
dmin7th is offline   Reply With Quote
 
Enthusiast
Old 08-17-2010, 02:04 PM   #3
JSWolf
Resident Curmudgeon
JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.
 
JSWolf's Avatar
 
Posts: 37,665
Karma: 18475502
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Sony Reader PRS-650, iPad, nook STR
Actually, you cannot go PDF > AZW. AZW does not actually exist as a format when there is no DRM. If you strip the DRM from AZW, you get Mobipocket. So AZW without DRM is actually PRC/Mobi.
JSWolf is offline   Reply With Quote
Old 08-17-2010, 02:51 PM   #4
beacher
Junior Member
beacher began at the beginning.
 
Posts: 3
Karma: 10
Join Date: Aug 2010
Device: Kindle
Ok, I'll try Mobi Pocket Creator, thanks!

I emailed Amazon a PDF, and they sent back an AZW.. what that means I don't know, but I believe it said "AZW" in it.
beacher is offline   Reply With Quote
Old 08-17-2010, 02:59 PM   #5
AnemicOak
Bookaholic
AnemicOak ought to be getting tired of karma fortunes by now.AnemicOak ought to be getting tired of karma fortunes by now.AnemicOak ought to be getting tired of karma fortunes by now.AnemicOak ought to be getting tired of karma fortunes by now.AnemicOak ought to be getting tired of karma fortunes by now.AnemicOak ought to be getting tired of karma fortunes by now.AnemicOak ought to be getting tired of karma fortunes by now.AnemicOak ought to be getting tired of karma fortunes by now.AnemicOak ought to be getting tired of karma fortunes by now.AnemicOak ought to be getting tired of karma fortunes by now.AnemicOak ought to be getting tired of karma fortunes by now.
 
AnemicOak's Avatar
 
Posts: 10,405
Karma: 28889083
Join Date: Oct 2007
Location: Minnesota
Device: HDX 8.9, AuraHD, Nook HD+, Kindle 2,3,T , Opus, Nexus7, iPhone5, etc
Quote:
Originally Posted by JSWolf View Post
Actually, you cannot go PDF > AZW. AZW does not actually exist as a format when there is no DRM. If you strip the DRM from AZW, you get Mobipocket. So AZW without DRM is actually PRC/Mobi.
Sure it does (exist when there is no DRM). Non-DRM'd books purchased from Amazon are still .azw format. It may be the same thing as mobi, but it's got an AZW extension, making it an AZW file.
AnemicOak is offline   Reply With Quote
Old 08-17-2010, 06:45 PM   #6
beacher
Junior Member
beacher began at the beginning.
 
Posts: 3
Karma: 10
Join Date: Aug 2010
Device: Kindle
I tried the software suggested, as well as another, seems as soon as it's converted from PDF to anything, there is double spacing in between characters, almost randomly.

Maybe it's only these couple ebooks I'm trying (maybe they are OCR scans)...but it sure stinks. If I don't find any software, I might attempt writing a script that will remove the spaces.
beacher is offline   Reply With Quote
Old 08-17-2010, 07:04 PM   #7
dmin7th
Member
dmin7th began at the beginning.
 
Posts: 14
Karma: 10
Join Date: Dec 2009
Device: Kindle 2, 3 & DX
Quote:
Originally Posted by beacher View Post
If I don't find any software, I might attempt writing a script that will remove the spaces.
That's probably your best bet if it's something in the PDF that's weird. I edited some stuff in the HTML file that Mobipocket Creator generated before I built the .mobi file, and that worked pretty well for me.
dmin7th is offline   Reply With Quote
Old 08-17-2010, 09:54 PM   #8
tomsem
Wizard
tomsem ought to be getting tired of karma fortunes by now.tomsem ought to be getting tired of karma fortunes by now.tomsem ought to be getting tired of karma fortunes by now.tomsem ought to be getting tired of karma fortunes by now.tomsem ought to be getting tired of karma fortunes by now.tomsem ought to be getting tired of karma fortunes by now.tomsem ought to be getting tired of karma fortunes by now.tomsem ought to be getting tired of karma fortunes by now.tomsem ought to be getting tired of karma fortunes by now.tomsem ought to be getting tired of karma fortunes by now.tomsem ought to be getting tired of karma fortunes by now.
 
Posts: 2,444
Karma: 2519673
Join Date: Apr 2009
Location: USA
Device: iPod Touch, Xoom, Kindle PW, iPad3, Fire HD2
Quote:
Originally Posted by beacher View Post
I tried the software suggested, as well as another, seems as soon as it's converted from PDF to anything, there is double spacing in between characters, almost randomly.

Maybe it's only these couple ebooks I'm trying (maybe they are OCR scans)...but it sure stinks. If I don't find any software, I might attempt writing a script that will remove the spaces.
Could be, and as you suspect, it's dumping text that was generated with OCR from scanned images. Or it might be some form of copy protection (inserting random invisible whitespace to make conversion less attractive). Load it into Adobe Reader (free), try searching for some text and see if it can find things as expected, or see if copy/paste has the same issue.

Also, Adobe has a free PDF-to-text and PDF-to-HTML service. You might try it for another data point. I suspect the points will continue to all line up however.

Online conversion tools for Adobe PDF documents
tomsem is offline   Reply With Quote
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
AZW to EPUB conversion - overlapping letters suecsi Calibre 4 10-16-2010 11:53 PM
PDF to prc/azw Batch Conversion xsolitudex PDF 2 09-04-2010 10:19 AM
AZW Conversion elliskatz Introduce Yourself 7 08-14-2010 05:47 AM
Line Spacing on PDF to Epub conversion poodlemama Calibre 2 05-03-2010 08:28 PM
weird ascii character p3aul Calibre 7 10-14-2009 05:10 PM


All times are GMT -4. The time now is 08:44 AM.


MobileRead.com is a privately owned, operated and funded community.