#1
Junior Member
Posts: 3
Karma: 10
Join Date: Jun 2017
Device: Kindle
Mixed Characters issue
Hi everyone,
I'm trying to convert a book from PDF to MOBI. It's a study book that mixes Japanese and Portuguese text. When I convert it directly with Calibre, every Japanese part breaks. For example:

What it should be: "This structure is always used in this way: ownerのthing."
What Calibre is doing: "This structure is always used in this way: owner thing . の"

(Yes, when Japanese and Latin characters are in the same sentence, the Japanese characters get moved to after all the Latin characters.)

Does anyone know how to fix this without having to convert to AZW3 and edit the HTML? Editing by hand does work, but the book is more than 13k lines of HTML.

Thanks in advance
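For anyone who does end up editing by hand, a rough sketch like the one below might at least narrow down which of the 13k lines need attention. It is only a heuristic based on the single broken example above: it flags lines where Japanese characters are left dangling after the sentence-ending punctuation. The character ranges and the names `JAPANESE`, `DISPLACED`, and `find_displaced` are assumptions made for illustration, and it expects plain text lines with any HTML tags already stripped.

```python
import re

# Hiragana, katakana, and common CJK ideographs (an assumed, non-exhaustive range).
JAPANESE = r"[\u3040-\u30ff\u4e00-\u9fff]"

# Sentence-ending punctuation followed only by Japanese characters and spaces.
DISPLACED = re.compile(rf"[.!?]\s*(?:{JAPANESE}\s*)+$")


def find_displaced(lines):
    """Yield (line_number, line) pairs that look like displaced Japanese text."""
    for number, line in enumerate(lines, start=1):
        if DISPLACED.search(line.rstrip()):
            yield number, line


if __name__ == "__main__":
    sample = [
        "This structure is always used in this way: ownerのthing.",
        "This structure is always used in this way: owner thing . の",
    ]
    for number, line in find_displaced(sample):
        print(number, line)
```

This only locates suspicious lines; it does not try to guess where the Japanese characters originally belonged, so the fix itself would still be manual.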
#2
null operator (he/him)
Posts: 21,721
Karma: 29711016
Join Date: Mar 2012
Location: Sydney Australia
Device: none
@biellz1221 - Conversion from PDF is problematic; see Read this before Posting PDF Questions.
If you created the book, what did you use to create it? That would almost certainly be a better place to start.
BR
#3
Junior Member
Posts: 3
Karma: 10
Join Date: Jun 2017
Device: Kindle
I tried converting the PDF into a Word file. It came out with fewer errors than the MOBI, but calibre does not convert from .doc or .docx.
#4
Wizard
Posts: 4,553
Karma: 950151
Join Date: Nov 2008
Device: Sony PRS-950, iphone/ipad (Marvin/iBooks/QuickReader)
#5
null operator (he/him)
Posts: 21,721
Karma: 29711016
Join Date: Mar 2012
Location: Sydney Australia
Device: none
However, if you have MS Word editing skills and your knowledge of editing ePub internals (XHTML and CSS) is limited, I suggest you correct the errors in Word, save as DOCX, and then convert the DOCX. See Format specific conversion tips.

I would convert to EPUB; you may need to do a bit of editing of the EPUB with the calibre editor before converting it to a Kindle format.

This Word add-on might be useful to you: e-Book Tools - a Word add-in. It includes tools specifically targeted at dealing with messy documents created from scanning and PDF conversions, and it can create the EPUB directly.

BR
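The DOCX to EPUB to Kindle route described above could be scripted roughly as follows. This is a minimal sketch, assuming calibre's `ebook-convert` command-line tool is on your PATH and using its plain `ebook-convert input output` form with no extra options. The helper names `build_commands` and `convert` and the file name `book.docx` are placeholders invented for the example.

```python
import shutil
import subprocess
from pathlib import Path


def build_commands(docx_path):
    """Return the two ebook-convert invocations: DOCX -> EPUB, then EPUB -> MOBI."""
    src = Path(docx_path)
    epub = src.with_suffix(".epub")
    mobi = src.with_suffix(".mobi")
    return [
        ["ebook-convert", str(src), str(epub)],
        ["ebook-convert", str(epub), str(mobi)],
    ]


def convert(docx_path):
    """Run both conversion steps, stopping at the first failure."""
    if shutil.which("ebook-convert") is None:
        raise RuntimeError("ebook-convert not found on PATH; is calibre installed?")
    for cmd in build_commands(docx_path):
        subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Only prints the commands; call convert("book.docx") to actually run them.
    for cmd in build_commands("book.docx"):
        print(" ".join(cmd))
```

Keeping the two steps separate leaves room to open the intermediate EPUB in the calibre editor and tidy it up before running the second conversion.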
#6
Junior Member
Posts: 3
Karma: 10
Join Date: Jun 2017
Device: Kindle
I work as a front-end developer, so editing XHTML and CSS is fine, but with 13k lines it's just too much work.

An update on my MS Word attempt: I was able to convert .docx, but not .doc, and the result of .docx -> .mobi was surprisingly good. The characters worked well, as if they were in a single charset, and the Kindle displayed them just fine. I'll just adjust the .docx input to get a finer MOBI result.

Thanks for all the help =D I'll definitely try to get in touch with the teachers and talk about having an eReader-friendly version of their book. I'm sure it'll help many of my course mates =D
#7
null operator (he/him)
Posts: 21,721
Karma: 29711016
Join Date: Mar 2012
Location: Sydney Australia
Device: none
@biellz1221 - Sounds like you're cooking with gas - good luck with your Japanese studies.
BR