Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Calibre > Conversion

Notices

Reply
 
Thread Tools Search this Thread
Old 10-15-2011, 02:26 PM   #1
sungkhum
Junior Member
sungkhum began at the beginning.
 
Posts: 3
Karma: 10
Join Date: Jan 2010
Device: Kindle
Creation of Complex Script E-Book Using Images for Every Word?

Hello,

I am looking for a way to create an e-book with the Khmer language (which is UTF-8). No reader currently supports Khmer that I am aware of, so I wanted to see if there is an "easy" way to convert each Khmer word a document (a plain text document is fine) to an image and use the images as the content of an e-book. I've seen images used for formulas and Greek and Hebrew, but never as the whole content of a book. Is it possible? How should I go about doing it? Is there currently any way to automatically convert every word of text to images to be used in an e-book?

Thanks,
Nathan
sungkhum is offline   Reply With Quote
Old 10-15-2011, 03:55 PM   #2
wallcraft
reader
wallcraft ought to be getting tired of karma fortunes by now.wallcraft ought to be getting tired of karma fortunes by now.wallcraft ought to be getting tired of karma fortunes by now.wallcraft ought to be getting tired of karma fortunes by now.wallcraft ought to be getting tired of karma fortunes by now.wallcraft ought to be getting tired of karma fortunes by now.wallcraft ought to be getting tired of karma fortunes by now.wallcraft ought to be getting tired of karma fortunes by now.wallcraft ought to be getting tired of karma fortunes by now.wallcraft ought to be getting tired of karma fortunes by now.wallcraft ought to be getting tired of karma fortunes by now.
 
wallcraft's Avatar
 
Posts: 6,975
Karma: 5183568
Join Date: Mar 2006
Location: Mississippi, USA
Device: Kindle 3, Kobo Glo HD
What you need is a format that allows embedded fonts, and a publically available font that supports Khmer. One good option is ePub, but it is not available on the Kindle.

Unfortunately, Kindle's AZW (MOBI) format does not support embedded fonts. So if you want something that works on a Kindle the best option is a PDF with embedded fonts that has a page size optimized for a small screen. Most PDFs don't work well on the Kindle because thay assume a letter/A4 sized page. Since you are creating your own PDF from a word processor (say), you can optimize the PDF either by changing the page size or by using a big font size. All that matters is how many words fit on a line (you want fewer words on a small screen).
wallcraft is offline   Reply With Quote
Advert
Old 10-16-2011, 02:18 AM   #3
ldolse
Wizard
ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.
 
Posts: 1,337
Karma: 123455
Join Date: Apr 2009
Location: Malaysia
Device: PRS-650, iPhone
Agree with Wallcraft on ePub.

For the Kindle you could use the Topaz format to accomplish this, but I don't know if Amazon provides the tools for small/independent publisher to create a Topaz book - there's no public domain/open source way to do it.
ldolse is offline   Reply With Quote
Old 10-18-2011, 11:29 PM   #4
sungkhum
Junior Member
sungkhum began at the beginning.
 
Posts: 3
Karma: 10
Join Date: Jan 2010
Device: Kindle
Thanks for your replies. I am interested more in doing an "image" based book (an image for each word would be best because then screen size would be less of an issue as the lines could break at each word) because Khmer Unicode really isn't supported by many platforms, so embedding a font wouldn't solve the issue.

Are there any other options other than Topaz that will convert each word into an image for use with an e-book reader?

Thanks,
Nathan
sungkhum is offline   Reply With Quote
Old 10-18-2011, 11:57 PM   #5
ldolse
Wizard
ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.
 
Posts: 1,337
Karma: 123455
Join Date: Apr 2009
Location: Malaysia
Device: PRS-650, iPhone
Topaz is the only option that does what you want exactly the way you describe.

PDF is your next best option - if you use Acrobat professional it can add 'hints' that will tell a reader where it can re-flow the text, but not all readers support that. Also make sure to create the pdf with the most common screen size in mind - 800x600 pixels.
ldolse is offline   Reply With Quote
Advert
Old 10-19-2011, 12:44 AM   #6
sungkhum
Junior Member
sungkhum began at the beginning.
 
Posts: 3
Karma: 10
Join Date: Jan 2010
Device: Kindle
Ok, thanks. I guess I will start looking around for ways to create the images from the words automatically and then revisit the ebook part.

Thanks again,
Nathan
sungkhum is offline   Reply With Quote
Old 10-19-2011, 09:19 AM   #7
Starson17
Wizard
Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.
 
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
Quote:
Originally Posted by ldolse View Post
Topaz is the only option that does what you want exactly the way you describe.
He might want to consider djvu as a format. It's useful for scanned books. I think of it as a cross between PDF and Topaz. It has multiple layers and divides the book into images and text. It compresses the images, like PDf, but it also handles scanned images of individual letters, sort of like Topaz. It compresses the first individual letter image it finds, then locates all the similar images of that letter on the page and links to the single scanned image of that letter. That way it doesn't have to store an image of the entire page, nor multiple images of the same letter.

The multiple layers allows a low res image (100dpi IIRC) to be displayed as you scroll through the pages quickly, and then the high res image gets overlaid as you stop. It's open source. Wikipedia has a summary article on it I believe.
Starson17 is offline   Reply With Quote
Old 10-19-2011, 11:52 AM   #8
ldolse
Wizard
ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.
 
Posts: 1,337
Karma: 123455
Join Date: Apr 2009
Location: Malaysia
Device: PRS-650, iPhone
Quote:
Originally Posted by Starson17 View Post
He might want to consider djvu as a format. It's useful for scanned books. I think of it as a cross between PDF and Topaz. It has multiple layers and divides the book into images and text. It compresses the images, like PDf, but it also handles scanned images of individual letters, sort of like Topaz. It compresses the first individual letter image it finds, then locates all the similar images of that letter on the page and links to the single scanned image of that letter. That way it doesn't have to store an image of the entire page, nor multiple images of the same letter.

The multiple layers allows a low res image (100dpi IIRC) to be displayed as you scroll through the pages quickly, and then the high res image gets overlaid as you stop. It's open source. Wikipedia has a summary article on it I believe.
Is there a reader which supports djvu? Agree the format is pretty appropriate to his requirements, but I wasn't aware of any readers (at least the usual suspects) supporting it... I don't believe djvu supports reflow, IMHO Topaz's ability to reflow is what makes it a pretty awesome format for this sort of application, shame Amazon doesn't provide tools to allow small publishers to leverage it.
ldolse is offline   Reply With Quote
Old 10-26-2011, 04:20 PM   #9
Starson17
Wizard
Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.
 
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
Quote:
Originally Posted by ldolse View Post
Is there a reader which supports djvu?
EBookDroid on Android is what I use.
Quote:
I don't believe djvu supports reflow, IMHO Topaz's ability to reflow is what makes it a pretty awesome format
AFAIK, you are correct - no reflow. It seems more oriented to reducing the size of the file as compared to pdfs for scanned pages that typically have lots of images - like textbooks and technical literature. They hoped to get it used for making scanned pages more accessible over bandwidth limited links.
Starson17 is offline   Reply With Quote
Reply

Tags
complex script, conversion, khmer, unicode


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Création d'un script de conversion automatique sur Internet ODT-to-EPUB nixSta Software 4 07-15-2011 03:09 AM
Linux script for images to Kindle Screensaver soymicmic Kindle Developer's Corner 4 01-28-2011 11:20 AM
Q: Tables, images, and word-wrap AndrewH Workshop 2 12-22-2010 02:34 AM
Request Need support for complex script languages: Arabic! osama.zaidiah enTourage Archive 0 11-06-2010 07:35 AM
Converting complex MS-word documents Eclipse General Discussions 15 06-22-2010 06:59 PM


All times are GMT -4. The time now is 11:25 AM.


MobileRead.com is a privately owned, operated and funded community.