conversion using EBWLibrarian


sassanik
09-03-2008, 05:22 AM
Okay, so this is probably not the right spot to ask this question, but I am hoping that you guys can point me in the right direction.

I regularly import books into EBW Librarian, which then converts them to the imp format.

Rather annoyingly, a number of books have conversion issues, mostly with ' and " being replaced with a ?. While the book is still readable, it would be nice to not have that happen.

Is there a way to help prevent this? Would changing the font before import help? Suggestions, ideas?

Whatever ConvertLit does, it seems to make the files import the best, better than html, txt, or rtf. It has the fewest problems when importing the books, probably because I am using the oeb format?

Anyway, any suggestions, ideas, or pointers in the direction of another forum?

Thanks!

Amy

ashkulz
09-03-2008, 07:34 AM
Most probably the book was produced by Word or some variant thereof, and uses smart quotes directly as characters. During conversion, the ebook publisher runs the file through HTML Tidy internally, and that is what causes the ? characters. The solution is simple: install Tidy (http://tidy.sourceforge.net/#binaries) yourself, and from the command line type:

tidy -m -win1252 input.html

That should convert the file in place to a form that is suitable for input to the EBW Librarian (or any other conversion tool).
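If installing Tidy is not an option, the same idea can be roughed out in a few lines of Python. This is only a sketch, not what Tidy or the EBW Librarian actually does internally: it assumes the file really is Windows-1252 encoded, and the helper name `straighten_quotes` is made up for illustration. It decodes the bytes and swaps the four curly quote characters for plain ' and ".

```python
# Hypothetical helper for illustration; not part of Tidy or EBW Librarian.
# Assumption: the input bytes are Windows-1252 (cp1252), as Word often produces.

# Map the four "smart" quote characters to their plain ASCII equivalents.
SMART_TO_ASCII = str.maketrans({
    "\u2018": "'",   # left single quote
    "\u2019": "'",   # right single quote / apostrophe
    "\u201c": '"',   # left double quote
    "\u201d": '"',   # right double quote
})


def straighten_quotes(raw: bytes, encoding: str = "cp1252") -> str:
    """Decode raw bytes and replace curly quotes with straight ones."""
    # errors="replace" keeps going on bytes that are undefined in cp1252.
    text = raw.decode(encoding, errors="replace")
    return text.translate(SMART_TO_ASCII)


if __name__ == "__main__":
    # In cp1252, 0x93/0x94 are curly double quotes and 0x92 is the apostrophe.
    sample = b"\x93It\x92s fine,\x94 she said."
    print(straighten_quotes(sample))
```

To use it on a real book you would read the file in binary mode, pass the bytes through `straighten_quotes`, and write the result back out before importing. Unlike Tidy, this does not touch the markup at all, so it will not fix any other HTML problems in the file.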

sassanik
09-03-2008, 05:03 PM
Thanks, I will give it a try! Hopefully that fixes the issue.

Amy