Quote:
Originally Posted by cadele
I am going to start to keep some stats - you have inspired me!
|
Glad to hear I have inspired someone else to start keeping stats. I love keeping stats on things that I do. You can get cool things like this:
I have liberated 18,172,166 words from PDF -> EPUB since October 2012 (although I haven't updated the stats in about a month).
And for EPUBs that I read for pleasure and cleaned as I went along: 2,870,128 words.
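For anyone wanting to gather similar numbers, here is a rough sketch of one way to count words in an EPUB. This is not necessarily how the totals above were produced. It assumes the book is an ordinary EPUB (a ZIP archive with XHTML/HTML content files) and counts every such file, so navigation and title pages get included too. The names `count_words_in_html` and `count_words_in_epub` are made up for this example.

```python
import re
import zipfile
from html.parser import HTMLParser

# Tags that normally end a word even when no whitespace follows them.
BLOCK_TAGS = {"p", "div", "br", "li", "td", "tr", "blockquote",
              "h1", "h2", "h3", "h4", "h5", "h6"}


class _TextExtractor(HTMLParser):
    """Collects text nodes, skipping <script> and <style> contents."""

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1
        elif tag in BLOCK_TAGS:
            self.chunks.append(" ")

    def handle_endtag(self, tag):
        if tag in ("script", "style"):
            if self._skip_depth:
                self._skip_depth -= 1
        elif tag in BLOCK_TAGS:
            self.chunks.append(" ")

    def handle_data(self, data):
        if not self._skip_depth:
            self.chunks.append(data)


def count_words_in_html(markup):
    """Count words in one HTML/XHTML string (hypothetical helper)."""
    parser = _TextExtractor()
    parser.feed(markup)
    parser.close()
    # Inline tags (e.g. a drop cap in a <span>) do not split words,
    # because chunks are joined without extra spaces.
    text = "".join(parser.chunks)
    # A word is a run of word characters, optionally joined by ' or -.
    return len(re.findall(r"\w+(?:['’-]\w+)*", text))


def count_words_in_epub(path):
    """Sum the word counts of every (X)HTML file inside the EPUB."""
    total = 0
    with zipfile.ZipFile(path) as book:
        for name in book.namelist():
            if name.lower().endswith((".xhtml", ".html", ".htm")):
                markup = book.read(name).decode("utf-8", errors="replace")
                total += count_words_in_html(markup)
    return total
```

Calling `count_words_in_epub("some_book.epub")` returns a single integer, which can then be logged per book in a spreadsheet. What counts as a "word" (hyphenated terms, numbers, footnote markers) is a judgment call, so totals from different tools will not match exactly.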
I will have to go through and add a page count to all of the books as well... that might also lead to some decent stats/graphs. (Although in my opinion, pages are a horrible unit of measurement: a page of non-fiction =/= a page of fiction =/= a page out of a journal/newspaper =/= a page set in a different font/font size/margins.) And how would you even go about measuring "pages" of text from an HTML source?
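One possible answer for HTML sources is to skip real pages entirely and derive a nominal page count from the word count. A minimal sketch, assuming a fixed words-per-page figure: the default of 250 is only a placeholder assumption, not a standard, and `estimate_pages` is a made-up name.

```python
def estimate_pages(word_count, words_per_page=250):
    """Nominal page count: word count divided by an assumed page size."""
    if words_per_page <= 0:
        raise ValueError("words_per_page must be positive")
    # Integer ceiling division, so a partial page counts as a full page.
    return (word_count + words_per_page - 1) // words_per_page


print(estimate_pages(2_870_128))   # 11481
print(estimate_pages(18_172_166))  # 72689
```

This keeps the number comparable across books, but it is just the word count in different clothing, so it does nothing to fix the fiction vs. non-fiction vs. font-size problem.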
Quote:
Originally Posted by cadele
Then I open the file in Abbyy Finereader 12 and verify the text. This is slow but worth it. I then convert it to a Word document. Following that I set up my page size and layout. I usually try to match the book's general layout without being too OCD about it.
|
Sounds ok. I guess different workflows for different people.
I personally just do all the fixing in minimalist HTML (EPUB), and then I can go back to other formats if needed.
DOC is really a horrible/bloated "source" format. Too much cruft and too many inconsistencies get added in because of the WYSIWYG editing.
And speaking of trying to "match page size/layout"... Here is a sample of some of my latest ventures into working backwards from EPUB -> LaTeX -> PDF:
I still have to iron out a few kinks... but I have the basics of the workflow going... now I just have a lot more to learn/absorb/code.
Quote:
Originally Posted by cadele
Finally I add the book to Calibre, download the metadata and add the cover, then convert it to EPub and Mobi (both types of Mobi).
|
Hmmm... so a DOC -> Calibre -> EPUB/MOBI conversion? Does that give you the cleanest output?
I probably sound like a broken record, but why not use Toxaris's Word Macro?