Originally Posted by andym
Out of interest (I've just been spending way too much time restoring the accents in the PG text of Nostromo)). Do you have dictionary software that will restore accents automatically?
Well... We're using a dictionnary for hyphenation on PDF files. We're not changing any accents yet, guess it could be added on our todo list for preprocessing with also curly quotes.
bowerbird: On Project Gutenberg, italics are indicated with _ not all caps. I'll take a look at what all caps is used for exactly, guess that's another thing that we could add to our preprocessing.