Quote:
Originally Posted by HarryT
Lots of PG books have page numbers, normally in square brackets in the text. I always remove them - I find them to be distracting.
I've been working on books at Distributed Proofreaders (http://www.pgdp.net/c/). They clean up books converted from scans and make nice clean copies available on Project Gutenberg (the automatically generated copies, which PG also has, tend to have a lot of OCR and formatting errors). Anyway, DP has a perhaps-overly-ornate multi-stage process: each page is proofed by three different people, then formatting is done and checked by multiple people, and then it goes to post-processing, where it's made into a text version and an HTML version. Someone else then checks that before it goes to PG. The epubs are automatically generated from the HTML versions, I believe, though there's talk of hand-massaging those too for optimum results.
I have been working in post-processing of late. The page numbers you describe are part of their standard. I recently finished a book that had about 30 pages of index, with every entry pointing to a page number. I spent a lot of time building links so that you could click on each entry in the index and be taken to the right page in the book. I hope that when it is converted to epub, all of that will come along for the ride, with or without page numbers.
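For anyone curious what that index linking looks like, here is a rough sketch in Python. It is not DP's actual tooling. It assumes each page in the HTML has an anchor with an id like "Page_12" and that index entries list bare page numbers after a comma and a space; the function name and the anchor prefix are made up for illustration.

```python
import re


def link_index_refs(index_line: str, anchor_prefix: str = "Page_") -> str:
    """Wrap each page number in an index line with a link to its page anchor.

    Assumption: page references are 1-4 digit numbers preceded by ", ",
    and the book's HTML has matching anchors such as id="Page_12".
    """
    return re.sub(
        r"(?<=, )(\d{1,4})\b",
        lambda m: f'<a href="#{anchor_prefix}{m.group(1)}">{m.group(1)}</a>',
        index_line,
    )


# Hypothetical index entry, just to show the shape of the result.
print(link_index_refs("Whales, 12, 47, 203"))
```

A real index has ranges, "see also" entries and roman numerals that would need extra handling, so a pass like this only gets you part of the way and the result still has to be checked by hand.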