What I'm doing when I scan books is including page numbers as ID's on paragraph tags since I use HTML as my master copy. This means that the page number might be off by a paragraph, but it will be close enough to work as a reference. I'm not currently including a list of links to the page numbers in the file, they're more there if I want to make an index later. Plus, of course, some of them are missing since my scan+OCR process misses a few and fiction books often leave out numbers on chapter heading pages.
I don't currently make an opf with publication details - is there a good tool for editing that? What other information is left out of the HTML? I don't put cover scans in the file but I keep them with the book (I want not to have the cover scans in the lrf on my reader).
|