comet

03-10-2012, 12:45 PM

Hello,

I noticed on amazon.de that many English titles they formerly sold in Topaz are no longer available. But the ones I bought from those that are still available seem to be of much higher quality. Is there an improved version of the Topaz format?

DaleDe

03-10-2012, 02:30 PM

I do not believe the format has anything to do with the quality. Topaz is the format used for OCR documents, and it is likely you are seeing the results of some proofing. See our wiki for more information on this format.

comet

03-10-2012, 02:49 PM

I have already read the wiki. But if I understand it correctly, the only factor in quality seems to be the quality of the scan. Is this right?

DaleDe

03-10-2012, 05:53 PM

What exactly are the quality improvements you are seeing in the books you bought?

Dale

comet

03-11-2012, 05:50 AM

Two kinds (I read mathematical texts in Topaz format). First, some time ago mathematical expressions were often missing parts, such as an index. Now the expressions are always complete. Second, the mathematical expressions are now integrated very nicely into the surrounding text with respect to their size and shape.

DaleDe

03-11-2012, 11:19 PM

It sounds like they have improved the processing of the data with regard to mathematical text. Topaz uses a technique very similar to that used in DJVU. You can read about how DJVU works in our wiki. To be more precise, it is like IW44, which is a simplified subset of DJVU. I suspect they have improved the glyph location and sizing techniques to handle mathematical equations better. The database itself would not need changing to accommodate this sort of improvement; however, the scan processing would need to be better. Glad to hear about this improvement.
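
To illustrate why better glyph location and sizing would not require a change to the database, here is a minimal sketch in Python. It is not the actual Topaz layout; the names `Placement`, `render`, `glyphs`, `old_page` and `new_page` are all hypothetical, and the strings stand in for stored glyph shapes. It only assumes that a page is a shared glyph dictionary plus a list of placement records.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Placement:
    """One glyph instance on a page (hypothetical record, not the real format)."""
    glyph_id: int   # index into the shared glyph dictionary
    x: float        # horizontal position on the page
    y: float        # vertical position on the page
    scale: float    # size relative to the stored glyph shape


def render(glyphs, placements):
    """Return (shape, x, y, scale) tuples in drawing order."""
    return [(glyphs[p.glyph_id], p.x, p.y, p.scale) for p in placements]


# Stand-ins for glyph shapes extracted from the scan.
glyphs = {0: "x", 1: "i"}

# Older processing: the small index next to "x" was never detected.
old_page = [Placement(0, 10.0, 20.0, 1.0)]

# Better processing: same record type, but the index is now found
# and placed lower and smaller than the base glyph.
new_page = [Placement(0, 10.0, 20.0, 1.0), Placement(1, 16.0, 23.0, 0.6)]
```

Under these assumptions both pages use the same kind of record; only the values produced from the scan differ, which would match seeing complete, well-sized expressions without any new version of the format.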

Dale