Originally Posted by SomeNewGuy
Can someone explain how exactly the process of converting works?? Why does it take so long to finish?? And why is it that difficult??
GIGO (garbage in, garbage out) is why it takes long(er).
PDF is probably the worst. (OK, clay tablets. They won't fit into any floppy drive.)
PDF is a finished PAGE. The pieces that make up the page do not need to be stored in linear order. It is like a puzzle: you keep placing pieces until it is done. Same with PDF.
Guessing the order the pieces need to appear in can be a challenge.
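To make the puzzle idea concrete, here is a rough Python sketch of how a converter might guess the reading order. Everything in it is an assumption for illustration: the Fragment type, the guess_reading_order function, the line_tolerance value and the sample coordinates are all made up, and real converters use much more involved heuristics.

```python
from typing import NamedTuple


class Fragment(NamedTuple):
    """Hypothetical text piece as it might be pulled off a PDF page."""
    x: float   # left edge of the piece
    y: float   # baseline, measured up from the bottom of the page
    text: str


def guess_reading_order(fragments, line_tolerance=2.0):
    """Guess a linear reading order from positions alone.

    Assumes a single-column page: pieces are grouped into lines by
    similar y, lines run top to bottom, pieces in a line left to right.
    """
    # Larger y is higher on the page, so sort by descending y.
    ordered = sorted(fragments, key=lambda f: -f.y)
    lines = []
    for frag in ordered:
        # Same line if the baseline is close to the line's first piece.
        if lines and abs(lines[-1][0].y - frag.y) <= line_tolerance:
            lines[-1].append(frag)
        else:
            lines.append([frag])
    return "\n".join(
        " ".join(f.text for f in sorted(line, key=lambda f: f.x))
        for line in lines
    )


# The pieces arrive in storage order, not reading order.
page = [
    Fragment(120, 700, "world"),
    Fragment(72, 650, "Second"),
    Fragment(72, 700, "Hello"),
    Fragment(130, 651, "line"),
]
print(guess_reading_order(page))
```

This only works for a simple one-column page. With two columns, sidebars, footnotes or tables, sorting by position alone puts things in the wrong order, and that is where the guessing gets hard.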
Some formats don't allow any flexibility in style and layout. Simple TXT (like made with Notepad.exe) has nothing but spaces and characters. No font faces or font sizes. (HTML is a text file with markup tags that the viewer can interpret.)
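A small sketch of the TXT versus HTML difference, using Python's standard html.parser module. The TextOnly class, the strip_markup function and the sample string are hypothetical names and data picked for the example. It shows that once the tags are thrown away, only the plain characters are left, which is all a TXT file ever has.

```python
from html.parser import HTMLParser


class TextOnly(HTMLParser):
    """Collects the character data and ignores every markup tag."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        # Called for the text between tags; the tags themselves are skipped.
        self.parts.append(data)


def strip_markup(html):
    parser = TextOnly()
    parser.feed(html)
    parser.close()
    return "".join(parser.parts)


sample = "<p>Plain <b>bold</b> and <i>italic</i> words.</p>"
print(strip_markup(sample))
```

Going from HTML to TXT like this is the easy direction. Going the other way means guessing formatting that was never stored in the file.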
DOC files are loaded with all the fonts and stuff; the problem is that the messy internals result in huge converted files.
Some of the other source types are somewhere in between TXT and (X)HTML (the core of EPUB), with detectable formatting content.