Quote:
Originally Posted by sealbeater
The suggestion to drop all of my current projects, including my day job, was the respondent's, not mine.
As for scripting it out in 20 minutes, I don't know *exactly* how long it would take for me to do it but I don't think it would be overly difficult. Take the 20 minute timeframe as an indicator of the relative difficulty of the problem, not an estimation of exact time spent.
Most of it would be generating the epub file and how best to output the pdf for cleaning.
|
Maybe you can.
Have you written a decompiler before? Or a BASIC interpreter? Or a program to convert, say, FORTRAN TO C++?
That is the scope of the problem.
See, pdf is not a simple data format. It is a full programming language, derived/extended from Postscript. pdf files are software, not bitmaps or encoded text blocks. You can write games and malware in pdf.
https://security.stackexchange.com/q...ontain-a-virus
This may help get you started:
https://nubuntu.org/postscript-vs-pdf
http://tailrecursive.org/postscript/postscript.html
Converting pdf to an editatable format is one of the great challenges of the age. Many have tried, millions in currency have been spent, none have fully succeeded. All require extensive manual cleanup.
If you succeed, people will shower you with cash.
Good luck!