View Single Post
Old 09-02-2018, 05:34 AM   #53
sealbeater
Banned
sealbeater ought to be getting tired of karma fortunes by now.sealbeater ought to be getting tired of karma fortunes by now.sealbeater ought to be getting tired of karma fortunes by now.sealbeater ought to be getting tired of karma fortunes by now.sealbeater ought to be getting tired of karma fortunes by now.sealbeater ought to be getting tired of karma fortunes by now.sealbeater ought to be getting tired of karma fortunes by now.sealbeater ought to be getting tired of karma fortunes by now.sealbeater ought to be getting tired of karma fortunes by now.sealbeater ought to be getting tired of karma fortunes by now.sealbeater ought to be getting tired of karma fortunes by now.
 
Posts: 666
Karma: 1752814
Join Date: Jan 2008
Device: Sony Reader PRS-505 : Onyx Boox Max : Sony PRS-900 : Onyx Kepler Pro
Quote:
Originally Posted by darryl View Post
@sealbeater. If it is that easy, do it.
As I already said, I have neither the time nor the interest. I may tho, I just may...I'm curious to see how good xml to html would be.

Quote:
Originally Posted by darryl View Post
I won't bore you with the history but there are many books available only as pdf's. I avoid this format like the plague on e-ink readers.
Good for you. I remember when pdfs were difficult on e-readers. As I already stated, I've converted plenty. I'm aware of the pitfalls and I'm also aware of the workarounds.

Quote:
Originally Posted by darryl View Post
Your comment about ispell in your last post simply showcases your ignorance, as it is showcased at many other places in your posts on this subject.
Somehow, I doubt it's *my* ignorance that's being showcased. Certainly hasn't been so far.

You can think I'm ignorant if you like however. I don't mind.

Quote:
Originally Posted by darryl View Post
Spell checkers generally are zero help with things like homophones or many OCR errors or layout errors.
You don't say. Why would they be? Did I say somewhere that they would be?

Quote:
Originally Posted by darryl View Post
Nor do they deal with things like page headers and footers including page numbers which are fine with a fixed layout but not with a reflowable epub. .
I think I mentioned sed. Do you know what that is?


Quote:
Originally Posted by darryl View Post
Putting it quite bluntly I have never found a tool which takes a pdf as input and produces an intermediate format or an epub which does not require substantial manual editing. Results vary from readable to rubbish..
Perhaps that depends on your method and the type of pdf you were sourcing. Different pdfs require different methods. I have found the Poppler toosl quite adequete in taking pdfs as input and outputting intermediate formats. Have you ever converted a pdf to xml?

Putting it quite bluntly, no matter what, I've managed to find ways.

Quote:
Originally Posted by darryl View Post
With a little coding knowledge I'm sure it is trivial to write a script that converts pdf to epub very badly.
I'm glad you are only speaking for yourself.

Quote:
Originally Posted by darryl View Post
Do you think it is trivial to write such a script which reliably produces near perfect results? How about even marginally acceptable results? If you do it is time to put up or shut-up. If not, then .....
Why are you asking me questions I have already answered? "Near perfect results". I was quite clear when I said "good enough to suit my needs" and the results would depend on the quality of the pdf but yes, I do.

As for putting up or shutting up, I'm required to do neither. I don't jump though hoops for you. Even if I were to write such a script, I doubt you would be able to even run it. I doubt you have even heard of half of the tools I could use.
sealbeater is offline   Reply With Quote