View Single Post
Old 02-23-2023, 03:07 PM   #1
Lukusaukko
Connoisseur
Lukusaukko ought to be getting tired of karma fortunes by now.Lukusaukko ought to be getting tired of karma fortunes by now.Lukusaukko ought to be getting tired of karma fortunes by now.Lukusaukko ought to be getting tired of karma fortunes by now.Lukusaukko ought to be getting tired of karma fortunes by now.Lukusaukko ought to be getting tired of karma fortunes by now.Lukusaukko ought to be getting tired of karma fortunes by now.Lukusaukko ought to be getting tired of karma fortunes by now.Lukusaukko ought to be getting tired of karma fortunes by now.Lukusaukko ought to be getting tired of karma fortunes by now.Lukusaukko ought to be getting tired of karma fortunes by now.
 
Posts: 55
Karma: 392326
Join Date: Feb 2023
Device: Kobo Libra 2
Converting multicolumn PDF?

I've read the forums and saw that this question has come up in the past, and the answer has been that "it's in development" - however, these posts have been several years old, and the latest version still makes a mess of multiple column pdf's.
There are a few Linux-based tools that can convert multicolumn files relatively easily, such as pdftotext, but that removes italics and bolding. pdftohtml can also do it, but its output requires quite a lot of manual work to convert into a single-column format suitable for epub conversion. I've also tried using k2pdfopt to convert a pdf into single-column format to pass on to calibre, but that makes calibre choke - probably because the resulting file is not internally a true single-column pdf despite looking like it in a viewers.
Has the development of this functionality for Calibre been abandoned? Or are there any other tools - preferably available in linux - besides Acrobat itself that could be used to convert a multi-column pdf into epub or at least some intermediate format with italics and bolding intact?
Lukusaukko is offline   Reply With Quote