05-26-2010, 04:39 AM | #16 |
Connoisseur
Posts: 98
Karma: 122982
Join Date: Apr 2010
Location: Humboldt County, California
Device: ipad, iPod touch, JetBook Lite
|
Hi greenapple,
I will try to get out a GUI front end for Windows within a week or so. I'm having to do this in java, and it's been a long time since I did any java programming. |
05-26-2010, 05:15 AM | #17 |
Evangelist
Posts: 404
Karma: 664
Join Date: Dec 2009
Device: Kindle Paperwhite, Kindle DX, Kobo Aura HD
|
Thanks, Pranananda. Looking forward to it!
|
Advert | |
|
05-27-2010, 03:22 PM | #18 |
Connoisseur
Posts: 98
Karma: 122982
Join Date: Apr 2010
Location: Humboldt County, California
Device: ipad, iPod touch, JetBook Lite
|
There is now a graphical user interface. You must have Java installed to run this interface, which you can get from http://java.com/download.
See the original post for the binaries and instructions. For Windows, the zip file contains all the binaries you need: pdfreflow.exe, pdftohtml.exe, and the Java jar file. For other platforms, you must already have pdfreflow and pdftohtml installed, and they must be in your path. There is a Help button that will bring up some online help. Last edited by Pranananda; 05-07-2023 at 10:02 PM. |
05-27-2010, 05:44 PM | #19 |
Grand Sorcerer
Posts: 6,208
Karma: 16534692
Join Date: Sep 2009
Location: UK
Device: Kobo: KA1, ClaraHD, Forma, Libra2, Clara2E. PocketBook: TouchHD3
|
This looks like a promising new PDF utility, Pranananda. Thank you for your hard work.
|
05-28-2010, 09:11 PM | #20 |
Man Who Stares at Books
Posts: 1,816
Karma: 10606722
Join Date: Mar 2010
Location: 50th State, USA. Also, PA, NY, CA, and elsewhere.
Device: All of the Above
|
Mighty Fine
Pranananda, the exe version you have on sourceforge seems to differ from the version you posted on MR. However, the newest version (0.8.6), with a GUI, is pretty good. I reflowed a pdf novel in 10 minutes flat, 9.5 of which were spent editing/proofing the resulting html file. The only fix, related to pdfreflow, was to change the style of p2 to text-align: center. The culprit Xml line was as follows:
<text top="179" left="110" width="244" height="71" font="5">BOOK TITLE </text> For some strange reason, there was no fontspec id="5" in the xml file, so I'm not sure how you interpreted the above. |
Advert | |
|
05-29-2010, 03:39 AM | #21 |
Connoisseur
Posts: 98
Karma: 122982
Join Date: Apr 2010
Location: Humboldt County, California
Device: ipad, iPod touch, JetBook Lite
|
jackie_w & Fat Abe, thanks for the positive feedback.
Abe, I just downloaded the zip files here on the original posts, and they do have the correct version (0.8.6) in them. The build times might be different because of my non automated techniques. But I did run the --version, and it reported 0.8.6. If people are having PDFs that should work but don't, I would love to hear about it and perhaps even get the PDF that is showing any defect in the reflow logic. |
05-29-2010, 04:20 AM | #22 |
Connoisseur
Posts: 98
Karma: 122982
Join Date: Apr 2010
Location: Humboldt County, California
Device: ipad, iPod touch, JetBook Lite
|
There is now an installer for a Macintosh user interface. It runs on Mac OS X Leopard and Snow Leopard, and it is in PDFReflow-0.8.6.1.dmg.zip.
The Windows version and the Ubuntu version of the user interface have been updated - PDFReflow-0.8.6.1-Setup.zip for Windows, and PDFReflow-0.8.6.1.jar.zip for Ubuntu. The command line version remains unchanged. The help has been corrected and enhanced on the user interface. Last edited by Pranananda; 05-07-2023 at 10:03 PM. |
05-29-2010, 02:44 PM | #23 |
Man Who Stares at Books
Posts: 1,816
Karma: 10606722
Join Date: Mar 2010
Location: 50th State, USA. Also, PA, NY, CA, and elsewhere.
Device: All of the Above
|
Wow, this keeps getting better and better. How about adding a box for font family, and a set of presets to automate the conversion process? I thank you for the effort you have put into the program. It is a godsend for those of us who are given documents in pdf format, but have to read them on small form factor eReaders.
|
06-03-2010, 05:27 PM | #24 |
Grand Sorcerer
Posts: 6,208
Karma: 16534692
Join Date: Sep 2009
Location: UK
Device: Kobo: KA1, ClaraHD, Forma, Libra2, Clara2E. PocketBook: TouchHD3
|
Hi Pranananda,
I'm having a few problems with a PDF I'm trying to reflow. In the output HTML, the first few lines of each chapter are out of sequence. Also, sometimes a multi-line chapter heading has its words out-of-sequence. I have attached a 2-page extract PDF which demos the problems. I would be grateful if you could find time to look at it and advise if/where I may be going wrong. I only set 2 parameters: Crop top = 98 and Crop bottom = 591 It doesn't surprise me that the chapter's initial DropCap might cause a problem, but the first few lines seem to be in the correct sequence in the XML but not the HTML. In addition, an unrelated minor problem I have found is that in the reflowed HTML, the opening <body> tag seems to have been output as a closing </body> tag, i.e the file has 2 closing body tags and no opening body tag. I assume this is a coding typo. |
06-03-2010, 11:03 PM | #25 |
Man Who Stares at Books
Posts: 1,816
Karma: 10606722
Join Date: Mar 2010
Location: 50th State, USA. Also, PA, NY, CA, and elsewhere.
Device: All of the Above
|
The first 7 lines specified in the xml file are:
<text top="45" left="130" width="184" height="18" font="0">C H A P T E R S I X </text> <text top="205" left="310" width="5" height="18" font="0"> </text> <text top="591" left="209" width="34" height="14" font="2"> 88 </text> <text top="221" left="79" width="289" height="25" font="3">THE JOURNEY FROM </text> <text top="248" left="106" width="236" height="25" font="3">PLATFORM NINE </text> <text top="275" left="66" width="315" height="25" font="3">AND THREE-QUARTERS </text> <text top="298" left="112" width="3" height="17" font="4"> </text> As presented, the page number 88 is specified on the 3rd line above, but is actually the last line of page 1. I have not looked at the source code for pdfreflow, but the actual line order that it should have decoded from the xml are the top locations 45, 205, 221, 248, 275, etc. However, the line heights of the sequence: THE JOURNEY FROM PLATFORM NINE AND THREE-QUARTERS cause the rendered sequence to be THE JOURNEY FROM AND THREE-QUARTERS PLATFORM NINE Just manually edit the xml file, and change the font size from 3 to 2 (in these lines), and then it will be in order again. Manually reorder the lines at top="299" and top="591". At top="463", there is a line height jump to 20 instead of the usual +16 due to an oversized font. After analyzing the xml file (which is a product of pdftohtml), I can sympathize with those developers who are working on pdf re-flowers. They seem to have to do some form of layout decoding and correction, as well as sorting and correction, to produce a perfect result. Last edited by Fat Abe; 06-03-2010 at 11:08 PM. |
06-04-2010, 03:11 AM | #26 | |||
Connoisseur
Posts: 98
Karma: 122982
Join Date: Apr 2010
Location: Humboldt County, California
Device: ipad, iPod touch, JetBook Lite
|
jackie_w,
Your example is showing up bugs in both pdftohtml and pdfreflow. Fat Abe's solution will work, for getting the titles and drop cap to work, another workaround is to change: Quote:
Quote:
Quote:
But there are also wrapping bugs in pdfreflow. It is wrapping too much text into some paragraphs, and getting confused about the start of new paragraphs, partly because of the drop cap. I am away from my home and home computer until this Sunday and it may be a week after I return before I put out version with a bug fix. |
|||
06-04-2010, 05:47 AM | #27 |
Grand Sorcerer
Posts: 6,208
Karma: 16534692
Join Date: Sep 2009
Location: UK
Device: Kobo: KA1, ClaraHD, Forma, Libra2, Clara2E. PocketBook: TouchHD3
|
@Abe and Pranananda, Thank you for taking the time to explain to me.
I look forward to the next release. |
07-11-2010, 12:30 PM | #28 |
Junior Member
Posts: 6
Karma: 10
Join Date: Jul 2010
Device: hanvon n516
|
It's the best I've ever seen! Being able to eliminating headers/footers and to reflow the text, it really is one of a kind! Thaaaank you, Pranananda!
|
08-30-2010, 04:24 AM | #29 |
Wizard
Posts: 4,520
Karma: 121692313
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-T1, Kobo Touch, Kobo Aura
|
Looks very very good!
|
08-30-2010, 04:23 PM | #30 |
Groupie
Posts: 185
Karma: 1004070
Join Date: Jul 2010
Location: Italy
Device: Kindle for Android, Google Play Books
|
I installed PDFReflow 0.8.6.1 on a Fedora 11 Linux system (I already had popper-utils), but I am unable to generate HTML output. If I run
Code:
pdftohtml -xml mybook.pdf Code:
pdfreflow mybook.xml When I directly run the GUI, select the PDF document and click Reflow, a new GUI instance is started. In all these cases, no HTML output is generated. |
Tags |
pdf, reflow, utility |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
<pre> tags and no text reflow in EPUB | sergio blum | Calibre | 24 | 10-14-2010 08:07 PM |
What is the best reader read real reflow PDF ( not refow text ) ? | familyhandh | Which one should I buy? | 1 | 08-05-2010 08:44 AM |
Help with reflow text file | siulayhumga | Workshop | 9 | 07-31-2010 06:36 PM |
80-column text reflow - Hanlin V3 | elewton | Other formats | 1 | 02-10-2009 05:00 AM |
Now that the Sony 505 can reflow PDFs ... | mollybo | Sony Reader | 6 | 07-27-2008 11:29 PM |