Old 09-09-2007, 06:05 PM   #121
kovidgoyal
creator of calibre
Posts: 45,600
Karma: 28548974
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
Why not just embed a Python interpreter into your C++ app and use the Python code directly? Or better yet, rewrite pdflrf in Python; that's going to be a *lot* easier than translating Python into C++.
Old 09-09-2007, 11:34 PM   #122
KEM
Junior Member
Posts: 1
Karma: 10
Join Date: Sep 2007
Device: PRS500
Just tried it.

I just tried your tool on a 2 column pdf. The file is very readable on the PRS500 and the tool was easy to use.

Overall it worked great, but the picture on the front page was converted to 4 pages, and the last line of each page repeats on the following page. Is there something that I can change in my setup to fix that?
Old 09-10-2007, 12:03 AM   #123
cacapee
Connoisseur
Posts: 77
Karma: 1393
Join Date: Aug 2007
Location: Santa Monica
Device: prs-500
Set overlap to 0
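
A minimal sketch of why a non-zero overlap repeats the last line of each page, assuming the converter cuts one tall PDF page into screen-sized slices. The helper `slice_page` and its parameters are hypothetical; pdflrf's actual slicing code is not shown in this thread.

```python
def slice_page(page_height, screen_height, overlap):
    """Return (top, bottom) pixel ranges that cut one tall page into screens.

    Hypothetical helper for illustration only, not pdflrf's real logic.
    """
    if not 0 <= overlap < screen_height:
        raise ValueError("overlap must be >= 0 and smaller than screen_height")
    step = screen_height - overlap  # how far each new slice advances
    slices = []
    top = 0
    while top < page_height:
        bottom = min(top + screen_height, page_height)
        slices.append((top, bottom))
        if bottom == page_height:
            break
        top += step
    return slices


# With overlap, the bottom 40 pixels of each slice reappear at the top of the next.
with_overlap = slice_page(page_height=2000, screen_height=800, overlap=40)
# With overlap set to 0, each slice starts exactly where the previous one ended.
no_overlap = slice_page(page_height=2000, screen_height=800, overlap=0)
```

If the overlap is about one text line tall, the repeated strip is exactly the "last line repeats on the following page" effect described above.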
Old 09-12-2007, 03:25 PM   #124
evgen
Junior Member
Posts: 3
Karma: 10
Join Date: Dec 2006
Device: Sony Reader
One additional advantage of linking in the Python interpreter is that you get access to some nice tools like pyPdf, which can do nice things regarding PDF manipulation (e.g. write the TOC catalog with a small bit of hacking, etc.). I was able to swap it into my hacked version of pdfread to eliminate the obnoxious external dependency on pdftk.
Old 09-13-2007, 02:38 PM   #125
kovidgoyal
creator of calibre
Posts: 45,600
Karma: 28548974
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
Quote:
Originally Posted by evgen
One additional advantage of linking in the Python interpreter is that you get access to some nice tools like pyPdf, which can do nice things regarding PDF manipulation (e.g. write the TOC catalog with a small bit of hacking, etc.). I was able to swap it into my hacked version of pdfread to eliminate the obnoxious external dependency on pdftk.
You have a hacked version of pyPdf that can read the TOC catalog? If so, can you send it to me: kovid _the usual email address separator_ kovidgoyal.net
Old 09-13-2007, 06:25 PM   #126
Vienna01
Old Dog Learns New Tricks
Posts: 123
Karma: 142
Join Date: Nov 2006
Location: Maryland USA
Device: Sony PRS-500,PocketBook 301, Sony 650
pdflrfwin.exe vs pdflrf_gui?

Confused pdflrfwin.exe vs pdflrf_gui?
Old 09-15-2007, 06:49 AM   #127
ereszet
Zealot
Posts: 118
Karma: 306
Join Date: Sep 2007
Device: Sony PRS-500 Archos 704 wifi
I have followed all your pdflrf releases with growing amazement at what you have achieved and how quickly you have responded to new demands and challenges. Apart from all the options that pdflrf offers, it is extremely fast. Believe me, I have tried scores of different programs/utilities (DOS/Windows/Ubuntu) to process pdf/djvu photos of old books (like Google books) before OCR-ing them with Finereader, and none is even close to your program. Thank you.
And now my humble suggestion. Can you include pdf as an output? The Sony Reader is only one of many toys for reading books, while the pdf format is universal. It would be so useful to have the readability of pdf files improved before OCR-ing them and storing them in my laptop library or reading them on the Archos 704 (I just ordered it and hope that the 7" screen will make a difference compared to Sony's 6").
For your info, my workflow before discovering pdflrf was: 1. reading page images or pdfs into Finereader, 2. recognizing blocks of text/images, 3. saving images with blocks only (no white space surrounding them), 4. reading the images back into Finereader, 5. OCR-ing, 6. saving to pdfs (text under image). Of course the original page images require a lot of cleaning before going to Finereader, because otherwise all black margins or blobs would be recognized as blocks and prevent removal of the white space surrounding the text.
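
The "remove white space surrounding the text" step can be sketched roughly as follows. This is only an illustration under stated assumptions: the page is a plain list of rows of grayscale values (0 = black, 255 = white), and `content_bbox`, `crop`, and `white_threshold` are made-up names, not anything from Finereader or pdflrf.

```python
def content_bbox(pixels, white_threshold=250):
    """Return (left, top, right, bottom) of the non-white area, or None.

    right and bottom are exclusive. Any pixel darker than white_threshold
    counts as content, so stray black blobs will enlarge the box.
    """
    rows = [y for y, row in enumerate(pixels)
            if any(v < white_threshold for v in row)]
    if not rows:
        return None  # page is entirely white
    cols = [x for x in range(len(pixels[0]))
            if any(row[x] < white_threshold for row in pixels)]
    return (cols[0], rows[0], cols[-1] + 1, rows[-1] + 1)


def crop(pixels, bbox):
    """Cut the image down to the given bounding box."""
    left, top, right, bottom = bbox
    return [row[left:right] for row in pixels[top:bottom]]


page = [
    [255, 255, 255, 255, 255],
    [255,   0,  40, 255, 255],
    [255, 255,  10, 255, 255],
    [255, 255, 255, 255, 255],
]
bbox = content_bbox(page)
trimmed = crop(page, bbox)
```

It also shows why dirty scans need cleaning first: a single dark pixel in a margin is enough to pull the bounding box out to the page edge.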
Old 09-15-2007, 06:54 AM   #128
ereszet
Zealot
Posts: 118
Karma: 306
Join Date: Sep 2007
Device: Sony PRS-500 Archos 704 wifi
Is pdf output possible in future releases?


BTW, as this is my first post to the forum and I had problems sending it (I had to log in a number of times), I am sorry if this post appears more than once.
Old 09-15-2007, 09:19 AM   #129
DrMoze
Booknut
Posts: 860
Karma: 2852
Join Date: Jul 2007
Location: West Palm Beach, Florida!
Device: Sony Reader 500/505/300/350, Nook Glowlight Plus (6")
I just tried the pdflrfwin.exe utility (after finding it in this thread). It's the first time I was able to successfully convert *anything* into lrf format! And the results make some pdfs (which I had given up on) quite readable. Yay!

Question: Are there any settings that can reduce the output file size, perhaps at the expense of a slight impairment in print darkness? For example, a 1.5MB pdf file was converted to 18.5MB! Another pdf of 64kB became 1.1MB. At this rate, I can only fit a few medium-size pdfs on the entire reader! (Yes, I have an SD card, but I like using groups on the internal memory...)

Just wondering. But thanks for a great and easy-to-use utility!
Old 09-15-2007, 06:17 PM   #130
cacapee
Connoisseur
Posts: 77
Karma: 1393
Join Date: Aug 2007
Location: Santa Monica
Device: prs-500
Version 0.8 adds support for the Table of Contents in pdf files. It is now possible to preview output images to test out various settings. An experimental Linux build (built on Ubuntu) has been added. Threading support is improved, so processing should be faster. The default number of colors is now 4, to reduce the frequency of file size questions. More filtering options and better dithering have been added, so images should look a lot better.

Last edited by cacapee; 09-15-2007 at 06:21 PM.
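
For readers wondering what dithering down to 4 colors might involve, here is a rough sketch using Floyd-Steinberg error diffusion. That choice of algorithm is an assumption: the post does not say which dithering method version 0.8 uses, and `dither_to_levels` is a hypothetical name. The image is a plain list of rows of grayscale values from 0 to 255.

```python
def dither_to_levels(pixels, levels=4):
    """Floyd-Steinberg dither of a grayscale image to evenly spaced gray levels."""
    height, width = len(pixels), len(pixels[0])
    work = [[float(v) for v in row] for row in pixels]  # copy; accumulates error
    step = 255.0 / (levels - 1)  # distance between output gray levels
    out = [[0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            old = work[y][x]
            # Snap to the nearest allowed level, clamped to the valid range.
            index = min(levels - 1, max(0, int(round(old / step))))
            new = index * step
            out[y][x] = int(round(new))
            err = old - new
            # Push the rounding error onto pixels not yet processed.
            if x + 1 < width:
                work[y][x + 1] += err * 7 / 16
            if y + 1 < height:
                if x > 0:
                    work[y + 1][x - 1] += err * 3 / 16
                work[y + 1][x] += err * 5 / 16
                if x + 1 < width:
                    work[y + 1][x + 1] += err * 1 / 16
    return out


flat_gray = [[128] * 8 for _ in range(8)]
dithered = dither_to_levels(flat_gray, levels=4)
```

A flat mid-gray area comes out as a mix of the two nearest levels rather than one solid tone, which is what keeps scanned pages from looking banded at low color counts.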
Old 09-15-2007, 11:19 PM   #131
DrMoze
Booknut
Posts: 860
Karma: 2852
Join Date: Jul 2007
Location: West Palm Beach, Florida!
Device: Sony Reader 500/505/300/350, Nook Glowlight Plus (6")
Ah, # of colors is (of course) the main reason for larger files.
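
A back-of-the-envelope sketch of why the color count drives file size, assuming each output page is stored as a bitmap at a fixed number of bits per pixel. The 600x800 page size is an assumption for illustration, `raw_image_bytes` is a hypothetical helper, and real LRF files compress their images, so actual sizes will be smaller; only the proportions matter here.

```python
def raw_image_bytes(width, height, colors):
    """Uncompressed size in bytes of one page image with the given palette size."""
    # Bits needed to index `colors` distinct values (at least 1 bit).
    bits_per_pixel = max(1, (colors - 1).bit_length())
    total_bits = width * height * bits_per_pixel
    return (total_bits + 7) // 8  # round up to whole bytes


# Assumed 600x800 page image at different palette sizes.
size_4 = raw_image_bytes(600, 800, 4)
size_16 = raw_image_bytes(600, 800, 16)
size_256 = raw_image_bytes(600, 800, 256)
```

Under these assumptions a 256-color page takes four times the raw space of a 4-color page, and that factor applies to every page in the book.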

BTW, v8 (pdflrfwin.exe) quits every time I try to convert a second file. After I select the second file, the program exits. (v7 did not have this problem.) I'm running WinXP Pro SP2.
Old 09-16-2007, 12:40 AM   #132
cacapee
Connoisseur
Posts: 77
Karma: 1393
Join Date: Aug 2007
Location: Santa Monica
Device: prs-500
I cannot reproduce it here. Does it happen with any kind of file?
Old 09-16-2007, 01:52 AM   #133
timestory
Junior Member
Posts: 2
Karma: 10
Join Date: Aug 2007
Device: PRS500
Quote:
Originally Posted by cacapee
Version 0.8 adds support for the Table of Contents in pdf files. It is now possible to preview output images to test out various settings. An experimental Linux build (built on Ubuntu) has been added. Threading support is improved, so processing should be faster. The default number of colors is now 4, to reduce the frequency of file size questions. More filtering options and better dithering have been added, so images should look a lot better.

It is great to have the ToC feature. I just converted a PDF book with one level of bookmarks; when I imported the lrf file to the reader, I found that all those links are stored in the Table of Contents under the menu for this book.

But when I converted a PDF book with multi-level bookmarks (one root) and imported it, I found that only the root item was stored in the Table of Contents; all the others were skipped.
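
One way to handle multi-level bookmarks is to walk the bookmark tree recursively and emit a flat list, rather than reading only the top level. The sketch below assumes a made-up nested-dict layout (`title`, `page`, `children`); it is not pdflrf's data structure or its actual fix.

```python
def flatten_outline(nodes, depth=0):
    """Yield (depth, title, page) for every bookmark, parents before children."""
    for node in nodes:
        yield depth, node["title"], node["page"]
        # Recurse into nested bookmarks; leaf nodes have no "children" key.
        yield from flatten_outline(node.get("children", []), depth + 1)


# Hypothetical outline with a single root, like the book described above.
outline = [
    {"title": "Book", "page": 1, "children": [
        {"title": "Chapter 1", "page": 3, "children": [
            {"title": "Section 1.1", "page": 4},
        ]},
        {"title": "Chapter 2", "page": 20},
    ]},
]
toc = list(flatten_outline(outline))
```

Reading only the top-level list here would give just the "Book" entry, which matches the symptom of only the root item showing up.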
Old 09-16-2007, 02:12 AM   #134
cacapee
Connoisseur
Posts: 77
Karma: 1393
Join Date: Aug 2007
Location: Santa Monica
Device: prs-500
Ah, this will be fixed in the next release
Old 09-16-2007, 01:24 PM   #135
leha
Member
Posts: 14
Karma: 10
Join Date: Nov 2006
Device: prs-500
wine

Hi, thanks for the nice converter. As a Linux user I have a remark. I am not sure how portable your code is, or whether you are interested in writing a Linux GUI for your utility, but it might be easier to keep your Windows version Wine-compatible and skip the GUI for Linux. Of course I would like to see a nice KDE frontend, but for now it is easier to run the Windows version in Wine than the command-line Linux utility.
All times are GMT -4.


MobileRead.com is a privately owned, operated and funded community.