11-04-2014, 03:41 PM   #1
EnergyLens
Hack
EnergyLens began at the beginning.
 
Posts: 34
Karma: 12
Join Date: Dec 2009
Device: Kobo Aura HD, Kindle Paperwhite
AutoCrop (OpenWith solution for Briss)

Here is a little script that can be called from the OpenWith plugin in Calibre to automate the use of Briss <http://sourceforge.net/projects/briss/> to crop PDFs for easier reading on your eReader.

You need to make the file executable and edit this line in the file:

# Must specify the location of briss-0.9.jar on your system
path_to_briss = '/Applications/ebook tools/briss-0.9/briss-0.9.jar'
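
To show why that path matters, here is a minimal sketch of how a wrapper could hand a PDF to Briss. It is not the attached script: the helper names are hypothetical, and the `-s`/`-d` options assume the command-line mode of Briss 0.9, so check them against your own Briss version.

```python
import os
import stat
import subprocess

# Must specify the location of briss-0.9.jar on your system
path_to_briss = '/Applications/ebook tools/briss-0.9/briss-0.9.jar'


def build_briss_command(source_pdf, dest_pdf, jar=path_to_briss):
    """Return the argument list for a Briss run (hypothetical helper).

    Assumes Briss 0.9 accepts -s <source> and -d <destination>.
    """
    # A list of arguments (rather than one shell string) keeps paths
    # that contain spaces, such as 'ebook tools', intact.
    return ['java', '-jar', jar, '-s', source_pdf, '-d', dest_pdf]


def make_executable(script_path):
    """Add the execute bits to a file, similar to `chmod a+x`."""
    mode = os.stat(script_path).st_mode
    os.chmod(script_path, mode | stat.S_IXUSR | stat.S_IXGRP | stat.S_IXOTH)


def crop(source_pdf, dest_pdf):
    """Run Briss; raises CalledProcessError if Java exits with an error."""
    subprocess.run(build_briss_command(source_pdf, dest_pdf), check=True)
```

Passing the arguments as a list avoids having to quote the space in `ebook tools` by hand.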

For those on Mac: I just updated to Yosemite and had to re-install Java to keep Briss working: <http://support.apple.com/kb/DL1572>

The program makes a copy of your original PDF and adds it to the Calibre database as an ORIGINAL_PDF format (similar to ORIGINAL_EPUB). You can then try to teach your OS to open ORIGINAL_PDF files with the PDF reader of your choice (Preview refuses, so I had to use Adobe on Mac).
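
The backup step could look roughly like the sketch below. Again, this is not the attached script: the helper names are made up, and it assumes that `calibredb add_format` derives the format name from the file extension and that the book id can be read from the `Title (id)` folder name Calibre uses inside a library.

```python
import os
import re
import shutil


def book_id_from_path(pdf_path):
    """Read the id from a Calibre book folder named like 'Title (123)'.

    Returns None if the parent folder does not end in '(digits)'.
    """
    folder = os.path.basename(os.path.dirname(os.path.abspath(pdf_path)))
    match = re.search(r'\((\d+)\)$', folder)
    return int(match.group(1)) if match else None


def keep_original(pdf_path, backup_dir):
    """Copy the PDF into backup_dir with an .original_pdf extension."""
    name, _ext = os.path.splitext(os.path.basename(pdf_path))
    target = os.path.join(backup_dir, name + '.original_pdf')
    shutil.copy2(pdf_path, target)
    return target


def build_add_format_command(book_id, backup_file):
    """Argument list that would register the backup copy with Calibre."""
    return ['calibredb', 'add_format', str(book_id), backup_file]
```

The returned list can be passed to `subprocess.run(..., check=True)` once the uncropped copy is safely in place.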
Attached Files
File Type: zip AutoCrop.py.zip (1.9 KB, 725 views)
11-05-2014, 06:58 AM   #2
EnergyLens
I also wanted to note that the inspiration for this was not primarily to make PDFs more readable on eReaders (which it does), but to streamline cleanup after OCR of PDF files. Briss does a great job of automatically cutting off page numbers and chapter names which occur on every page, thus making the resulting text much more readable.

I use PDFOCRx on Mac, which does a great job with two-column PDFs, and produces soft line-wraps. Sometimes OCR works even better (in terms of cleanup) than extracting the text from the native PDF.

I also have a custom column in my Calibre library which sorts PDFs into three types: Scanned, Scan with embedded OCR data, and Native. I populate this column when loading new PDFs into the library, and later its value helps me decide how to process the file when converting to other formats.
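
If you would rather fill such a column from a script than by hand, a sketch along these lines might work. It assumes the `calibredb set_custom column id value` form, and `pdftype` is a made-up lookup name; run `calibredb custom_columns` to see what your own column is called.

```python
# The three categories named in the post.
PDF_TYPES = ('Scanned', 'Scan with embedded OCR data', 'Native')


def build_set_custom_command(book_id, pdf_type, column='pdftype'):
    """Argument list for `calibredb set_custom <column> <id> <value>`.

    'pdftype' is a hypothetical lookup name; substitute your own column.
    """
    if pdf_type not in PDF_TYPES:
        raise ValueError('unknown PDF type: %r' % (pdf_type,))
    return ['calibredb', 'set_custom', column, str(book_id), pdf_type]
```

Rejecting values outside the three categories keeps typos out of the column.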