03-23-2015, 09:48 AM | #661 |
Connoisseur
Posts: 77
Karma: 12
Join Date: Jun 2010
Device: Kindle
|
PDF Plugin to Identify if it has TEXT (OCR) or not
Would it be possible to have a plug-in that checks whether a PDF has a recognized (OCR) text layer or is just page images? If so, I think it would be very valuable/useful.
It could also fill a custom field and column so you could see and sort the PDFs that HAVE text and those that do not. Perhaps, if this too is possible, the plug-in could then run the Text Recognition on the PDF and re-save the file, or better still do that as a batch process for a group of files. |
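As a rough illustration of the check being asked for, here is a minimal sketch in plain Python. It is a byte-level heuristic, not a real PDF parser and not an existing calibre plugin: it assumes that a PDF with a text layer (typed or OCR'd) declares at least one `/Font` resource, while a pure scan usually declares none. The function names are made up for this example.

```python
import re
import zlib

# Hypothetical helper names; this is a heuristic scan, not a PDF parser.
_STREAM_RE = re.compile(rb"stream\r?\n(.*?)\r?\nendstream", re.DOTALL)


def _inflated_streams(data: bytes):
    """Yield each stream body, inflated when it turns out to be Flate data."""
    for match in _STREAM_RE.finditer(data):
        raw = match.group(1)
        try:
            # decompressobj tolerates trailing bytes after the compressed data
            yield zlib.decompressobj().decompress(raw)
        except zlib.error:
            yield raw  # not Flate data (e.g. a JPEG image); scan it as-is


def probably_has_text_layer(data: bytes) -> bool:
    """Guess whether a PDF carries text rather than only page images.

    Assumption: any text drawn on a page needs a /Font resource, so a file
    with no /Font anywhere is very likely image-only.
    """
    if b"/Font" in data:
        return True
    # PDF 1.5+ may hide dictionaries inside compressed object streams
    return any(b"/Font" in chunk for chunk in _inflated_streams(data))
```

A result of `True` only means fonts are declared, so it can give false positives (for example a scanned book with a typed cover page), and streams using filters other than Flate are not decoded. A plugin would more likely extract text per page with a proper PDF library and write the result to a yes/no custom column.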
03-23-2015, 10:33 AM | #662 | |
Ex-Helpdesk Junkie
Posts: 19,422
Karma: 85397180
Join Date: Nov 2012
Location: The Beaten Path, USA, Roundworld, This Side of Infinity
Device: Kindle Touch fw5.3.7 (Wifi only)
|
Quote:
|
|
03-23-2015, 11:38 AM | #663 |
Connoisseur
Posts: 77
Karma: 12
Join Date: Jun 2010
Device: Kindle
|
Thanks
I did not realize that - and have not used plugboards so I'll have to do a little reading/learning to see if I can figure that out.
Very much appreciated. |
03-23-2015, 11:40 AM | #664 |
Connoisseur
Posts: 77
Karma: 12
Join Date: Jun 2010
Device: Kindle
|
Oh - one clarification
When you said it "natively" handles that, are you saying it AUTOMATICALLY updates these fields WITHIN the PDF by default?
Or are you saying it is something that you have to TURN ON, and then use PLUGBOARDS to tweak the particulars? Or is it that BOTH aspects are handled through plugboards? |
03-23-2015, 12:02 PM | #665 |
Connoisseur
Posts: 77
Karma: 12
Join Date: Jun 2010
Device: Kindle
|
A PDF Split Plugin - Like the Epub Split Plugin
Hi,
I currently use, on a Mac, a program called PDF EXPLODE to break a PDF into chapter or section files.
[NOTE: I was trying to link to PDF EXPLODE, which comes from http://onekerato.me, but when I go there now I can't find a direct link to the program (although there are some interesting articles). I did find another program with the same name, just with different capitalization, that I am going to try out: PDF-eXPLODE.]
I have the Epub Split and Merge plugins installed, and I wonder if there would be a way to create a similar plugin for PDFs, either doing what PDF EXPLODE does or, if possible, calling PDF EXPLODE from the plug-in.
I know I can do this one file at a time, using OPEN WITH and then manually importing the results into Calibre. But I was wondering if a plug-in could do this internally in one step: select the PDF(s), click the plugin, and set how the new files' metadata should be saved (for example a tag such as "Chapter" or "Section", plus a tag identifying the original record (the full book), possibly linking back to it). It would then automatically add the results, named the way you want them to appear (i.e. CHAPTER NAME - BOOK NAME, or variations thereof). |
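The metadata side of that request can be sketched without touching the PDF itself. The following is only an illustration in plain Python of the naming and tagging scheme described above. `SplitPart`, `plan_split_metadata` and the `source:` tag prefix are hypothetical names invented here, and the actual page splitting would still need a PDF tool or library.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class SplitPart:
    """Metadata for one chapter/section file produced from a larger PDF."""
    title: str
    tags: List[str] = field(default_factory=list)


def plan_split_metadata(book_title: str, chapter_names: List[str],
                        part_tag: str = "Chapter") -> List[SplitPart]:
    """Build 'CHAPTER NAME - BOOK NAME' titles plus tags for each part."""
    parts = []
    for name in chapter_names:
        parts.append(SplitPart(
            title=f"{name} - {book_title}",
            # second tag points back at the full book (hypothetical convention)
            tags=[part_tag, f"source:{book_title}"],
        ))
    return parts
```

For example, `plan_split_metadata("My Book", ["Intro", "Methods"])` gives parts titled "Intro - My Book" and "Methods - My Book", each tagged "Chapter" and "source:My Book".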
03-23-2015, 12:03 PM | #666 | |
Guru
Posts: 631
Karma: 7544080
Join Date: Apr 2013
Location: Berlin
Device: PRS 350, Kobo Aura
|
Quote:
|
|
03-23-2015, 12:04 PM | #667 | |
Connoisseur
Posts: 77
Karma: 12
Join Date: Jun 2010
Device: Kindle
|
Quote:
|
|
03-23-2015, 12:05 PM | #668 |
Connoisseur
Posts: 77
Karma: 12
Join Date: Jun 2010
Device: Kindle
|
Thanks DickLoraine.
|
03-23-2015, 05:27 PM | #669 | |
Ex-Helpdesk Junkie
Posts: 19,422
Karma: 85397180
Join Date: Nov 2012
Location: The Beaten Path, USA, Roundworld, This Side of Infinity
Device: Kindle Touch fw5.3.7 (Wifi only)
|
Quote:
However, calibre will never write metadata into any format except, as dickloraine said, when you export the book (e.g. send-to-device, send-to-email, save-to-disk), use Embed Metadata (manually update all supported metadata in all supported formats), or use Polish Books (manually update metadata and do a lot more, for EPUB and AZW3 only). Plugboards are an optional extra that allows you to redefine metadata fields, e.g. instead of calibre writing the title field to the book's title metadata, you can tell calibre to write "title (series, Book #)" to the book's internal title metadata. Plugboards only take effect when exporting a book, not when using Embed Metadata/Polish Books. |
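To make the "title (series, Book #)" example concrete, here is a small plain-Python illustration of the kind of rewrite a plugboard performs on export. It is not calibre's template language and not calibre code. The function name is hypothetical, and real plugboards are set up in calibre's preferences with templates.

```python
from typing import Optional


def exported_title(title: str, series: Optional[str] = None,
                   series_index: Optional[float] = None) -> str:
    """Mimic a plugboard that writes 'title (series, Book #)' as the title."""
    if not series:
        return title  # no series: the title is written unchanged
    if series_index is None:
        return f"{title} ({series})"
    # :g drops a trailing ".0", so 2.0 is shown as 2 and 1.5 stays 1.5
    return f"{title} ({series}, Book {series_index:g})"
```

The library record keeps its separate title and series fields. Only the copy written out on export would carry the combined string.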
|
03-23-2015, 06:13 PM | #670 |
null operator (he/him)
Posts: 20,550
Karma: 26954694
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
@Philosopher - Calibre's Embed Metadata writes all the metadata to PDFs in an XML (XMP) block. I'm not sure which PDF readers will show it; probably Acrobat, given that XMP is Adobe spawn. PDF XChange Free does not show the XML metadata, but Pro might.
And as I posted in another of your threads, exiftool will show it. That's the one I rely on; by default it shows everything without the XML verbiage, just a simple two-column list -- label:value. BR |
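For anyone curious what that embedded block looks like without a dedicated tool, here is a minimal Python sketch that pulls the raw XMP packet out of a PDF's bytes. It assumes the packet is stored uncompressed and wrapped in the usual `<?xpacket begin ... ?>` / `<?xpacket end ... ?>` markers, which is common but not guaranteed. The function name is made up for this example.

```python
from typing import Optional


def find_xmp_packet(data: bytes) -> Optional[str]:
    """Return the first XMP packet found in the raw bytes, or None.

    Assumption: the metadata stream is not compressed, so the packet
    markers are visible in the file as plain bytes.
    """
    start = data.find(b"<?xpacket begin")
    if start == -1:
        return None
    end_marker = data.find(b"<?xpacket end", start)
    if end_marker == -1:
        return None
    end = data.find(b"?>", end_marker)  # close of the trailing marker
    if end == -1:
        return None
    return data[start:end + 2].decode("utf-8", errors="replace")
```

Usage would be along the lines of `find_xmp_packet(open("book.pdf", "rb").read())`, where `book.pdf` is a placeholder path. This returns the XML as-is. exiftool remains the more convenient way to read the values as a label:value list.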
03-25-2015, 04:32 PM | #671 |
Addict
Posts: 296
Karma: 1599870
Join Date: Jun 2012
Device: none
|
I would like to see a "Create Note" plugin. It would work identically to "Add Empty Book" except it would include an empty text file as a book format.
The reason is that I sometimes like to keep notes in Calibre about collections of books, such as series. Right now I drag and drop an empty text file into Calibre, edit the metadata, and double-click it to open it in a text editor to make my changes there. It would be more convenient to have a menu option that creates the text file for me, without having to go back to Windows Explorer and browse to where I keep the empty text file.
Heck, with this one little change, Calibre could be a serious note-taking app! Obviously it was never meant for that, which is why this would be better as a plugin than as a native feature.
Edit: I noticed you can create an empty epub with Add Empty Book, but that's not particularly useful for those of us who aren't familiar with editing epubs. Text files are so much easier to work with.
Last edited by fidvo; 03-25-2015 at 04:37 PM. |
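Until something like that exists, the manual steps can be scripted outside calibre. The sketch below is plain Python, not a calibre plugin: it creates an empty .txt file and builds (without running) a `calibredb add` command line for it. The helper names are hypothetical, and the use of `calibredb add --title` is an assumption to check against your installed version.

```python
import tempfile
from pathlib import Path
from typing import List, Optional


def make_empty_note(title: str, directory: Optional[str] = None) -> Path:
    """Create an empty .txt file named after the note title; return its path."""
    folder = Path(directory) if directory else Path(
        tempfile.mkdtemp(prefix="calibre-note-"))
    # keep letters, digits, spaces, '-' and '_'; replace anything else
    safe_name = "".join(
        c if c.isalnum() or c in " -_" else "_" for c in title
    ).strip() or "note"
    path = folder / f"{safe_name}.txt"
    path.touch()
    return path


def calibredb_add_command(note_path: Path, title: str) -> List[str]:
    """Build (but do not run) a command that adds the note to the library."""
    # assumption: calibredb is on PATH and accepts --title for 'add'
    return ["calibredb", "add", "--title", title, str(note_path)]
```

The returned list could be passed to `subprocess.run` once you are happy with it. Building the command separately keeps the sketch safe to try without touching a library.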
03-25-2015, 10:59 PM | #672 | |
Well trained by Cats
Posts: 29,778
Karma: 54830978
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Quote:
Step 2) Drop the txt file (created with your fav editor) into the formats box.
I create Series (Index.txt) files all the time for largish series (I cut from series listings on the web and paste into the txt file). |
|
03-25-2015, 11:15 PM | #673 | |
null operator (he/him)
Posts: 20,550
Karma: 26954694
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
Quote:
Add (press A) will always open to the last used folder, so just put a blank txt file somewhere 'adjacent'. BR
Last edited by BetterRed; 03-25-2015 at 11:21 PM. |
|
03-26-2015, 12:45 AM | #674 | |
Plugin Developer
Posts: 6,307
Karma: 3966249
Join Date: Dec 2011
Location: Midwest USA
Device: Kindle Paperwhite(10th)
|
Quote:
I've submitted a bug report/patch to add it. We'll see if Kovid accepts it or not. |
|
03-26-2015, 05:22 AM | #675 |
null operator (he/him)
Posts: 20,550
Karma: 26954694
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
@fidvo - another idea - create an empty book (ctrl/shift/e) and put notes in Comments, or in a long text custom column (#notes/Notes)
Then you can display them in the Book Details sidebar, and you'll also have the benefit of some formatting (bold, underline, etc.). BR |