Register Guidelines E-Books Search Today's Posts Mark Forums Read

Go Back   MobileRead Forums > E-Book Software > Calibre

Notices

Reply
 
Thread Tools Search this Thread
Old 09-20-2010, 08:25 AM   #1
Sweetpea
Grand Sorcerer
Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.
 
Sweetpea's Avatar
 
Posts: 9,707
Karma: 32763414
Join Date: Dec 2008
Location: Krewerd
Device: Pocketbook Inkpad 4 Color; Samsung Galaxy Tab S6
Adding PDF books

Is there a way for Calibre to read my PDFInfo files when I add PDF books?

When I add an epub book, the fields are all nicely filled (author, title, tag, etc). But with PDF books it completely ignores my PDFInfo fields (which are nicely filled, at least, Author, Title and Subject are).

If it doesn't, is there a way for it to read the folder structure (by using the reg expression)? Because the filenames of my books only contain a series index (if applicable) and the title of the book. The author (and series, again if applicable) are folder names...
Sweetpea is offline   Reply With Quote
Old 09-20-2010, 09:18 AM   #2
Starson17
Wizard
Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.
 
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
Quote:
Originally Posted by Sweetpea View Post
Is there a way for Calibre to read my PDFInfo files when I add PDF books?

When I add an epub book, the fields are all nicely filled (author, title, tag, etc). But with PDF books it completely ignores my PDFInfo fields (which are nicely filled, at least, Author, Title and Subject are).

If it doesn't, is there a way for it to read the folder structure (by using the reg expression)? Because the filenames of my books only contain a series index (if applicable) and the title of the book. The author (and series, again if applicable) are folder names...
No and no. Calibre can only pick up metadata from inside the file or from the filename, not from external files (pdfinfo) or the folder names. The usual solutions for your case are to either send the metadata inside the pdf file or change the filename with a file rename utility to include your metadata/foldernames.
Starson17 is offline   Reply With Quote
Old 09-20-2010, 10:06 AM   #3
Sweetpea
Grand Sorcerer
Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.
 
Sweetpea's Avatar
 
Posts: 9,707
Karma: 32763414
Join Date: Dec 2008
Location: Krewerd
Device: Pocketbook Inkpad 4 Color; Samsung Galaxy Tab S6
Quote:
Originally Posted by Starson17 View Post
No and no. Calibre can only pick up metadata from inside the file or from the filename, not from external files (pdfinfo) or the folder names. The usual solutions for your case are to either send the metadata inside the pdf file or change the filename with a file rename utility to include your metadata/foldernames.
The PDFInfo is inside the PDF file. It's called PDFInfo in LaTeX... you can see that data when you look at the properties of the PDF file.
Sweetpea is offline   Reply With Quote
Old 09-20-2010, 10:10 AM   #4
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 43,774
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
make sure your pdf metadata reader plugin is enabled in Preferences->Advanced->Plugins->Metadata rader plugins
kovidgoyal is online now   Reply With Quote
Old 09-20-2010, 10:58 AM   #5
Starson17
Wizard
Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.
 
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
Quote:
Originally Posted by Sweetpea View Post
The PDFInfo is inside the PDF file. It's called PDFInfo in LaTeX... you can see that data when you look at the properties of the PDF file.
In that case, assuming it's just the normal internally stored pdf metadata, it should work. You need to have the option in Prefs|Import/Export|Add to "read metadata from file contents" turned on. Some pdf files cause problems for the code library used by Calibre to handle pdfs and, as a result, some people have turned off the pdf metadata reader. If you've done that, turn it back on, per Kovid's comment.

edit: You should also be aware that Calibre will pick up metadata in an opf file stored with any other format in preference to the other two methods. If there's any chance you have some sort of .opf file with the same name as the pdf, it would override the other metadata.

Last edited by Starson17; 09-20-2010 at 11:19 AM.
Starson17 is offline   Reply With Quote
Old 09-21-2010, 02:16 AM   #6
Sweetpea
Grand Sorcerer
Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.
 
Sweetpea's Avatar
 
Posts: 9,707
Karma: 32763414
Join Date: Dec 2008
Location: Krewerd
Device: Pocketbook Inkpad 4 Color; Samsung Galaxy Tab S6
Quote:
Originally Posted by kovidgoyal View Post
make sure your pdf metadata reader plugin is enabled in Preferences->Advanced->Plugins->Metadata rader plugins
Hmm, it is turned on... Maybe it simply won't read it, because I generate my PDF's myself...
Sweetpea is offline   Reply With Quote
Old 09-21-2010, 02:47 AM   #7
itimpi
Wizard
itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.
 
Posts: 4,552
Karma: 950151
Join Date: Nov 2008
Device: Sony PRS-950, iphone/ipad (Marvin/iBooks/QuickReader)
Quote:
Originally Posted by Sweetpea;
Hmm, it is turned on... Maybe it simply won't read it, because I generate my PDF's myself...
Does this mean you have an alternative source format available? PDF is the worst format for conversion purposes so if you have an alternative you should consider using this instead.
itimpi is offline   Reply With Quote
Old 09-21-2010, 06:00 AM   #8
Sweetpea
Grand Sorcerer
Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.
 
Sweetpea's Avatar
 
Posts: 9,707
Karma: 32763414
Join Date: Dec 2008
Location: Krewerd
Device: Pocketbook Inkpad 4 Color; Samsung Galaxy Tab S6
Quote:
Originally Posted by itimpi View Post
Does this mean you have an alternative source format available? PDF is the worst format for conversion purposes so if you have an alternative you should consider using this instead.
My sources are HTML and the PDF's are the converted files. But I don't want to add all my formats to Calibre, because I have no need for multiple copies of all my files. Especially as my .OPF files are all called the same...
Sweetpea is offline   Reply With Quote
Old 09-21-2010, 06:05 AM   #9
itimpi
Wizard
itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.
 
Posts: 4,552
Karma: 950151
Join Date: Nov 2008
Device: Sony PRS-950, iphone/ipad (Marvin/iBooks/QuickReader)
HTML is the best format for conversion, so if you have that available use it for the conversion.
itimpi is offline   Reply With Quote
Old 09-21-2010, 06:26 AM   #10
Sweetpea
Grand Sorcerer
Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.
 
Sweetpea's Avatar
 
Posts: 9,707
Karma: 32763414
Join Date: Dec 2008
Location: Krewerd
Device: Pocketbook Inkpad 4 Color; Samsung Galaxy Tab S6
Quote:
Originally Posted by itimpi View Post
HTML is the best format for conversion, so if you have that available use it for the conversion.
As I said, HTML is my source format, PDF is the converted format... But I don't want to add the HTML + epub + mobi + PDF files. I don't like the fact that Calibre makes a copy of each book. I was just exploring if I could use Calibre to keep track of the books on my devices. I don't use the GUI to convert, the only conversion I use is by command-line (from mobi to HTML, the rest I do using my own tools).
Sweetpea is offline   Reply With Quote
Old 09-21-2010, 07:52 AM   #11
Starson17
Wizard
Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.
 
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
Quote:
Originally Posted by Sweetpea View Post
Especially as my .OPF files are all called the same...
If you have .opf files named the same as the .pdf file, they will override the pdf data and prevent it from being read. If you're having trouble importing a pdf, remove the .opf file from that folder and test that way.
Starson17 is offline   Reply With Quote
Old 09-21-2010, 10:17 AM   #12
Sweetpea
Grand Sorcerer
Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.Sweetpea ought to be getting tired of karma fortunes by now.
 
Sweetpea's Avatar
 
Posts: 9,707
Karma: 32763414
Join Date: Dec 2008
Location: Krewerd
Device: Pocketbook Inkpad 4 Color; Samsung Galaxy Tab S6
Quote:
Originally Posted by Starson17 View Post
If you have .opf files named the same as the .pdf file, they will override the pdf data and prevent it from being read. If you're having trouble importing a pdf, remove the .opf file from that folder and test that way.
No, my OPF's are all called metadata.opf...

I have a folder:

Source/authorname/bookname

which contains my metadata, toc.ncx and all other files in a nice epub-like structure

I have another folder:

PDF/authorname

which contains all books by that author which has been generated by using the files in my source/authorname/bookname folder.

When I try to add said PDF, it will not read the pdfinfo metadata, which can be seen if I open it with a pdf reader. Which means I have to manually change every single PDF and that kind of defeats the purpose of using metadata and thus, Calibre.
Sweetpea is offline   Reply With Quote
Old 09-21-2010, 10:47 AM   #13
Starson17
Wizard
Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.
 
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
Quote:
Originally Posted by Sweetpea View Post
When I try to add said PDF, it will not read the pdfinfo metadata, which can be seen if I open it with a pdf reader. Which means I have to manually change every single PDF and that kind of defeats the purpose of using metadata and thus, Calibre.
I was trying to find reasons why your pdf metadata wasn't being read. It apparently boils down to some incompatability between the way you create your internal pdf metadata and the library used by Calibre to read it.
As Kovid has said in connection with another PDF metadata problem:
Quote:
That bug is in the PDF library that calibre uses to set PDF metadata. Unfortunately, I don't know of any library that manages to set or for that matter even read PDF metadata with 100% success.
It looks like your problem is similar. I believe Calibre uses PoDoFo to parse and manipulate PDF metadata. I know PoDoFo was updated recently, and the new code was brought into Calibre to fix other issues with that library. If you decide your pdf metadata format is valid and PoDoFo is failing, you may want to make a bug report to them:
http://sourceforge.net/projects/podofo/
Starson17 is offline   Reply With Quote
Old 09-21-2010, 11:35 AM   #14
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 43,774
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
Actually calibre uses poppler to read metadata, which is yet another library The reason for that is poppler allows rendering of the PDF, so calibre can render the first page as the cover.

Are you sure your PDF metadata is in the PDFInfo structure and not in the extended XML metadata?
kovidgoyal is online now   Reply With Quote
Old 09-21-2010, 12:01 PM   #15
Starson17
Wizard
Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.
 
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
Quote:
Originally Posted by kovidgoyal View Post
calibre uses poppler to read metadata
Does poppler replace PoDoFo, or do they do some different things? In addition to poppler, I know pdftohtml is used (or at least I think so). Are there any others used for pdf handling or manipulation?
Starson17 is offline   Reply With Quote
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
delete old books after adding iconeo Calibre 4 05-21-2010 01:03 AM
Adding empty books vbussell Calibre 4 12-10-2009 05:42 PM
Help with adding books please stustaff Calibre 12 10-27-2009 02:30 PM
Adding cover image to PDF? silvijakk Bookeen 2 04-02-2009 03:33 PM
Trouble adding PDF file kmsulli Sony Reader 1 02-20-2007 10:26 PM


All times are GMT -4. The time now is 10:28 AM.


MobileRead.com is a privately owned, operated and funded community.