11-06-2012, 08:16 PM   #10
BetterRed
null operator (he/him)
Quote:
Originally Posted by DoctorOhh
For the PDF files simply Saving to disk or Sending to device will embed the title and author in the title and author metadata fields of the PDF and it will place the tags in the keywords metadata field.
DrOhh, I was hoping you'd have a magic cure. As I've already said elsewhere (twice, I think):

Not if the Author field of the existing PDF file already contains text. If it is not empty, the existing value is not overwritten; the old value is retained. The Title is always overwritten, as one would expect.

Quote:
Originally Posted by DoctorOhh
I have no idea if the RTF format has any standard for metadata.
Maybe there are no de jure standards, but there are some well-known and well-understood 'conventions'. AFAIK they are as per the attached screenshot - I can edit them in MS Word and OpenOffice Writer, or at least I can with the versions I use. As for the data, what form it takes and where it lives - I think it's a plain-text \info group near the top of the file.

What's more, these conventions are known to and used by Calibre, at least in respect of Author and Title.

When I add an RTF (or a PDF) to Calibre, it extracts the Title and Author from the file's metadata. Note: I have 'Read metadata from file contents rather than file name' checked, because I prefer to retain the original source file names.

When I save an RTF to disk or do an RTF->RTF conversion, the Author is only updated if the Author field in the current file is empty - which is exactly the same behaviour as with PDFs.

I get around the problem by clearing the Author field in the RTF and PDF files before adding them to Calibre - but I often forget. I also make errors entering Author names, which are often foreign names from languages that use non-Latin scripts - so error detection is often slow, i.e. months later.
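For anyone who wants to script the 'clear the Author field first' step for RTF files, here is a rough Python sketch. It is not anything Calibre does itself. It assumes the author is stored in the RTF header as a simple {\author ...} group inside the {\info ...} group, which is where the RTF specification puts document properties. The function name clear_rtf_author is made up for this example, and it does not cope with nested groups inside the author value.

```python
import re

# Assumption: the author sits in the RTF header as a simple group such as
# {\author Jane Doe}. Escaped characters (\'e9, \{, \}) are tolerated, but
# nested groups inside the value are not handled.
AUTHOR_GROUP = re.compile(r"\{\\author\b(?:\\.|[^{}\\])*\}")


def clear_rtf_author(rtf_text: str) -> str:
    """Return rtf_text with the first author group emptied; no-op if absent."""
    # A function is used as the replacement so the backslash in the
    # replacement text is not interpreted as a regex escape.
    return AUTHOR_GROUP.sub(lambda m: r"{\author}", rtf_text, count=1)


sample = r"{\rtf1\ansi{\info{\title My Book}{\author Jane Doe}}Body text}"
print(clear_rtf_author(sample))
```

For real files, reading and writing with encoding="latin-1" should round-trip every byte unchanged, and it is safest to try this on a copy first. PDFs would need a proper PDF library rather than a regex.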

I've reported the problem on Launchpad.

The 'it doesn't matter because it's in the Calibre database' argument doesn't wash: I want the metadata to be available to the content-indexing IFilters.

BR
Attached Thumbnails
Name: Capture.JPG
Size: 41.6 KB

Last edited by BetterRed; 11-06-2012 at 08:21 PM.