MobileRead Forums - View Single Post

compurandom · 03-05-2020, 07:06 AM

Most pdf authors don't bother to set the metadata in the PDF correctly.

Guessing the metadata from the content might work for well formatted metadata like ISBN, but to get the rest, you'd have to "read" it semantically. Care to write an AI for that? I have trouble myself finding the real copyright date in gutenberg books.

03-05-2020, 07:06 AM	#1329
compurandom Wizard Posts: 1,031 Karma: 500000 Join Date: Jun 2015 Device: Rocketbook, kobo aura h2o, kobo forma, kobo libra color	Most pdf authors don't bother to set the metadata in the PDF correctly. Guessing the metadata from the content might work for well formatted metadata like ISBN, but to get the rest, you'd have to "read" it semantically. Care to write an AI for that? I have trouble myself finding the real copyright date in gutenberg books.