A while ago, I made an archive of a website that I needed to use offline. On this website are around 700 articles stored in individual HTML files that are full of useful info that I now want to load into my eReader app. However, I have hit an issue with importing them directly, as the title on the webpage isn't the name of the article, it's the website itself. An unedited import leads to 700 identical entries. However, the actual titles of the articles are stored within the HTML files under the headings tags (specifically,
for the title, and
for the author's name).
Is there any way to get Calibre to automatically extract these headings and use them for metadata entries, or at least change the title to the name of the article?