#1 |
Connoisseur
Posts: 57
Karma: 12544
Join Date: Jul 2018
Device: None
Is it possible to extract identifier values from epub file?
The website archiveofourown.org has a handy feature that lets you download an entire story directly in EPUB format.

The EPUB it provides even contains the URL the story was downloaded from. Using Calibre's built-in "Edit book" tool, here are the first few lines of the EPUB's main file:

*~*~*~*~*~*~*
<?xml version='1.0' encoding='utf-8'?>
<html xmlns="http://www.w3.org/1999/xhtml" lang="en" xml:lang="en">
<head>
<meta content="ie=edge" http-equiv="x-ua-compatible"/>
<title>The Way of Serenity</title>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8"/>
<link rel="stylesheet" type="text/css" href="stylesheet.css"/>
<link rel="stylesheet" type="text/css" href="page_styles.css"/>
</head>
<body class="calibre">
<div id="preface" class="calibre1">
<h2 class="toc-heading" id="calibre_toc_2">Preface</h2>
<p class="message">
<b class="calibre2">The Way of Serenity</b><br class="calibre1"/>
Posted originally on the <a href="http://archiveofourown.org/">Archive of Our Own</a> at <a href="https://archiveofourown.org/works/63545731">https://archiveofourown.org/works/63545731</a>.
</p>
*~*~*~*~*~*~*

Is there any way to have Calibre's conversion process scrape this URL and store it as the book's url identifier (or at least put it in the comments)?
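To make the goal concrete, here is a rough sketch of the scraping step done outside Calibre, in plain Python. It assumes the EPUB is an ordinary zip archive and that the URL always has the `https://archiveofourown.org/works/<digits>` form seen in the preface above. The function name `find_work_url` and the regular expression are my own illustration, not anything Calibre provides, and writing the result back into Calibre's metadata is a separate step not shown here.

```python
import re
import zipfile

# Assumption: work URLs look like the one in the preface above,
# i.e. archiveofourown.org/works/ followed by a numeric ID.
WORK_URL_RE = re.compile(r"https?://archiveofourown\.org/works/\d+")


def find_work_url(epub_path):
    """Return the first AO3 work URL found in the EPUB's (X)HTML files, or None.

    Hypothetical helper for illustration; it only reads the archive.
    """
    with zipfile.ZipFile(epub_path) as epub:
        for name in epub.namelist():
            # Only look at content documents, not images, CSS, or the OPF.
            if not name.lower().endswith((".xhtml", ".html", ".htm")):
                continue
            text = epub.read(name).decode("utf-8", errors="replace")
            match = WORK_URL_RE.search(text)
            if match:
                return match.group(0)
    return None
```

Calling `find_work_url("story.epub")` (the file name is a placeholder) on a download whose preface matches the snippet above should give back the work URL as a string, or `None` if no such link is present.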
Tags |
conversation, identifier, url |