|
|
#1 |
|
Connoisseur
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 57
Karma: 12544
Join Date: Jul 2018
Device: None
|
Is it possible to extract identifier values from epub file?
The website archiveofourown.org has a handy feature to directly download the entire story straight into epub format
The epub they provide you even contains syntax for url identifier where you downloaded it from. From the builtin "edit book" option in calibre below is the first few lines of the main file of the epub: *~*~*~*~*~*~* <?xml version='1.0' encoding='utf-8'?> <html xmlns="http://www.w3.org/1999/xhtml" lang="en" xml:lang="en"> <head> <meta content="ie=edge" http-equiv="x-ua-compatible"/> <title>The Way of Serenity</title> <meta http-equiv="Content-Type" content="text/html; charset=utf-8"/> <link rel="stylesheet" type="text/css" href="stylesheet.css"/> <link rel="stylesheet" type="text/css" href="page_styles.css"/> </head> <body class="calibre"> <div id="preface" class="calibre1"> <h2 class="toc-heading" id="calibre_toc_2">Preface</h2> <p class="message"> <b class="calibre2">The Way of Serenity</b><br class="calibre1"/> Posted originally on the <a href="http://archiveofourown.org/">Archive of Our Own</a> at <a href="https://archiveofourown.org/works/63545731">https://archiveofourown.org/works/63545731</a>. </p> *~*~*~*~*~*~* Would there be any way to use Calibre's conversion process to scrape this url text and use it as the identifer url (or just use it in the comments text)? |
|
|
|
![]() |
| Tags |
| conversation, identifier, url |
|
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Bulk-copying identifier values | tn4w | Calibre | 3 | 12-28-2022 03:18 AM |
| Command that outputs a list of identifier values | tn4w | Calibre | 4 | 11-14-2022 08:51 AM |
| extract first line of html/text file pointed by TOC from epub | michaelbr | ePub | 1 | 09-03-2022 10:31 AM |
| Error while parsing file: different playOrder values for navPoint/navTarget/pageTarge | libre75 | Sigil | 4 | 04-04-2021 08:24 AM |
| Extract pages from large epub file | mykemitchell | Editor | 2 | 01-04-2015 08:13 PM |