MobileRead Forums - View Single Post - Epub3 Foot- End-notes

Tex2002ans · 07-25-2021, 05:30 PM

Quote:

Originally Posted by mrevent

I'd like to have the main text of the book not interrupted with those numerous footnotes (which would be the case were I to attempt to convert the "printed" version of the page to epub) [...].

Looks like working from their "print view" would still be your best bet.

All the footnote text is there + the code is all clean.

All you'd have to do is use a few regexes to convert their code into the EPUB footnote HTML already discussed.

Quote:

Originally Posted by mrevent

The website in question is a great one: UC Press e-books collection, where 700 of them are accessible by the public.

The book in question is The Fabrication of Labor by R. Biernacki.

In their "print view", each of the "pages" has this basic form:

<hr> between pages
<div> page number
or for basic paragraphs
[##] for footnote paragraphs
[##] for footnote numbers.

Here's the relevant code for page 8:

Spoiler:

So, what I'd do is 2 regexes:

Search: \[(\d+)\]
Replace: <a class="ref" href="#fn\1" id="ft\1">[\1]</a>

Search: \[(\d+)\]
Replace: <a href="#ft\1" id="fn\1">[\1]</a>

That gets you all your EPUB clickable footnotes.

Now you'd just be left with the typical HTML cleanup:

Removing page code (or converting to RPNs ["Real Page Numbers"]).
Shifting all footnotes to the end-of-file.
Merging split paragraphs.
[...]