![]() |
Creating page lists on Windows – EPUBogrify alternative?
I am looking to improve accessibility in my EPUBs by adding a page list.
I was following Laura Brady's workflow (on LinkedIn Learning – here's essentially the same workflow detailed on epubsecrets), but unfortunately stalled at the point when it calls for a MacOS-only application called EPUBogrify at the point after using Pagestaker. Ideally I would like to keep my workflows on my workstation PC. In this MobileRead thread, BeckyEbook mentions that: Quote:
|
Perhaps the simplest replacement for EPUBgrify would be a Sigil plugin that would only do two things:
1. replacement. Find: Code:
<span class="com-rorohiko-pagestaker-style">(\d+)</span>Code:
<span epub:type="pagebreak" id="page\1" title="\1" />Code:
<li><a href="FILENAME.XHTML#\1">\1</a></li>In fact, the only problem is to put this file name in the text file, because there is no variable/placeholder in the "naked" Sigil that would allow you to insert the current file name in the "Replace" field. I do not complain about this lack, but this example only shows that such a need sometimes exists :) |
How do page lists improve accessibility?
|
We can add a nav page list generator to sigil if we could mark the spans with a sigil_pagelist class or some other way in the first replace.
I will look into itif people are interested. Also doing this in a plugin would be very straight forward. KevinH |
Quote:
The plugin supports NCX and NAV pagelists. EDIT: The plugin only supports books created with Sigil. If the nav file is not in the same folder as the other XHTML files, the page list hrefs will need to be manually updated. I might fix this limitation in the next version. |
@Doitsu: Oh, I don't know how I could have forgotten this plugin!
@Monaghan: The first point – replacement. The second point – use the PageList plugin. Good luck! |
Quote:
Doitsu's plugins really help, but you'd still need to go through and actually mark where page breaks actually occur. - - - If you're working in Microsoft Word, DAISY has since released a tool called "WordToEPUB": https://daisy.org/activities/software/wordtoepub/ and they've also released a few videos about it: Youtube: "WordToEPUB Extended Tutorial – Accessible EPUB in Seconds" (They even mentioned how great Sigil is! :D) ~38 mins is where they discuss generating page numbers. Same with that though, you have to manually mark your DOCX where pages occur, then the tools will help convert those numbers into the needed <a> or <span> markup. Quote:
Instead of jumping to the next screen/heading, you can navigate by jumping to the next page (think Text-to-Speech). You can also sync together with print-book readers (think a book club, classroom, or citation where they say "On page 45, this this and this occurs [...]") For more details, check out all the previous threads discussing this topic: 2020: "Correct Page Numbers in Kindle?" (Post #20) or do a search in your favorite search engine: Code:
page numbers Tex2002ans Doitsu site:mobileread.com |
Sounds like it all boils down to manually marking where pages occur. The rest can be easily automated (provided it's marked with all the info necessary to create the PageList).
|
Quote:
In Post #5 of "Create index on epub from printed book", I even explained how to:
Quote:
I know there was some InDesign plugin that automatically marked every page with a hidden anchor. But I'm not aware of such a thing for Word/LibreOffice or other programs. Everything I'm aware of is still the ol' manual markings. |
Thanks, all. This is all very helpful information.
Quote:
|
Quote:
Export an EPUB from InDesign, then post a few pages of code here (or just attach a sample EPUB). Once I see the pattern, it should be easy to map that over + come up with instructions for Doitsu's plugins. :D |
That's kind of you. It's basically just this tag:
Code:
<span class="com-rorohiko-pagestaker-style">85</span> |
One related question I have always had, if you look at the actual printed book, does the position of this tag end that page number or start that page number?
In other words using your example, is the Code:
<span class="com-rorohiko-pagestaker-style">85</span>The reason I ask (other than the off by one page issue) is that I have seen ebooks that tried this based on where the physical page number was printed in the pdf (top header or bottom footer) which makes it even more confusing. There must be a convention. |
Quote:
Code:
{Since the plugin will also add missing epub:type and id attributes, you might want to test it with a copy of your book. |
Quote:
See DAISY, Accessible Publishing Knowledge Base: "Page Navigation" Quote:
Quote:
Quote:
Find: <span class="com-rorohiko-pagestaker-style">(\d+)</span> Replace: <span epub:type="pagebreak" id="page\1" title="\1"/> That will convert that span into: Code:
<span epub:type="pagebreak" id="page85" title="85"/>Note: You may also wants a separate regex to deal with frontmatter with roman numeral page numbers (no idea how EPUBOgrify generates those). I usually use the ol': \b[xiv]+\b to find lowercase roman numerals... but definitely don't do a mass Search/Replace unless you know what you're doing. :P If EPUBOgrify uses the same code, it'll be: Find: <span class="com-rorohiko-pagestaker-style">(\b[xiv]+\b)</span> |
Quote:
|
Code from EPUBOgrify:
Find: <span class="com-rorohiko-pagestaker-style">(\d+|[ivxclm]+)</span> |
Quote:
"On page 123, author wrote X, Y, Z." You need the marker to jump you to the very beginning of that page's text. Quote:
|
Quote:
|
Quote:
|
You wonderful people. Thanks for all.
|
| All times are GMT -4. The time now is 10:28 PM. |
Powered by: vBulletin
Copyright ©2000 - 3.8.5, Jelsoft Enterprises Ltd.
MobileRead.com is a privately owned, operated and funded community.