View Single Post
Old 02-07-2020, 07:53 AM   #16
Tex2002ans
Wizard
Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.
 
Posts: 2,306
Karma: 13057279
Join Date: Jul 2012
Device: Kobo Forma, Nook
Quote:
Originally Posted by thymesnewroman View Post
I didn't use Calibre at all. [...] Perhaps the original person who created this epub took a "wrong turn" with the notes/linking/coding?
Yes, because index_split_000 is one of the ways Calibre splits files... so it's a pretty sure sign this book was run through a Calibre conversion at some point.

* * *

What's the name of this book?

Perhaps there's a better quality file floating around.

Many times it's better to work from a much better source than to try to clean up a poorly-converted mess.

Quote:
Originally Posted by exaltedwombat View Post
It's going to be easier to clear this up if you make the code of an entire chapter available to us.
While this might be better... from the samples given, you can already tell the person who initially converted this botched it up so badly.

Look closely at the Footnote code from Example #1:

(I'm going to rip out the class="" crap so we can see

Code:
<a href="../Text/index_split_000.xhtml#anchor22">14</a>

[...]

<p><span id="anchor129"></span><a href="../Text/index_split_000.xhtml#anchor60">14</a>
The footnote 14 number says jump to anchor22, but the end of the chapter says anchor129...

I'm assuming the entire book's IDs are completely mangled as well. (They don't match up in any of the examples you gave.)

Quote:
Originally Posted by thymesnewroman View Post
index_split_001.xhtml is the Table of contents.
index_split_003.xhtml is Preface.
[...]
And so on and so forth.
If you do decide to continue trying to recover this mess of a file...

It may be easier to rename all the files to human-readable names.

In Sigil, on the left-side, you'll see the Book Browser (where it lists all the files). You can Right-Click files, then Rename.

So then you could rename:

index_split_003.xhtml -> Preface.xhtml
index_split_005.xhtml -> Part1.xhtml
[...]

That will at least make the spaghetti of links more readable. Then you would be able to more easily tell:

"Whoops, this Chapter 1 footnote actually points to titlepage.xhtml... no wonder it's broken."

And while this ebook may be recoverable... perhaps this cleanup project may be too advanced for someone who doesn't know the technical innards yet.

Last edited by Tex2002ans; 02-07-2020 at 07:56 AM.
Tex2002ans is offline   Reply With Quote