Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Sigil

Notices

Reply
 
Thread Tools Search this Thread
Old 12-11-2021, 10:54 PM   #1
roland1
Connoisseur
roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.
 
roland1's Avatar
 
Posts: 80
Karma: 2137678
Join Date: Dec 2021
Location: Canada
Device: none
Indexing in SIGIL - How to add "See also..."

Hi folks. First post. I'm using SIGIL to create an ebook for a non-fiction book that I've already released in paperback and hardcover. It has a fairly extensive index with a lot of sub-topics and cross referencing.

I've managed to get everything to work except that I'm not sure how to approach adding "See also (topic here)" after the index topic.

The index editor is too broad sweeping, so I've gone with marking the keywords in the code. That way I know it's going to stay in place.

Q. What is the best way to add "See also" lines to the index? Also, each time I update the index, are those new lines going to get overwritten again? (I don't want to create new file; I want everything written into one file and to stay there).

Any advice? Thanks.

P.S. I'm also a web designer so the ebook xhtml stuff is a breeze. But the logic behind using SIGIL for this one index task remains a mystery.
roland1 is offline   Reply With Quote
Old 12-12-2021, 02:22 PM   #2
Tex2002ans
Wizard
Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.
 
Posts: 2,306
Karma: 13057279
Join Date: Jul 2012
Device: Kobo Forma, Nook
Quote:
Originally Posted by roland1 View Post
Hi folks. First post.
Hey. Welcome to MobileRead!

Quote:
Originally Posted by roland1 View Post
I'm using SIGIL to create an ebook for a non-fiction book that I've already released in paperback and hardcover. It has a fairly extensive index with a lot of sub-topics and cross referencing.
1. What's the original format? DOCX? InDesign?

2. Is this a plain Index? Or is it a fully linked Index?
  • Plain Index
    • You just have a formatted list of terms + page numbers.
  • Fully Linked Index
    • The exact location of your words/terms are marked within the text.
    • You can "update" the Index, then Word/InDesign/whatever will generate all new page numbers + formatting for you.

(If "parrots" moved from page 100->120, will you have to manually update everything? Or will the source file update itself?)

Quote:
Originally Posted by roland1 View Post
P.S. I'm also a web designer so the ebook xhtml stuff is a breeze. But the logic behind using SIGIL for this one index task remains a mystery.
Fully linking Indexes in ebooks is... very intensive + arguably not that beneficial (see all the in-depth links/topics far below).

- - -

1. If you don't mind a plain index:

Then insert it into the ebook as is.

(This is what I do.)

2. If you insist on fully linking your index:

Go back to the source files and generate Real Page Numbers (RPNs).

(While you're at it, you can generate the EPUB PageList!)

3. Don't insert the index file into the ebook at all.

(Many publishers decide to do this. I don't agree. A dumb/plain index is better than no index at all... even if "an ebook has search"... Indexes serve completely different purposes.)

Quote:
Originally Posted by roland1 View Post
Q. What is the best way to add "See also" lines to the index? Also, each time I update the index, are those new lines going to get overwritten again? (I don't want to create new file; I want everything written into one file and to stay there).
Don't use Sigil's Index Editor.

Work from what you already have:
  • the source files
  • the print book's Index

If you still want to go through with this...

- - -

Your best bet is probably:

Step 1. Recreate Real Page Numbers (RPNs).

(Marking the HTML with a <a id="page123"></a> where the pagebreaks occur.)

PDF:

Code:
This is an example that was split

-----------------

between two pages.
HTML:

Code:
<p>This is an example that was split<a id="page123"></a> between two pages.</p>
Step 2. Take your current plaintext Index, then use regex to convert all the numbers into links:

PDF:

Spoiler:
Code:
Animals
	cats, 5
	dogs, 123
	parrots, 200
[...]
Mammals. See also Animals.


HTML (Plain Index):

Spoiler:
Code:
<p class="index">Animals</p>
<p class="indextwo">cats, 5</p>
<p class="indextwo">dogs, 123</p>
<p class="indextwo">parrots, 200</p>
[...]
<p class="index">Mammals. <i>See also</i> Animals.</p>


HTML (Linked Index):

Spoiler:
Code:
<p class="index">Animals</p>
<p class="indextwo">cats, <a href="../Text/merged.xhtml#page5">5</a></p>
<p class="indextwo">dogs, <a href="../Text/merged.xhtml#page123">123</a></p>
<p class="indextwo">parrots, <a href="../Text/merged.xhtml#page200">200</a></p>
[...]
<p class="index">Mammals. <i>See also</i> Animals.</p>


Step 3. Convert your HTML <a href="page123"> links into:
  • EPUB2 (pageList)
  • EPUB3 (page-list)

You can then use Doitsu's "PageList" plugin for Sigil to generate the required RPN files for your EPUB.

- - - - - -

For more detailed information, see some of the recent topics on this:

If you need even more information, you may also want to look this up in your favorite search engines:

Code:
RPNs Tex2002ans site:mobileread.com
RPNs Hitch site:mobileread.com

Indexes EPUB Tex2002ans site:mobileread.com
Indexes EPUB Hitch site:mobileread.com
Me and Hitch (and others) have discussed this to death over the years.

We even covered how useful linked Indexes in an ebook even are... if the relevant text could be multiple "screens" away. For example, the famous:

Quote:
Originally Posted by roland1 View Post
I've managed to get everything to work except that I'm not sure how to approach adding "See also (topic here)" after the index topic.
Sigil's Index Editor really isn't meant for generating things like that.

It was designed to (dumbly) link to all references of a word/term.

It wasn't designed to generate complicated Indexes + Index formatting like:
  • See also
  • 123ff.
  • 123n
  • 123n10
  • 123–150
  • [...]

- - -

Side Note: I'd argue that this "concordance" (a list of all usages of a word) is even worse than no Index at all!

Side Note #2: I'd even argue this form:

Code:
example [1], [2], [3]
word [1], [2], [3]
is also completely unhelpful for actual readers.

Last edited by Tex2002ans; 12-12-2021 at 03:34 PM.
Tex2002ans is offline   Reply With Quote
Old 12-12-2021, 02:46 PM   #3
KevinH
Sigil Developer
KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.
 
Posts: 9,070
Karma: 6361556
Join Date: Nov 2009
Device: many
Yes, the Index Editor was just meant to do the grunt work. Once all words are marked and the index.xhtml has been created once, then you will need to edit the resulting index.xhtml and NOT regenerate the index or you will lose all changes. Then you will need to use regular expression search and replaces to add the "See also text and if desired links to it *inside* the index.

If the number of these to handle is too large, then think about creating the list of search and replacement regular expressions in Excel and then export it as CSV and importing from CSV into Sigil's Saved Searches and the running that Saved Search List (once).

To see the required columns for Saved Searches, select a current set of saved searches and output them to csv and visually inspect it first.

I have taken a text paired word list and used perl/python to create the search entries I wanted, imported it into Excel for cleanup and then loaded it into Sigil's Saved Searched and then ran them on the current file (index.html)

Don't forget to make a Checkpoint of your epub just before running your imported saved search in case you make mistakes and need to retry after reverting to the Checkpoint.

Last edited by KevinH; 12-12-2021 at 03:00 PM.
KevinH is offline   Reply With Quote
Old 12-12-2021, 07:00 PM   #4
roland1
Connoisseur
roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.
 
roland1's Avatar
 
Posts: 80
Karma: 2137678
Join Date: Dec 2021
Location: Canada
Device: none
Thanks, Tex and Kevin.

Crap: I just wrote a decent, lengthy post with a lot of info, clicked "go advanced" and lost it all after I had to log in again. DOH!

re; Index: I assumed I would have to add the "See also" after finishing the index because it made sense that it would just get erased with each overwrite. I can live with that.

I'm taking the word for word link coding approach. Very tedious, but if I get through this process once, I won't have to do it again.

I used Affinity Publisher for the print book layout. Excellent and cheap, BUT it does not export to anything but PDF for now. I have Scrivener, but I tried SIGIL because I could see that Scrivener was going to be a struggle in not offering me a WYSIWYG experience on the fly.

I also can't code my way past "Hello World" so I don't really see myself doing anything beyond my current skill set —*a few GREP searches whose successful outcome is often more the result of dumb luck than skill.

I'm going to reread your posts. You've given me much more to think about; including the usefulness of indexes in a search friendly environment. But then again, mine is a social/human psychology book, so it merits a little bit more work to flesh out. It has a glossary, too, so I can't really scrimp on the index too much.

Also, I like SIGIL because it allows me to bounce back and forth between the programs I'm used to using for web coding. I do know my html and css, so at least I'm lucky there.

Thanks again.
roland1 is offline   Reply With Quote
Old 12-12-2021, 07:06 PM   #5
roland1
Connoisseur
roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.
 
roland1's Avatar
 
Posts: 80
Karma: 2137678
Join Date: Dec 2021
Location: Canada
Device: none
P.S. I haven't really used "checkpoints" but rather I just keep making backup copies as my work progresses. Takes up more real estate, but it's a workflow I'm used to.
roland1 is offline   Reply With Quote
Old 12-13-2021, 12:42 AM   #6
Tex2002ans
Wizard
Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.
 
Posts: 2,306
Karma: 13057279
Join Date: Jul 2012
Device: Kobo Forma, Nook
Quote:
Originally Posted by roland1 View Post
Crap: I just wrote a decent, lengthy post with a lot of info, clicked "go advanced" and lost it all after I had to log in again. DOH!
Yep. Always best to type everything in an external text editor. THEN you can copy/paste it into the box.

Won't lose anything that way.

Quote:
Originally Posted by roland1 View Post
P.S. I haven't really used "checkpoints" but rather I just keep making backup copies as my work progresses. Takes up more real estate, but it's a workflow I'm used to.
Yep, that's a similar method to what I use too.

Tip: Save files by proper date:
  • Book.Title[YYYY.MM.DD]

that will make sure your files sort in order alphabetically+chronologically.

Checkpoints also are good if you're doing something major—like generating an Index. Makes it easy to jump back to the previous version just in case you messed up.

Quote:
Originally Posted by roland1 View Post
I used Affinity Publisher for the print book layout. Excellent and cheap, BUT it does not export to anything but PDF for now.
Never had the pleasure of using it.
  • And there's no HTML export?
  • What about RTF or something with a little bit of formatting it in?
  • Or is it really PDF export only?

What if you copy/paste the text out of Affinity + into another program (like LibreOffice):

Does it carry over the rich text formatting? (Italics, bold, hopefully indentation, etc.)

If it does, then that'll be a little easier. Then you won't have to manually add the italics back in.

Use that intermediate program to export your text to HTML.

Quote:
Originally Posted by roland1 View Post
re; Index: I assumed I would have to add the "See also" after finishing the index because it made sense that it would just get erased with each overwrite. I can live with that.

I'm taking the word for word link coding approach. Very tedious, but if I get through this process once, I won't have to do it again.
Don't manually add all that "See also" text.

Just work directly from the Print version you have + use regex (regular expressions) to linkify the page numbers.

I explain exact methods in those linked threads, but I'll reword it slightly differently here:

* * * * * *

How to Make Your Index Links

0. Make sure you insert all your page number code at every page break!

Code:
<p>This is an example<a id="page123"></a> of a split paragraph.</p>
1. Merge all your main chapter files into one huge HTML file. (You'll thank me later.)

I rename this super merged book file to:
  • merged.xhtml

(Make sure you keep the index.xhtml file separate.)

2. Open your Index file.

3. Ctrl+F to open Sigil's Search/Replace.

In the dropdowns, make sure these are selected:
  • Mode: Regex
  • Current File

+ uncheck the box for "Wrap".

4. Use regex to convert those dumb page numbers into links:

Search: (\d+)
Replace: <a href="../Text/merged.xhtml#page\1">\1</a>

- - - - - -

Side Note: If you don't know regular expressions... this is what all the parts are doing:

Search:
  • \d = any number
  • + = one or more of the previous thing
    • So in this case, "one or more of any number"
  • () around stuff = "Capture whatever's in here, and shove it in a 'group'."

Replace:
  • <a> = all the link code
  • \1 = The stuff we captured in the Search.

In Plain English, this regular expression is saying:

"Hey, look for any number, grab it, then point to that page number in the merged file."

- - - - - -

5. Click somewhere right before the very first entry, then press "Replace All".

This will go from my "Plain Index" -> "Linked Index" above:

Before:

Code:
cats, 5
dogs, 123
parrots, 200
After:

Code:
cats, <a href="../Text/merged.xhtml#page5">5</a>
dogs, <a href="../Text/merged.xhtml#page123">123</a>
parrots, <a href="../Text/merged.xhtml#page200">200</a>
6. Doublecheck and make sure you did all your page number code throughout your main text.

Now, you can split your chapter files again:

6.1. You can manually click before each of your chapters, then Insert > Split Marker (Ctrl+Shift+Return).

That button inserts this code:

Code:
<hr class="sigil_split_marker" />
6.2. (Alternate) Or you can insert all those markers in one shot by Search/Replace.

If you have proper Headings (or some common text that stands out):

Search: <h2>Chapter
Replace: <hr class="sigil_split_marker" /><h2>Chapter

then "Replace All".

Before:

Code:
<h2>Chapter 1<h2>
[...]
<h2>Chapter 2</h2>
Replace:

Code:
<hr class="sigil_split_marker" /><h2>Chapter 1<h2>
[...]
<hr class="sigil_split_marker" /><h2>Chapter 2</h2>
7. Edit > Split at Markers (F6).

Now all your chapter files will be resplit into separate HTML files again.

8. Rename all your HTML files to human-readable names:

- Chapter01.xhtml
- Chapter02.xhtml
- [...]
- Chapter99.xhtml

* * * * * *

Now, when you go back to your Index, you should see all your page number code updated:

Code:
cats, <a href="../Text/Chapter01.xhtml#page5">5</a>
dogs, <a href="../Text/Chapter15.xhtml#page123">123</a>
parrots, <a href="../Text/Chapter28.xhtml#page200">200</a>
And to doublecheck your links work... Press Tools > Reports > Links.

If you "Target exists?" column has any "no", then you know you have a broken link.

Side Note: For more explanation on what the columns mean, see my post from a few months ago:

Last edited by Tex2002ans; 12-13-2021 at 12:47 AM.
Tex2002ans is offline   Reply With Quote
Old 12-13-2021, 10:17 PM   #7
roland1
Connoisseur
roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.
 
roland1's Avatar
 
Posts: 80
Karma: 2137678
Join Date: Dec 2021
Location: Canada
Device: none
Thanks for the help. It's much appreciated.

Today I started using dictation with the index and that made it easier to find phrases in SIGIL (easier on my hands and faster, too).

I have one concern and you guys would know the scoop on this: why are some entries in the index (subtopics) not in alphabetic order?

Did I do something wrong? Are those reference numbers I see linked to their ordering in the index?

P.S. This is my first flowable text ebook, so I'm still getting my wings in terms of understanding the whole process. I did put out a fixed format kindle ebook, but that was easy. Not as work intensive as this.

Thanks again.

And I will probably have some questions once I get to the "see also" stage.
roland1 is offline   Reply With Quote
Old 12-13-2021, 10:20 PM   #8
roland1
Connoisseur
roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.
 
roland1's Avatar
 
Posts: 80
Karma: 2137678
Join Date: Dec 2021
Location: Canada
Device: none
Also, Affinity Publisher does not export to anything in the text based world, other than pdf - if I don't flatten the file.

When I created my ebook, I copied and pasted from the original APub doc. layout. I didn't copy and past anywhere else but into SIGIL because I thought I might get app-specific gremlins stuck to my text. Again, it's a learning curve and I'm sure my second time around will be smoother, or at least a lot more informed that I am now.
roland1 is offline   Reply With Quote
Old 12-14-2021, 10:12 AM   #9
roland1
Connoisseur
roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.
 
roland1's Avatar
 
Posts: 80
Karma: 2137678
Join Date: Dec 2021
Location: Canada
Device: none
Tex, thanks for the extended and insightful post on the page numbering. I just now read every bit of it and it all makes sense. (I read the linked content earlier). Kevin, your input is also helpful, although when perl and python are used in any sentence, my brain glazes over (any success in those areas has also ever been dumb luck for me; I learn from patterns but I never learned to actually speak the language of that code like others have —*I don't spend enough time with it to make studying it worthwhile.)

Here in this thread, the situation is like two responsible adults calling out to a kid who's walking into the riptide and you're yelling: "Don't walk into the riptide" and I'm like: "It's okay, I'm a strong swimmer." At this point, it's a matter of seeing how far I get sucked out into the ocean before I need to get rescued.

That said, I made it all the way to "F" in the Index last night. Cool Runnings!
roland1 is offline   Reply With Quote
Old 12-14-2021, 12:03 PM   #10
Tex2002ans
Wizard
Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.
 
Posts: 2,306
Karma: 13057279
Join Date: Jul 2012
Device: Kobo Forma, Nook
Quote:
Originally Posted by roland1 View Post
Also, Affinity Publisher does not export to anything in the text based world, other than pdf [...].
Woof.

Quote:
Originally Posted by roland1 View Post
When I created my ebook, I copied and pasted from the original APub doc. layout. I didn't copy and past anywhere else but into SIGIL because I thought I might get app-specific gremlins stuck to my text.
Yes, usually that's an issue.

Gremlins getting introduced (especially when copying/pasting from outside sources).

But, in this specific case, it would be helpful to get the text + italics + (any other basic formatting) out.

So you could do something like this:
  • Copy index out of Affinity Publisher
  • Paste into LibreOffice/Word
    • (Optional) Adjust Styles to make sure indentation matches print book.
    • Save as ODT (or DOCX, or whatever)
  • Convert to HTML.
    • With a "Save As" or Calibre convert.
  • Copy/Paste index's HTML into Sigil.

Then you could do the usual cleaning of dirty HTML -> clean HTML:

Code:
<p class="calibre123">dogs, 123</p>
<p class="calibre456">Mammals. <i class="calibre1234">See also</i> Animals.</p>
After:

Code:
<p class="index">dogs, 123</p>
<p class="index">Mammals. <i>See also</i> Animals.</p>
You could use Doitsu's fantastic "TagMechanic" plugin to help with some of that cleanup.

(Or, if it's not so bad, just simple S&R. There shouldn't be too much HTML mess introduced.* [Famous last words.])

Side Note: You may even be able to paste your index directly from Affinity Publisher into Sigil's PageEdit. Perhaps the HTML code may be slightly cleaner.

Quote:
Originally Posted by roland1 View Post
Today I started using dictation with the index and that made it easier to find phrases in SIGIL (easier on my hands and faster, too).
??? What do you mean?

Quote:
Originally Posted by roland1 View Post
I have one concern and you guys would know the scoop on this: why are some entries in the index (subtopics) not in alphabetic order?

Did I do something wrong? Are those reference numbers I see linked to their ordering in the index?
Can you give an example + step-by-step instructions?

I think I get what you're saying, but not 100% sure.

You're talking out of Sigil's Index Editor + Create Index?

So while main entries like:

Code:
Animals
Mammals
Zoology
are getting sorted A->Z... the subentries aren't?

Code:
Animals
	giraffes
	zebras
	cats
	dogs
Mammals
Zoology
where you expect the subentries alphabetical as well?

Code:
Animals
	cats
	dogs
	giraffes
	zebras
Mammals
Zoology
Anyway, some:
  • HTML to reproduce the problem
  • Tools > Index > Index Editor, so we could see what "Text to Include" + "Index Entries" you have

would help.

Quote:
Originally Posted by roland1 View Post
P.S. This is my first flowable text ebook, so I'm still getting my wings in terms of understanding the whole process. I did put out a fixed format kindle ebook, but that was easy. Not as work intensive as this.
Fixed Layout (FXL) books are... ass. Those have also been discussed to death over the years.

99.99% of the time, you don't want them.

They are awful for actual, human readers, because they throw away all the advantages of actual ebooks:
  • Resizing/changing fonts
  • Adjusting margins
  • Easy copy/paste/search
  • Footnote popups
  • Perfectly fitting your device
  • [...]

With FXL, you have to pinch-zoom, pinch-zoom, scroll, pinch-zoom.

They don't sell. They don't work across devices. And they are the absolute worst of all worlds (even worse than just reading a PDF).

Quote:
Originally Posted by roland1 View Post
Tex, thanks for the extended and insightful post on the page numbering. I just now read every bit of it and it all makes sense.

Last edited by Tex2002ans; 12-14-2021 at 12:09 PM.
Tex2002ans is offline   Reply With Quote
Old 12-15-2021, 03:39 PM   #11
roland1
Connoisseur
roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.
 
roland1's Avatar
 
Posts: 80
Karma: 2137678
Join Date: Dec 2021
Location: Canada
Device: none
"Dictation" = voice to text. I speak the phrase into the "Find" field rather than type it.

I have dozens of references for one word, so I always have to type out the words around it so I can hone in on the exact page for that keyword. Speaking the text quickens the pace.

Also, when I read some posts about page numbering, I saw that it wasn't an easy fix. Some people were saying "How come the page numbers don't show up?" in device a/b or c. And then someone said "Well, you could use this plugin and..."

When I saw that I said to myself, I've got enough problems doing it the simple way. I figure the best road to go down is the one I'm familiar with, that way I know how to find may way back. I don't use that approach in life, but I use it for technical stuff like this.

Thanks again. Oh, and the index alphabetizing thing happens in subtopics. I have a psychological breakdown of fear indexed and it goes from "fear of" (f) to horror (h) and then to "fleeing" (back to f again). so it goes from f to h to f. I'm have to investigate more later.
roland1 is offline   Reply With Quote
Old 12-15-2021, 05:24 PM   #12
roland1
Connoisseur
roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.
 
roland1's Avatar
 
Posts: 80
Karma: 2137678
Join Date: Dec 2021
Location: Canada
Device: none
Just to be clear, I don't put "dozens" of references for one word into the index. I just mean that there can be dozens of instances of a word throughout the book and so I have to isolate it by identifying the words that surround it. Luckily, those words show up in the print file indexing window.
roland1 is offline   Reply With Quote
Old 12-16-2021, 05:09 AM   #13
Tex2002ans
Wizard
Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.
 
Posts: 2,306
Karma: 13057279
Join Date: Jul 2012
Device: Kobo Forma, Nook
Quote:
Originally Posted by roland1 View Post
"Dictation" = voice to text. I speak the phrase into the "Find" field rather than type it.

I have dozens of references for one word, so I always have to type out the words around it so I can hone in on the exact page for that keyword. Speaking the text quickens the pace.
Yes, but I didn't understand exactly how you're using it. (Still don't.)

So let's say you have "dog, 123" in your index.

You're jumping to page 123's text, then trying to locate the exact word "dog" on that page? (And you're currently using dictation to speak the words instead?)

... and what exactly would the point of that be? If you're not marking indexed words within the original source document (Word, LO, etc.), seems like you're wasting lots of time.

Just link to the page #s, and move on.

(I explain more in-depth reasoning in the topic below.)

Quote:
Originally Posted by roland1 View Post
Also, when I read some posts about page numbering, I saw that it wasn't an easy fix. Some people were saying "How come the page numbers don't show up?" in device a/b or c.
RPNs are a complete pain in the neck to produce. Many ereaders won't display them.

... But within the past few years, I have slightly loosened on how useless they are in ebooks (for Accessibility reasons).

Quote:
Originally Posted by roland1 View Post
Thanks again. Oh, and the index alphabetizing thing happens in subtopics. I have a psychological breakdown of fear indexed and it goes from "fear of" (f) to horror (h) and then to "fleeing" (back to f again). so it goes from f to h to f. I'm have to investigate more later.
In Sigil's Index Editor + Create Index?

(But you shouldn't even have that problem, because you deleted all that and worked from the Print book's text. Right? )

But it would be good to get those Sigil bugs sorted. Not many people go poking around in the index tools.

Quote:
Originally Posted by roland1 View Post
Just to be clear, I don't put "dozens" of references for one word into the index. I just mean that there can be dozens of instances of a word throughout the book and so I have to isolate it by identifying the words that surround it. Luckily, those words show up in the print file indexing window.
If you haven't already, you'll also probably want to read through:

Post #129 is where I entered the picture, describing nearly every facet of Indexing/citations in ebooks.

(You could also start at #6, where Hitch began posting. But there's a massive amount of ranting/raving from other users... you'd probably gather all the real-life indexes-in-ebooks production by reading each of Hitch's posts + mine.)

Now, I don't know if you hired an actual Indexer to create your index. But, in many cases, those exact words/terms just won't show up within the text.

Indexes allow you to have more broad strokes or general terms. Here's one of the examples I gave in that post:

Quote:
Originally Posted by Tex2002ans View Post
"Ancestors, 3, 36, 145".

Great, I found the word "ancestors" in page 3, EASY. But wtf is this, I just read the entire page 36, and I don't see "ancestors" on the page.

You (as the converter) must now read/skim the ~400-800 words that constitute "page 36" to find what the Indexer ACTUALLY meant.

You have to look for all the related words: "ancestry" + "ancestor" + "ancestral". Maybe it just has an important sentence/paragraph that talks about ancestors indirectly (maybe talking about older relatives, or ancient civilizations).
I also covered the "looks-like-only-a-single-number, but multiple-times-per-page" problem too:

Quote:
Originally Posted by Tex2002ans View Post
Hard #2: "Keynes, John Maynard, 429, 464, 467, 468n, 546n, 737, 771, 785, 787, 846".

Keynes might be mentioned multiple times on a page. It just so happened to be because of the way the physical book was laid out (page margins, font, [...]), that Keynes was mentioned in the first + last paragraph on page 429, BUT, the middle paragraphs don't talk about him at all.

Where do I link? Do I link to that first paragraph? Do I link to the last paragraph too?

Keynes may also be mentioned quite a few times throughout the book on other pages, but it is just an unimportant/passing remark. This doesn't belong in the Index. In my searching/jumping around page numbers though, I STILL come across "Keynes" a hundred times, this takes time to sift through. (This is the problem of the Search/Concordance method + any sort of automated/semi-automated Indexing tools).
Tex2002ans is offline   Reply With Quote
Old 12-17-2021, 12:21 PM   #14
roland1
Connoisseur
roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.roland1 ought to be getting tired of karma fortunes by now.
 
roland1's Avatar
 
Posts: 80
Karma: 2137678
Join Date: Dec 2021
Location: Canada
Device: none
The dictation is used as follows:

In the finder field, I may type in the word "competition" —*however, it can show up dozens of times inside the book/html text so that I would have to press "find" continuously until I ran into it along the way.

And so, what I do instead is that I look at the original index marker from the book and type in words that surround that word —*which I speak into the finder field via dictation. In this way, I can land on the exact page where that reference is located. Saves me time. Some, anyway.

P.S. This particular book is 28 years in the making. A few days spent on an index is not going to make or break my resolve to get the job done.
roland1 is offline   Reply With Quote
Reply

Tags
index, see also, sigil


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
suggestion: add "apply" or "save" functionality in edit toc davidhcje Editor 1 08-20-2021 10:40 PM
t68 indexing books and failing to update "curently reading" Maartinezz123 Onyx Boox 0 01-17-2018 09:39 AM
"Add a book" template like "Save to disk"? vr8ce Library Management 10 06-09-2017 08:16 AM
A warning for Linux users: slow "Add Books", "Unknown" title and Author rolgiati Library Management 8 07-24-2013 04:36 PM
Question about disable indexing permanently by disabling access to "Search Indexes" WS64 Kindle Developer's Corner 1 12-17-2011 05:51 PM


All times are GMT -4. The time now is 01:20 PM.


MobileRead.com is a privately owned, operated and funded community.