![]() |
#16 | |
Sir Penguin of Edinburgh
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 12,375
Karma: 23555235
Join Date: Apr 2007
Location: DC Metro area
Device: Shake a stick plus 1
|
Quote:
I think it's safe to assume that the indexes will be made during the ebook creation process. I would suggest that each index in an ebook be in a separate file(s). Given that Epub is basically zipped HTML, an index will likely consist of 1 or more links (that lead to other places in the ebook). I think we should consider copying the behavior of Mobipocket indexes. A link in the title index doesn't lead to the respective title. Instead, it leads to the beginning of the entry containing the title. (I also agree with Igorsk about the need for multiple head words.) In the keyword index, each keyword will listed once, and link to a separate file consisting of links to each of the entries containing the keyword. The links won't lead to the keyword, but to the beginning of the entry. |
|
![]() |
![]() |
![]() |
#17 | |
speaking for myself
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 139
Karma: 2166
Join Date: Feb 2008
Location: San Francisco Bay Area
Device: PRS-505
|
Quote:
Code:
<entry href="QU.html#queen">queen</entry> Code:
<dl id="queen"> <dt>queen<dt> <dd>a female sovereign or monarch</dd> </dl> Also, can we used any existing mark-up (e.g. XHTML or perhaps NCX) for index file? Should we just use XHTML-based index with some metadata marking it as such? I'll think a bit more about it. |
|
![]() |
![]() |
![]() |
#18 |
Sir Penguin of Edinburgh
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 12,375
Karma: 23555235
Join Date: Apr 2007
Location: DC Metro area
Device: Shake a stick plus 1
|
Actually, I was thinking of straightforward HTML for the index entry:
<a href="dictionary.html#d_somenumberX">dictionary</a><br /> It would have a corresponding link in the body of the ebook, of course. |
![]() |
![]() |
![]() |
#19 |
eBook Enthusiast
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 85,544
Karma: 93383099
Join Date: Nov 2006
Location: UK
Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6
|
It's certainly no problem having multiple headwords for a single entry in Mobi dictionaries. The Chambers dictionary I have on my Gen3 will, for example, find words with variant spellings - eg "center" or "centre".
|
![]() |
![]() |
![]() |
#20 | |
speaking for myself
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 139
Karma: 2166
Join Date: Feb 2008
Location: San Francisco Bay Area
Device: PRS-505
|
Quote:
1. Add metadata tags (exact tags TBD) indicating that the EPUB is a dictionary, optional "input" language (the langauage that the dictionary articles are in is indicated by dc:language element), optional reference to the index file and optional collation declaration that describes the order of terms in the dictionary. 2. Dictionary should be split in multiple sections. In addition, an index file can optionally be provided. Index file should have linear="no" attribute in the spine. If an index is provideed, it should be referenced by the metadata. 3. Each entry in the dictionary must be formatted using XHTML dl tag. The first dt tag inside dl is considered to be a primary term. Dictionary entries must go in the order specified by collation - both inside a single section and across all sections as they are referenced in the spine. 4. Index is an XHTML file (exact structure TBD) that lists the sections of the dictionary itself (as opposed to supplementary material) and only the first term for each section. That both allows for efficient search and does not bloat the index. Peter |
|
![]() |
![]() |
![]() |
#21 | |||
Sir Penguin of Edinburgh
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 12,375
Karma: 23555235
Join Date: Apr 2007
Location: DC Metro area
Device: Shake a stick plus 1
|
Quote:
1, yes. 2, I think an index should be required due to the need for a speedy lookup. 3, The dl tags seem to be duplicating what we are trying to do with the XML tags, and can't get achieve the specificity desired . Why use both? 4, Let me expand on what I wrote before. A dictionary, for example, will have at a minimum title index(or something that will serve that purpose). It might also have one or more keyword indexes. The title index will be in its own file that is separate from the the rest of the book as well as being separate from the other indexes. Each index will be in a separate file (or files) from the other indexes. If there is more than one type of keyword (example: "famous people" & "famous places"), each type of keyword will have its own index with its own files. Here is where my explanation wasn't clear before. A keyword index, "famous people" for example, would be in the file "famous people_x.html". The entries would look like this: Quote:
Quote:
|
|||
![]() |
![]() |
![]() |
#22 | ||
speaking for myself
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 139
Karma: 2166
Join Date: Feb 2008
Location: San Francisco Bay Area
Device: PRS-505
|
Quote:
Quote:
On the other hand, keyword indices have to list every word (since they cannot rely on the document structure), but typically won't be as large (judging by the printed books). Also, in many cases, keyword index includes a short definition for each term, in addition to the link(s) to the book body. From that perspective, keyword indices are more similar to small dictionaries than to the title index. Finally, my instincts are to avoid br tag. Wrap it in p, li, dt - whatever - instead. Peter |
||
![]() |
![]() |
![]() |
#23 | |||
Sir Penguin of Edinburgh
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 12,375
Karma: 23555235
Join Date: Apr 2007
Location: DC Metro area
Device: Shake a stick plus 1
|
Quote:
Quote:
Also, while the entries of a dictionary are alphabetical, having a list of just headwords without the entries means you can look at and discard more entries at a time. This will make finding a word (with uncertain spelling) faster. Question: would it be possible to build the headword index into the toc.ncx file? If so, could it behave like an index? Quote:
|
|||
![]() |
![]() |
![]() |
#24 | |||
speaking for myself
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 139
Karma: 2166
Join Date: Feb 2008
Location: San Francisco Bay Area
Device: PRS-505
|
Quote:
Quote:
Quote:
|
|||
![]() |
![]() |
![]() |
#25 |
speaking for myself
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 139
Karma: 2166
Join Date: Feb 2008
Location: San Francisco Bay Area
Device: PRS-505
|
BTW, judging by the level of interest that this thread generated, people care about dictionaries even less than I thought ;-)
|
![]() |
![]() |
![]() |
#26 | |
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 11,470
Karma: 13095790
Join Date: Aug 2007
Location: Grass Valley, CA
Device: EB 1150, EZ Reader, Literati, iPad 2 & Air 2, iPhone 7
|
Quote:
Dale |
|
![]() |
![]() |
![]() |
#27 |
Fanatic
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 554
Karma: 2928497
Join Date: Mar 2008
Device: Clara 2E & Sage
|
I agree with Dale on this. Peter, perhaps you are thinking too much like an engineer and not enough like an average reader? As for interest in this thread, I have been following it closely, as I am sure others have.
|
![]() |
![]() |
![]() |
#28 |
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 644
Karma: 1242364
Join Date: May 2009
Location: The Right Coast
Device: PC (Calibre), Nexus 7 2013 (Moon+ Pro), HTC HD2/Leo (Freda)
|
I've been following this discussion since the beginning, although I have not commented. The discussion is about a format that I don't use (mobi/mobipocket), have no personal knowledge of, and provides detailed discussion of the programming of same.
In other words, I don't see that I can contribute to the discussion in any constructive manner. So... ![]() |
![]() |
![]() |
![]() |
#29 | |||
zeldinha zippy zeldissima
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 27,827
Karma: 921169
Join Date: Dec 2007
Location: Paris, France
Device: eb1150 & is that a nook in her pocket, or she just happy to see you?
|
Quote:
Quote:
Quote:
|
|||
![]() |
![]() |
![]() |
#30 |
Created Sigil, FlightCrew
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,982
Karma: 350515
Join Date: Feb 2008
Device: Kobo Clara HD
|
I would like to chime in with the rest and say that although I haven't added to the discussion, I'm following it closely. Dictionaries are important.
|
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Amazon extending DXG Returns? | nremondelli | Amazon Kindle | 3 | 08-01-2010 04:37 PM |
Proposal: Extending Epub with reference book tags | Nate the great | ePub | 31 | 10-16-2009 04:56 AM |
iLiad New ContentLister mockup proposal | Iñigo | iRex Developer's Corner | 9 | 12-08-2008 02:40 PM |
A homebrew proposal | DasFool | Sony Reader Dev Corner | 4 | 07-30-2008 05:45 AM |
Projects/files maintenance proposal | Alexander Turcic | Announcements | 6 | 10-26-2006 09:24 AM |