11-15-2017, 09:29 AM | #61 | |
Wizard
Posts: 1,542
Karma: 6613969
Join Date: Mar 2013
Location: Rosario - Santa Fe - Argentina
Device: Kindle 4 NT
|
Quote:
|
|
05-08-2022, 03:55 PM | #62 |
Wizard
Posts: 1,496
Karma: 11250344
Join Date: Aug 2010
Location: NE Oregon
Device: Kobo Sage, Forma, Kindle Oasis 2, Sony PRS-T2
|
I know this thread is getting long in the tooth, but it did seem like the place to ask....
I'm working on the public domain book "Under the Lilacs" by Louisa May Alcott for the MobileRead library, going from a 1910 edition. (I know it's taking me forever, but I need SUN for my old eyes to proofread against print and haven't had much this year so far. Hopefully summer is coming...) This text is RIDDLED with trailing em-dashes of this sort: Code:
<p><snip text>that young person answered respectfully,—</p> I am using the Word Joiner HTML entity ( & # 8 2 8 8 ; ) between the comma and the em-dash. I'm assuming that there is no need for the word joiner between the word and the comma? What about an example like this, where there's a comma, right closing quotation mark and THEN an em-dash? Will all that hold together with *bags* if the joiner is used between the right quote and the em-dash or do I need to have the word joiner between the word AND each punctuation mark? Code:
<p><snip text>mumbling something about “bothering old bags,”—</p> Code:
<p>after the other—”</p> Thanks, y'all! Last edited by graycyn; 05-08-2022 at 04:06 PM. |
Advert | |
|
05-08-2022, 07:21 PM | #63 |
Resident Curmudgeon
Posts: 73,979
Karma: 128903378
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
Is it correct to have the em-dash like it is ion the eBook? Does a printed copy of the 1910 edition do the same thing?
|
05-08-2022, 10:27 PM | #64 | |
Wizard
Posts: 1,496
Karma: 11250344
Join Date: Aug 2010
Location: NE Oregon
Device: Kobo Sage, Forma, Kindle Oasis 2, Sony PRS-T2
|
Quote:
While I plan to try and be true to the printed text, definitely not planning to retain 1910 era justification and punctuation spacing! It would neither be pretty or practical! Example (typed including all spacing or lack thereof as seen in print): Code:
" I shall have to go off early, so there won't be any chance. Yes, there will, — I 'll tell you how to do it. Let me read while I drive up the cows. Squire likes Notice the space between the opening quote and "I". And the space in "I'll" and the LACK of same in "won't" above. This happens throughout the entire book with contractions! Notice the space before and after the em-dash. Double space between sentences. That is how it appears on the page! But not in my ebook! I don't plan to sprinkle the word joiner entity about in the MIDDLE of sentences either, ONLY where sentences END with the abundantly trailing em-dashes. Reflowable text needs to flow, right? I'd just like to prevent the possibility of LOTS of orphan em-dashes. *IF* y'all think that's wise! Otherwise, I'll leave 'em be, if they break off, they break off. I swear, Alcott trails those em-dashes about like ivy down the wall hiding the door in The Secret Garden by Frances Hodgson Burnett ... thick and heavy |
|
05-08-2022, 10:37 PM | #65 | ||||||||
Wizard
Posts: 2,297
Karma: 12126329
Join Date: Jul 2012
Device: Kobo Forma, Nook
|
Quote:
Cut that down to 1 (or 0). This is one of the dangers of using some of these invisible Unicode characters:
Quote:
In ebooks, don't worry about quirks in EM DASH linebreaks. Leave it up to each device's/program's linebreak algorithms. Quote:
Quote:
But, if you had to pull my teeth, this is where WORD JOINER would be placed around the EM DASH:
* * * Side Note: Similar issue happens with odd linebreaking around the ELLIPSIS character + other punctuation. For more info on that, see my posts in: and all the linked threads which go into extreme detail. Side Note #2: Most devices handle the ELLIPSIS character properly though, where the EM DASH is much trickier, because many languages use them as "Quotation Dashes": You also have languages that use every single possible combination of quotation marks... so you can't just say EM DASH + RIGHT DOUBLE QUOTE stick together 100% of the time. Examples of EM DASHES in the middle of text are common, like:
so if you "fixed" linebreaking for dialogue, you'd "break it" for parentheticals! (Not to mention, the linebreaking algorithms should/would have to be context- + language-aware!) Yes, the printed copy has that. See Page 99 in this Archive.org version where graycyn's first example was. Looks like every piece of Dialogue got its own new paragraph, so they split:
Example: Quote:
Nowadays, it would most likely be done like this: Quote:
Quote:
Quote:
Last edited by Tex2002ans; 05-08-2022 at 11:10 PM. |
||||||||
Advert | |
|
05-09-2022, 12:31 AM | #66 | ||||
Wizard
Posts: 1,496
Karma: 11250344
Join Date: Aug 2010
Location: NE Oregon
Device: Kobo Sage, Forma, Kindle Oasis 2, Sony PRS-T2
|
Quote:
Quote:
Actually, when I'm doing books for myself, that's EXACTLY what I do. I *don't* really like micro-managing reflowable text! However, most books don't have quite the sheer numbers of trailing em-dashes that this particular title does. And I know people here get kind of picky about how ebooks look. Me, it doesn't bother me terribly if a line starts with an ellipsis or em-dash. I've gotten used to it. Which is why I was asking. Because once the text is proofed, I can present it however. Free flowing or with the possibility of fewer orphan em-dashes if folks prefer that. Quote:
Quote:
I'll think it over. I was noticing the orphan em-dashes more than usual, so I thought I should perhaps try to prevent those. Your examples are what I thought would be the right placement, *if* I decide to use word joiners. THANKS, I will check out your posts! I've been having fun making ebooks for myself out of my old childhood books, but those I can't share because of copyright. This title, I can! I like modern AND old styles. Not to the extent of trying to duplicate all the crazy spacing from 1910, but the little oddities like the commas and em-dashes add charm and give me that old book feeling. Likewise for to-days and to-morrows. I grew up with those reading my dad's books. Nostalgia.... Meanwhile, anyone who objects to me just letting the text flow, speak up. The book is for MobileRead, so I'm happy to do what folks prefer, if, of course there is any consensus at all on that. |
||||
05-09-2022, 05:38 PM | #67 | ||||||
Wizard
Posts: 2,297
Karma: 12126329
Join Date: Jul 2012
Device: Kobo Forma, Nook
|
Quote:
Quote:
Quote:
I copied/pasted your post into Notepad++ and saw very odd display issues on the WORD JOINER line. (Parts of the text appeared bold.) Then it had VERY odd:
I haven't ever tested the WORD JOINER inside of ebooks, across devices/apps, but similar bugs may carry over there too. (For example, when trying to search or drag your finger to highlight text for notes.) In one of those linked posts, I linked to the actual Unicode linebreaking specs. The EM DASH is in a unique class all on its own: Quote:
Quote:
For readability, certain punctuation got (differently-sized) space around it. Back then, there was more judicious usage of all the different types of spaces:
Throughout the 20th century—because of technological/economic reasons—most editing/typography boiled down further and further towards "no space" or "single space". Currently, the only real places these spaces still exist is:
* * * Side Note: If you want some interesting history on this, it's similar to the myth that "double space after period is wrong": In that blog post, he goes through and explains the older typography / Style Guides. For example, EM QUADs were put after sentence-ending periods. Side Note #2: It's only been recent that proper typographical spacing has been making a resurgence. LaTeX has been holding strong, enabling the older (better) style of typography. (See my Units post above.) And while searching for some info, I just ran across this article with some good examples:
Finally, the web is also gaining some of this stuff, slowly. And fonts are always getting better kerning inside (so a lot of this spacing issues can be handled more automatically). Quote:
Maybe yours can be the EPUB everyone points to on how to do them properly. (And use it to test across real-life devices.) Last edited by Tex2002ans; 05-09-2022 at 06:21 PM. |
||||||
05-09-2022, 05:50 PM | #68 |
Resident Curmudgeon
Posts: 73,979
Karma: 128903378
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
I doubt word joiners will work properly in most cases. I suggest youy test it first before going all out.
|
05-11-2022, 03:03 PM | #69 |
Wizard
Posts: 1,496
Karma: 11250344
Join Date: Aug 2010
Location: NE Oregon
Device: Kobo Sage, Forma, Kindle Oasis 2, Sony PRS-T2
|
My bad, sorry I called it "crazy spacing," Tex2002ans. I agree, it wasn't, it's more that it just felt inconsistent to me, what with spacing after an opening quote, but no spacing after a closing quote and then the variance of some contracted words having a space and some not!
As for the word joiner, I had done some testing and wasn't seeing any particular issues, but I didn't try testing search or highlighting. It's no biggie, I really DIDN'T want to put them in, it was more that there've been so many discussions of typography and what people like and how things should look here, I just wanted to offer the option. I mean, I've seen the assorted discussions here over ellipsis character vs. constructed ellipsis, the discussions over whether to use an em-dash or an en-dash and spaces, all that! I figure mostly, there's no right or wrong, just personal preference. The good news is that there are ZERO ellipses in Under the Lilacs! I stripped the word joiners out of the test chapter I put them in. They are gone, so there's that. Gotta run, much to do today, including hopefully update the file with another two chapters of corrections. |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
italic at end of line causes next line to start with a space | ebookreadr | Sigil | 29 | 03-20-2017 11:25 AM |
Preventing chemical symbol from being split by page or line break | Nick Payne | ePub | 8 | 10-23-2013 03:06 AM |
Line breaks on Kindle, no line breaks on 4 PC | Siavahda | Kindle Formats | 0 | 10-20-2012 05:50 AM |
Spurious Line Breaks | Halk | Workshop | 1 | 05-15-2010 01:22 PM |
No line breaks | ecpepper | Amazon Kindle | 3 | 08-09-2009 06:42 PM |