Old 12-08-2021, 02:27 AM   #16
Tex2002ans
Wizard
Posts: 2,297
Karma: 12126329
Join Date: Jul 2012
Device: Kobo Forma, Nook
Quote:
Originally Posted by fabien.benoit.19 View Post
@Tex2002ans what tts software do you use actually?
On Android, I use PocketBook Reader to read my EPUBs, and just have the built-in "Google Text-to-Speech" as the chosen TTS engine.

Good enough for me.

Note: About a month ago, someone on Reddit recommended "Balabolka" to me, but I haven't tested it yet.

I've been wanting a TTS->MP3 program for quite a long time. If that can pull some of those higher quality, more human-sounding engines, that would be great. (And if I could listen on the computer... sugar on top!)

The only TTS->MP3s I tested in the past used the built-in Microsoft voices, and those were robotic-sounding junk.

If you're aware of some better TTS on Windows (local device preferably, no cloud stuff...), I'd be very interested.

Side Note: For the video and audio stuff... I actually use this on Android:
  • NewPipe to watch YouTube videos
  • AntennaPod to listen to podcasts

Both of those have an option to "Skip Silence" + allow high (and fully controllable) speeds—NewPipe allows up to 3x speed, and AntennaPod can go up to 4x.

On Windows, similarly, the "cut silence" tools I'm aware of just don't work as smoothly/"magically" as those Android versions.

Last edited by Tex2002ans; 12-08-2021 at 02:32 AM.
Old 12-08-2021, 06:42 AM   #17
Doitsu
Grand Sorcerer
Posts: 5,584
Karma: 22735033
Join Date: Dec 2010
Device: Kindle PW2
Quote:
Originally Posted by Tex2002ans View Post
I've been wanting a TTS->MP3 program for quite a long time. If that can pull some of those higher quality, more human-sounding engines, that would be great. (And if I could listen on the computer... sugar on top!)
You might want to check out the following projects:

[GUI Plugin] TTS to MP3: Create MP3 audiobook using Windows TTS

Daisy Toby
Old 12-08-2021, 08:18 AM   #18
fabien.benoit.19
Junior Member
Posts: 7
Karma: 10
Join Date: Dec 2021
Location: Minsk, Belarus
Device: none
@Tex2002ans Well, that's a lot for me to take in. I need to wrap my head around it.

NewPipe and AntennaPod are both open source apps, so they can be customized to support new features.

As for the PocketBook reader, I'm a bit puzzled. You say there's a need for a TTS->MP3 converter, but in the end you'd have just an audio file. Is that enough for you? The app presumably lets you switch between text and audio whenever you need to, tracking your current position for you.

@Doitsu
Do you listen to podcasts or audio books yourself?
Old 12-08-2021, 10:51 AM   #19
kevn57
Connoisseur
Posts: 82
Karma: 10
Join Date: Jun 2011
Location: Albany NY
Device: Moonreader+
Quote:
Originally Posted by Tex2002ans View Post

Note: About a month ago, someone on Reddit recommended "Balabolka" to me, but I haven't tested it yet.

I've been wanting a TTS->MP3 program for quite a long time. If that can pull some of those higher quality, more human-sounding engines, that would be great. (And if I could listen on the computer... sugar on top!)
I used to use the Calibre Viewer but now I use Balabolka portable.

http://balabolka.site/balabolka.htm

It takes the ebook and displays it as plain text, and you have a choice of voices, both installed and online (Google, IBM, and some more). It highlights the text as it's read, so it's easy to follow along. It can create MP3 files with subtitles, which can then be made into a video with text that follows along with the voice. Or the MP3 can be played in an audio player that displays lyrics.

The status bar shows how far (in %) you are into the book, how long you have been reading, and an estimate of how long it will take to finish. It reads many formats and also lets you paste in text to be read.
Old 12-08-2021, 10:59 AM   #20
Tex2002ans
Wizard
Posts: 2,297
Karma: 12126329
Join Date: Jul 2012
Device: Kobo Forma, Nook
Quote:
Originally Posted by Doitsu View Post
You might want to check out the following projects:

[GUI Plugin] TTS to MP3: Create MP3 audiobook using Windows TTS
Heh, yes, yours is one of the TTS->MP3 Plugins I was thinking of.

You're the one who initially helped when I was discussing EPUB->TTS->MP3.

Quote:
Originally Posted by Doitsu View Post
Thanks. Wasn't aware of this.

* * *

(Edit: kevn57 responded after I had already typed my entire post, so I'll leave it as originally written.)

Quote:
Originally Posted by kevn57 View Post
I used to use the Calibre Viewer but now I use Balabolka portable.

[...]

It takes the ebook and displays it as plain text, and you have a choice of voices, both installed and online (Google, IBM, and some more). [...]
Fantastic. I'll definitely look into Balabolka soon.

* * *

Quote:
Originally Posted by fabien.benoit.19 View Post
As for the PocketBook reader, I'm a bit puzzled. [...] The app presumably lets you switch between text and audio whenever you need to, tracking your current position for you.
Yep. PocketBook highlights all the text as it reads along. You can also hop around the ebook (or manually highlight a line of text), then have it begin speaking from that exact location.

For me, some issues are:
  • TTS can only be played on that specific (Android) device.
    • With an MP3, you can take that and carry it anywhere.
      • For example, playing it in the car.
      • Or playing the MP3 on Windows (in my usual music player), with the superior Android voice.
  • TTS via PocketBook is nearly real-time. (Wasting CPU/Battery.)
    • Let's say I have a 100k word book. In PocketBook, the phone would take hours to speak sentence-by-sentence:
      • 100k words / 150 words per minute = 11.1 hours.
    • If you were exporting directly to MP3, you'd be able to generate audio as fast as the CPU can go.
      • Afterwards, playing the final MP3 would barely take any CPU/battery.
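
The real-time estimate above can be sketched as a tiny calculation. The 100k words and 150 words per minute are the figures from the post; the function name is only illustrative.

```python
def listening_hours(word_count: int, words_per_minute: float) -> float:
    """Hours needed to speak word_count words at a steady speaking rate."""
    return word_count / words_per_minute / 60


# The 100k-word book at the 150 wpm used in the post.
hours = listening_hours(100_000, 150)
print(f"{hours:.1f} hours")  # prints "11.1 hours"
```

Doubling the rate to 300 wpm halves the time, which is why the speed setting matters so much for long books.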

So, for example, it would be nice if I could set the basic TTS settings, like:
  • Speed
    • Dense material, like a medical text, or topics I'm unfamiliar with, I'd have to slow way down.
    • A basic news article I want to read through, I can bump it up.
  • Pitch
  • Voice
    • I prefer the default Google TTS Female American English voice.
  • [...]

Then create a finely tuned MP3 just for me.

Then I could take that MP3 and play it anywhere:
  • Computer
    • With superior speakers
    • Or on wired headphones (my damn phone has no headphone jack).
  • Phone
    • If superior MP3 was generated on computer, I'd be able to move MP3 here + listen to it while in bed, doing chores around the house, etc.
  • Car
    • Won't be fiddling with controls, speed/pitch, etc. + can easily play/pause using car controls, just like music.
    • You'd be able to create a whole MP3 playlist, like 12 articles, and have them already prepped for a trip.
  • [...]

With an older version of Android (8??), before Google crippled the API to record TTS->MP3, you used to be able to do this.

But that's right around the time I obsessively dove into all this TTS + podcasting stuff, and began to seriously pay attention to the whole audio side of books!

Quote:
Originally Posted by fabien.benoit.19 View Post
You say there's a need for a TTS->MP3 converter, but in the end you'd have just an audio file. Is that enough for you?
Yep. Just getting the MP3, with higher-quality voices, would be fantastic.

Doesn't matter to me if it's Android or Windows-based, as long as I get that high-quality audio!

And for icing on top:

If it could understand basic HTML or EPUB, then it could do nice things like an MP3-per-chapter, pause slightly while reading Headings, etc.
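
As a rough illustration of the MP3-per-chapter idea, here is a minimal sketch that splits an HTML string into (title, text) chapters at each heading, using only the Python standard library. It assumes chapters start at `<h1>`/`<h2>` tags, which will not hold for every ebook, and the class and function names are made up for this example. Each chapter's text could then be handed to whatever TTS engine is available; that step is not shown.

```python
from html.parser import HTMLParser


class ChapterSplitter(HTMLParser):
    """Collect text into chapters, starting a new chapter at each <h1>/<h2>."""

    HEADINGS = {"h1", "h2"}  # assumption: these tags mark chapter starts

    def __init__(self):
        super().__init__()
        self.chapters = []        # list of (title, body_text) pairs
        self._title = None
        self._body = []
        self._in_heading = False
        self._heading_text = []

    def handle_starttag(self, tag, attrs):
        if tag in self.HEADINGS:
            self._flush()         # a new heading closes the previous chapter
            self._in_heading = True
            self._heading_text = []

    def handle_endtag(self, tag):
        if tag in self.HEADINGS and self._in_heading:
            self._in_heading = False
            self._title = "".join(self._heading_text).strip()
        elif tag == "p":
            self._body.append(" ")  # keep adjacent paragraphs apart

    def handle_data(self, data):
        if self._in_heading:
            self._heading_text.append(data)
        else:
            self._body.append(data)

    def _flush(self):
        # Save the chapter collected so far, with whitespace normalized.
        text = " ".join("".join(self._body).split())
        if self._title is not None or text:
            self.chapters.append((self._title or "Untitled", text))
        self._title, self._body = None, []

    def close(self):
        super().close()
        self._flush()


def split_chapters(html: str):
    parser = ChapterSplitter()
    parser.feed(html)
    parser.close()
    return parser.chapters
```

Keeping the heading text separate from the body is what would let a TTS front end insert a short pause after each title.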

Side Note: Firefox's Reader Mode, on desktop, has a TTS button to read websites:

[Image: Firefox.Reader.Mode.Listen.png]

But, sadly, the Firefox Android version doesn't have this! So, on desktop, you're again limited to the crappy robotic Windows voices.

Side Note #2: I haven't ventured into full Screen Reader programs yet, like JAWS or NVDA.

Perhaps these things already handle some of my use-cases, but from what I gather, these programs drastically change the way things work + introduce performance regressions (like Firefox slowing down due to all the extra Accessibility overhead).

(You can go into extreme customizability with these tools though... like reading advanced HTML + skipping reading out URLs, making specialized noises for <em> or <i>, etc.)

Side Note #3: This MP3-generation may be where "Balabolka" can come into play. Like I mentioned above, I haven't had the chance to fiddle around with it yet.

According to that "Balabolka" video aariatui on Reddit recommended to me, it looks promising for some of my use-cases.

Last edited by Tex2002ans; 12-08-2021 at 11:18 AM.
Old 12-08-2021, 11:21 AM   #21
fabien.benoit.19
Junior Member
Posts: 7
Karma: 10
Join Date: Dec 2021
Location: Minsk, Belarus
Device: none
Let's say you have a budget of about $300 per month: what is the best paid software for this purpose?
Old 12-08-2021, 11:30 AM   #22
Tex2002ans
Wizard
Posts: 2,297
Karma: 12126329
Join Date: Jul 2012
Device: Kobo Forma, Nook
No idea about the commercial side. Like mentioned in Post #6, there is the yearly Interspeech conference.

That's where a lot of the bleeding-edge audio generation research gets discussed.

And so much of the higher-quality TTS has shifted towards cloud-based services, which then charge users per word.

(I believe tools like Balabolka exploit the free "demo" sections on Amazon Polly [IBM, Microsoft, etc.], by sending small snippets of text. No idea if you get rate-limited or what when feeding entire books in there. Usually those demos limit you to a few hundred characters at a time.)
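
To illustrate what "sending small snippets of text" might involve, here is a minimal sketch that cuts a long text into chunks under a character limit, breaking after sentences where possible. The 300-character default is a placeholder for "a few hundred characters", not a documented limit of any service, and how Balabolka actually does this is unknown to me.

```python
import re


def chunk_text(text: str, max_chars: int = 300) -> list[str]:
    """Split text into chunks of at most max_chars, breaking after sentences."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current = [], ""
    for sentence in sentences:
        # A single over-long sentence is hard-split so no chunk exceeds the limit.
        while len(sentence) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        candidate = f"{current} {sentence}".strip()
        if len(candidate) <= max_chars:
            current = candidate      # sentence still fits in the current chunk
        else:
            chunks.append(current)   # start a new chunk with this sentence
            current = sentence
    if current:
        chunks.append(current)
    return chunks
```

A 100k-word book would produce thousands of such chunks, which is exactly where any rate limiting would start to bite.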

Last edited by Tex2002ans; 12-08-2021 at 11:56 AM.
Old 12-08-2021, 12:01 PM   #23
fabien.benoit.19
Junior Member
Posts: 7
Karma: 10
Join Date: Dec 2021
Location: Minsk, Belarus
Device: none
What about podcasts and books themselves? Do they cost you much per month?
Old 12-08-2021, 12:01 PM   #24
Quoth
the rook, bossing Never.
Posts: 11,161
Karma: 85874891
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper11
Quote:
Originally Posted by fabien.benoit.19 View Post
Let's say you have a budget of about $300 per month: what is the best paid software for this purpose?
Crazy. If it's not free or a cheap once off purchase it's inaccessible to the majority of partially sighted or blind.

Cloud is absolute stupidity for it, it needs to be local. That's usually an attempt to monetise people's activities.

Unless you are laying out ICs or similar, renting software is a mug's game. Paid SW for ordinary people should be less than $200, as a once-off purchase. Not an annual licence.

Other than high end CAD/CAE, and some high end publishing software used by only a few specialists, no software should be rental, and only as high as $200 (once off) for high quality software. Some companies have got very greedy with their high price rental models, not even justified by quality.

Last edited by Quoth; 12-08-2021 at 12:05 PM.
Old 12-08-2021, 12:04 PM   #25
fabien.benoit.19
Junior Member
Posts: 7
Karma: 10
Join Date: Dec 2021
Location: Minsk, Belarus
Device: none
@Quoth
There's an open source version trying to compete with cloud tts.
https://github.com/snakers4/silero-models
Old 01-05-2022, 01:28 AM   #26
fabien.benoit.19
Junior Member
Posts: 7
Karma: 10
Join Date: Dec 2021
Location: Minsk, Belarus
Device: none
@Quoth, @Tex2002ans, @kevn57
I can provide an offline tts solution for free, if you help with beta testing. Are you interested?
Old 01-09-2022, 05:15 AM   #27
Notjohn
mostly an observer
Posts: 1,515
Karma: 987654
Join Date: Dec 2012
Device: Kindle
>correct typesetting is "1,000 km" and "12 m"

That's the way I would write it, though with a full stop, but British books, especially from academic presses, often omit the space (and the full stop of course). The book I'm reading at the moment is from Cambridge University Press, by a German-educated Cambridge don.
Old 01-09-2022, 06:23 AM   #28
Jellby
frumious Bandersnatch
Posts: 7,516
Karma: 18512745
Join Date: Jan 2008
Location: Spaniard in Sweden
Device: Cybook Orizon, Kobo Aura
Fullstop as in an abbreviation? That's wrong, units are symbols, and are written without fullstop (but there would be a fullstop if that's the end of a sentence, of course).
Old 01-09-2022, 09:44 PM   #29
Tex2002ans
Wizard
Posts: 2,297
Karma: 12126329
Join Date: Jul 2012
Device: Kobo Forma, Nook
Quote:
Originally Posted by fabien.benoit.19 View Post
@Quoth, @Tex2002ans, @kevn57
I can provide an offline tts solution for free, if you help with beta testing. Are you interested?
Sure. Send me a PM.

Don't know if I'll be able to get time to test it out in-depth, but let me know more info.

Quote:
Originally Posted by Jellby View Post
Fullstop as in an abbreviation? That's wrong, units are symbols, and are written without fullstop (but there would be a fullstop if that's the end of a sentence, of course).


Quote:
Originally Posted by Notjohn View Post
The book I'm reading at the moment is from Cambridge University Press, by a German-educated Cambridge don.
From a quick look, Cambridge University Press's own Style Guide says this:

Quote:
Measurements

Avoid leaving a space between number and unit of measurement (10km, not 10 km). Commonly used measurements include: cm, mm, km, g, l (litre), MB, GB, kHz, MHz, KB. Follow the International System of Units (SI) as a guide for units of measurement. If your source material uses imperial units, always include the metric equivalent in brackets.
but they are wrong.

SI Units should always be typeset with the space (non-breaking and/or thin).

(For more info on that, see "The International System of Units (SI)" standards (PDF).)

Non-breaking Spaces

"10 km" is considered a single chunk, so it would be awful if broken across lines such as:

Code:
The person ran 10
km home.
Typesetting with non-breaking spaces alleviates this problem.

Thin Spaces

The other issue is a normal space is "too large"—making "number + Units" look like 2 separate words—so a thin space between allows them to look like a single, cohesive whole:
  • 10 km (space = Okay/Acceptable)
  • 10 km (thin space = Best)
  • 10km (no space = Wrong)
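
One way to get both properties at once (non-breaking and thin) is U+202F NARROW NO-BREAK SPACE. The sketch below puts that character between a number and its unit, whether the source had a normal space, another space character, or no space at all. The unit list is a deliberately short, illustrative assumption, and a plain regex like this will not cope with every real-world case.

```python
import re

NNBSP = "\u202f"  # NARROW NO-BREAK SPACE: thin and non-breaking

# Illustrative, deliberately short unit list; extend it for real documents.
UNITS = ["km", "cm", "mm", "kg", "kHz", "MHz", "MB", "GB", "m", "g"]

# Longest units first, so "km" is tried before "m".
_UNIT_ALT = "|".join(sorted(map(re.escape, UNITS), key=len, reverse=True))

# A digit, an optional space of any common kind, then a whole-word unit.
_PATTERN = re.compile(rf"(\d)[ \u00a0\u2009\u202f]?({_UNIT_ALT})\b")


def space_units(text: str) -> str:
    """Put exactly one narrow no-break space between a number and its unit."""
    return _PATTERN.sub(lambda m: m.group(1) + NNBSP + m.group(2), text)
```

The trailing `\b` keeps ordinary words safe: "10 more" is left alone because the "m" is not a complete word. In HTML or EPUB source the same character can be written as `&#8239;`.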

This typographical difference becomes much more apparent when working on documents with:
  • many units
    • Including temperatures!
  • equations/formulas
    • Especially inline!
  • large (complicated) units
    • kg/K = "kilograms per kelvin"
    • m/s^2 = "meters per second squared"
    • kJ/kg·K = "kilojoules per kilogram per kelvin"
  • + large (complicated) numbers
    • 1.0635kJ/kg·K (Wrong)
    • 1.0635 kJ/kg·K (Correct)
    • 1.0635 kJ/kg·K (Acceptable)
  • lots of numerical data.

Across single sentences, it doesn't make much difference, but across entire paragraphs + books, it begins to add up.

Here's a random page I ripped out of a Thermodynamics book I worked on:

Thin Space vs. No Space vs. Full Space

[Image: Units.Spacing[ThinSpace].png]
[Image: Units.Spacing[NoSpace].png]
[Image: Units.Spacing[FullSpace].png]

- - -

Side Note: This example was just what I had on hand + whipped up quickly.

Of course, this bad-spacing-between-units issue would look much worse in Word/ebooks. (LaTeX does a good job hiding it.)

- - -

(Inline) Equation Spacing

Similar typesetting care needs to be taken with Maths formulas too.

Take the equation "x MINUS y EQUALS negative three z":

[Image: Hyphen.vs.Minus.-.Mathematical.Spacing.png]
  • x-y=-3z (No Italics = Wrong)
  • x-y=-3z (Hyphens = Wrong)
  • x−y=−3z (Minus Signs = Better)
  • x − y = −3z (+Spacing = Best)

Again, the entire equation is "a single chunk", so:
  • shouldn't be broken across lines (if possible).
    • If there is a linebreak, then it can only be allowed before certain symbols (+, −, =, ...)
  • Mathematical spacing rules apply.
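
The hyphen-to-minus and spacing steps can be sketched for very simple inline equations like the one above. This is a naive illustration under stated assumptions: it treats every hyphen in the input as a minus, only handles `+`, `−` and `=`, and does nothing about italics or line-break control. The function name is made up.

```python
import re

MINUS = "\u2212"  # MINUS SIGN, not the hyphen-minus on the keyboard


def typeset_equation(expr: str) -> str:
    """Naive clean-up for simple inline equations typed with ASCII characters."""
    # Drop existing spaces and turn every hyphen-minus into a real minus sign.
    expr = expr.replace(" ", "").replace("-", MINUS)
    # An operator that follows an operand is binary and gets spaces around it;
    # a minus right after "=" or at the start stays tight (unary, as in -3z).
    return re.sub(rf"(?<=[\w)])([{MINUS}+=])", r" \1 ", expr)


print(typeset_equation("x-y=-3z"))  # prints "x − y = −3z"
```

The lookbehind is what separates the binary minus in "x − y" from the unary one in "−3z", which is the distinction the spacing rules hinge on.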

Here's the sample page above...

Without unit+equation spacing vs. the final "thin spaced" book:

[Image: Units.Spacing[NoSpacePlusMath].png]
[Image: Units.Spacing[ThinSpace].png]

Especially see "S1" and "S2" in the lower paragraph (GIF comparison):

[Image: Units.Spacing.Compare.[ThinSpace.vs.NoSpace].gif]

See the readability difference? Now times that by hundreds of pages.

Quote:
Originally Posted by Notjohn View Post
That's the way I would write it, though with a full stop, but British books, especially from academic presses, often omit the space (and the full stop of course). The book I'm reading at the moment is from Cambridge University Press, by a German-educated Cambridge don.
I think this is similar to the editor's (wrong) rule of "get rid of all double spaces".

That rule was morphed and applied because of decades of typographically-inferior technology:
  • typewriter
    • Only had a single type of space.
  • early computers / word processing programs
  • bad/piss-poor hyphenation/justification algorithms

but there's no need to keep forcing that "single space rule" when we have much better tech now.

Cambridge's editors probably got sick of Word (and similar programs):
  • breaking units across lines
  • adding huge (and inconsistent) gaps
    • Bad justification!

so they decided to go full-insane and combine the numbers+units together...... instead of selectively applying non-breaking spaces.

- - -

Side Note: See my related discussion about non-breaking spaces between initials+last names:
  • 2021: "RegEx: Insert nbs between initials, etc."
    • Antidote can detect+insert non-breaking (or thin) spaces between:
      • Units
      • Initials
      • Punctuation (French)
      • (+ many other cases)
    • LanguageTool can detect+add a space between:
      • Units
      • (Is aware of other cases. May be added some time in the future.)

And see the double-spacing discussion way back in:

Last edited by Tex2002ans; 01-10-2022 at 01:54 AM. Reason: Added GIF.
Old 01-25-2022, 06:25 PM   #30
Tex2002ans
Wizard
Posts: 2,297
Karma: 12126329
Join Date: Jul 2012
Device: Kobo Forma, Nook
I ran across the EPUB3.3 specs today.

Seems like latest draft was released a few days ago (January 22, 2022):

In the EPUB3 TTS document, they discuss using SSML + CSS Speech.

And, perfectly in line with this thread's initial question, they describe:

"How to get TTS to pronounce certain things?"

By using "Pronunciation Lexicons".

You'd use <lexicon> + optionally supply a separate PLS file:

Quote:
EXAMPLE 5. The following example shows a pronunciation lexicon for Japanese.

Code:
<lexicon
     version="1.0"
     alphabet="ipa"
     xml:lang="en"
     xmlns="http://www.w3.org/2005/01/pronunciation-lexicon">
   <lexeme>
      <grapheme>EPUB</grapheme>
      <phoneme>ipʌb</phoneme>
   </lexeme>
   …
</lexicon>
Quote:
Example 6

The following example shows two pronunciation lexicons (one for Mandarin and one for Mongolian) associated with an XHTML Content Document.

Code:
<html … >    
    <head>
        …
        <link rel="pronunciation" type="application/pls+xml" hreflang="cmn" href="../speech/cmn.pls"/>
        <link rel="pronunciation" type="application/pls+xml" hreflang="mn" href="../speech/mn.pls"/>
    </head>        
    …
</html>
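
For anyone wanting to generate such a lexicon rather than hand-write it, here is a minimal sketch using Python's standard `xml.etree.ElementTree`. The namespace, attributes, and the EPUB entry are copied from the quoted example; the function name and the dict-based input are assumptions for illustration, and nothing here validates the result against the PLS schema.

```python
import xml.etree.ElementTree as ET

PLS_NS = "http://www.w3.org/2005/01/pronunciation-lexicon"
XML_NS = "http://www.w3.org/XML/1998/namespace"


def build_lexicon(entries: dict[str, str], lang: str = "en") -> str:
    """Return a PLS document mapping each grapheme to an IPA phoneme string."""
    ET.register_namespace("", PLS_NS)  # serialize PLS as the default namespace
    lexicon = ET.Element(
        f"{{{PLS_NS}}}lexicon",
        {"version": "1.0", "alphabet": "ipa", f"{{{XML_NS}}}lang": lang},
    )
    for grapheme, phoneme in entries.items():
        lexeme = ET.SubElement(lexicon, f"{{{PLS_NS}}}lexeme")
        ET.SubElement(lexeme, f"{{{PLS_NS}}}grapheme").text = grapheme
        ET.SubElement(lexeme, f"{{{PLS_NS}}}phoneme").text = phoneme
    return ET.tostring(lexicon, encoding="unicode")


# Same entry as in the quoted example.
print(build_lexicon({"EPUB": "ipʌb"}))
```

The resulting string could be saved as a `.pls` file and referenced from the XHTML `<head>` with a `<link rel="pronunciation" …>` element like the ones quoted above.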
More technical info on PLS files can be found here:

Now, how likely is it that this actually trickles down into real ebooks + ereaders??? That remains to be seen...

Last edited by Tex2002ans; 01-25-2022 at 06:35 PM.