![]() |
#1 | ||
Bibliophagist
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 44,446
Karma: 167726581
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Libra Colour, Lenovo M8 FHD, Paperwhite 4, Tolino epos
|
Audiobooks & AI
I got a chuckle today. I was looking at a link from an email and ran into this gem: How AI is Changing the Way We Read and Discover Books
Quote:
I did have to like one of the comments: Quote:
|
||
![]() |
![]() |
![]() |
#2 |
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 7,406
Karma: 52613881
Join Date: Oct 2010
Device: Kindle Fire, Kindle Paperwhite, AGPTek Bluetooth Clip
|
Audible is selling audiobooks narrated by "virtual voice." Most of them seem to be in the Plus catalog and/or quite inexpensive, and I suspect most of them are from self-published authors. It's a lot more clutter to wade through.
|
![]() |
![]() |
Advert | |
|
![]() |
#3 |
eReader Wrangler
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 7,800
Karma: 50741061
Join Date: Mar 2013
Location: Boise, ID
Device: PB HD3, GL3, Tolino Vision 4, Voyage, Clara HD
|
I can always tell when I'm listening to an AI voice on YouTube because they always mispronounce obvious words (usually screwing up long and short vowels in names of famous people). It's kind grating.
|
![]() |
![]() |
![]() |
#4 |
Weirdo
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 811
Karma: 11003000
Join Date: Nov 2019
Location: Wuppertal, Germany
Device: Tolino Shine Color, Tolino Vision 6, Kobo Clara 2E, Boox Note Air 2+
|
I think one has to differentiate 2 things here. Does anyone think that AI generated "audiobooks" can replace professionally narrated audiobooks? Not at the moment and probably not for quite some time.
Can AI generated "audiobooks" be used as a tool for people with visual impairments who couldn't otherwise access books that don't have a professional audiobook available? Probably yes. |
![]() |
![]() |
![]() |
#5 |
Bibliophagist
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 44,446
Karma: 167726581
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Libra Colour, Lenovo M8 FHD, Paperwhite 4, Tolino epos
|
As long as it is clearly stated that the audio book was generated using AI/synthesized voices. So far, that does not seem to be the case as in one audio book one acquaintance recently purchased which has all the hallmarks of an AI generated book and lacks any credits for the voicing. Audible seems to think it is worth as much as a human voiced audio book.
|
![]() |
![]() |
Advert | |
|
![]() |
#6 | |
eReader Wrangler
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 7,800
Karma: 50741061
Join Date: Mar 2013
Location: Boise, ID
Device: PB HD3, GL3, Tolino Vision 4, Voyage, Clara HD
|
Quote:
|
|
![]() |
![]() |
![]() |
#7 | |
eReader Wrangler
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 7,800
Karma: 50741061
Join Date: Mar 2013
Location: Boise, ID
Device: PB HD3, GL3, Tolino Vision 4, Voyage, Clara HD
|
Quote:
|
|
![]() |
![]() |
![]() |
#8 | |
Still reading
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 13,625
Karma: 103503445
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper
|
Quote:
That was 2010, nearly 15 years ago. I had TTS on DOS & Windows before 1995 and wasn't much worse. AI TTS may or may not be be using "AI". But no, nothing so far comes close. It may not ever because a good narrator undestands the text. AI has no understanding. |
|
![]() |
![]() |
![]() |
#9 | |
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 844
Karma: 12122120
Join Date: Jul 2017
Device: Boox Nova 2
|
Quote:
The modern stuff is IMO a lot better: https://suno-ai.notion.site/Bark-Exa...2244ba45ebc2e2 |
|
![]() |
![]() |
![]() |
#10 | |
Still reading
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 13,625
Karma: 103503445
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper
|
Quote:
The good TTS engine was bundled by Hauwei. An SCL-L01 apparently for the Polish market but sold as NOS in Ireland. Pop-in battery, 3.5mm jack socket, SD-Card & SIM slots accessible when battery (cell) popped out. The 2200 mAH cell dated Dec. 2016 (in ISO format). 720 x 1280 pixels. Android 5.1.1 The "flaw" of the mid 1980s TTS I had was it worked best with a custom text file. You could spell phonetically and also add voice modifiers. I wonder what exactly they mean by AI? Just more real speech sampled? I've an early IC that works with ASCII (and commands) on a simple micro-controller. It's certainly poor, but with a suitable text file not much worse than the DXG or Kindle gen3 Keyboard, or USB audio stuck on a Paperwhite 3. There were better PC TTS on XP 8 years before DXG. XP was decent by 2002. About the same time as Apple Mac OS9 was replaced by much better OSX (based on NeXt Step, based on BSD). The current Google "AI" effort is only better than PW3 TTS when using standard USA English texts. Picking a different voice doesn't fix non-UAS English. |
|
![]() |
![]() |
![]() |
#11 | |
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 844
Karma: 12122120
Join Date: Jul 2017
Device: Boox Nova 2
|
Quote:
And it's hard disagree from me. The old TTS of 20-30 years ago is awful in comparison. Listen to the Microsoft SAM SAPI5 example on this Wikipedia page: https://en.wikipedia.org/wiki/Micros...-speech_voices. That's what XP was doing. It's not good compared to the more modern Google Android TTS and way way worse than the "AI" approaches. |
|
![]() |
![]() |
![]() |
#12 | |
Addict
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 212
Karma: 5115190
Join Date: Sep 2024
Device: Kobo Clara BW
|
Quote:
i honestly think AI is a complete white elephant, that hopefully will be just quietly dropped over the coming years. You see lots of flashy headlines saying AI can do this, AI can do that, but once you scratch the surface it usually turns out AI can't actually do whatever, or at least not very well. I was reading an article the other week/ month on how they are (or are planning too) using AI in spotting diseases (I think it was Cancer if I remember rightly) in blood samples. Sounds great except when you read the article the results were still being cross checked by humans as the AI made a lot of mistakes - so basically pointless, the medically trained humans might as well have just checked it in the first place. I know I'm in the very small minority (might just be me on my own ![]() ![]() * Just so there is no confusion, I will qualify my feelings on AI - I hate it with a passion ![]() |
|
![]() |
![]() |
![]() |
#13 | |
Still reading
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 13,625
Karma: 103503445
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper
|
Quote:
The advances are marginal. There are several different aspects and so-called Neural Networks / AI don't do much more than the "voice creation". There is zero understanding and the huge overhead of a Neural Net is a brute force approach to issues like lead as in dog or boss and lead as in lining, sinker etc. *TTS is usable as an accessibility tool, but not as a replacement for a human narrator of Audio books*. At a pinch I can use TTS using Pocketbook on Android. It's certainly the best I've had yet. The offering Google has to "automate" audiobook production for sale is only marginally better for USA standard English texts and no better for anything else. Compare the Hobbit with it and Android phone, Kindle DXG and best XP with a trained human narrator! Selling audiobooks automatically generated is a kind of fraud. You can do nearly as well on your own phone. It's greed. |
|
![]() |
![]() |
![]() |
#14 | |
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 844
Karma: 12122120
Join Date: Jul 2017
Device: Boox Nova 2
|
Quote:
|
|
![]() |
![]() |
![]() |
#15 |
Still reading
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 13,625
Karma: 103503445
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper
|
I'd not listen to an entire book on any TTS. They don't come near human narration. If you are blind or partially sighted they are just about OK for a short story.
Blind & partially sighted have had human narrated audio books since the 19th C. Cassettes are better that CDs as quality is good enough and they remember position. The rise in audio-books for everyone as digital files is a boon to blind & partially sighted, and some are on our local library website (which is free). It was a shame that Audible was bought by Amazon and Amazon have hurt the competitors with their aggressive monopolistic marketing. Audible is too expensive and subscriptions are almost a rip-off for most people. So totally unacceptable that Audible is doing computer generated audiobooks. That's only marginally better than say Pocketbook on Android TTS. I agree that current TTS on a phone is a good bit better than Kindle DXG, kindle gen3 & PW3, but still nowhere near a human reader, especially if content is not standard ordinary USA English. The Kindles mentioned are poor and hardly any improvement on DOS in mid to late 1980s or Windows mid 1990s to today. Win10 is actually more awkward for a blind person than XP and offline speech not enough better. No good to listen to a novel for 4 hours. |
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Chrome OS vs eBooks & Audiobooks | tubemonkey | General Discussions | 39 | 12-13-2013 03:45 PM |
New Forums -- Public Libraries & Audiobooks | tubemonkey | Feedback | 6 | 03-25-2013 11:34 PM |
Free audiobooks - Assorted Comedy & Agatha Christie [CD & MP3] | ATDrake | Deals and Resources (No Self-Promotion or Affiliate Links) | 11 | 08-29-2011 12:19 PM |
Two Free Audiobooks from The Guardian & Audible UK | koland | Deals and Resources (No Self-Promotion or Affiliate Links) | 9 | 09-09-2010 12:19 AM |