Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Sigil > Plugins

Notices

Reply
 
Thread Tools Search this Thread
Old 01-23-2016, 04:15 PM   #16
DiapDealer
Grand Sorcerer
DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.
 
DiapDealer's Avatar
 
Posts: 27,547
Karma: 193191846
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
Don't mean to butt in ... just wanted to say that many times, memory errors involving large file parsing/processing and Python can often be remedied by making sure you're using a 64-bit version of Python (and have tons of RAM if possible). You may be already, but I just wanted to mention it.

Last edited by DiapDealer; 01-23-2016 at 04:37 PM.
DiapDealer is offline   Reply With Quote
Old 01-23-2016, 04:22 PM   #17
AlPe
Digital Amanuensis
AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.
 
AlPe's Avatar
 
Posts: 727
Karma: 1446357
Join Date: Dec 2011
Location: Turin, Italy
Device: Several eReaders and tablets
Quote:
Originally Posted by eclipse123 View Post
I tried a few audio books - each one is one very long file (6-12 hours) and they all get the same problem, when I try shorter audio files, they work fine.
That explains the issue.

Yes, at the moment aeneas is limited by the amount of RAM your machine has. From https://github.com/readbeyond/aeneas...ssing-features :

Quote:
No protection against memory trashing if you feed extremely long audio files
On my 4 GB RAM laptop, the longest single file I was able to run was about ~1h30m --- the exact size depends on your OS consumption of RAM, other programs running concurrently, etc. (Also on aeneasweb.org there are 4 GB of RAM.)

So, the only thing you can do at the moment consists in splitting that audio file into several pieces (chapters?), and create a task (= audio/text pair) for each.

In theory this memory issue can be worked around --- instead of loading the entire WAVE file in memory, the MFCCs can be computed in "stream mode", and the DTW computed piece-wise --- but that would require a substantial amount of work that I cannot afford to do now.

Thanks for reporting this problem, I will clarify that in the docs, and add a check in aeneas to trigger a human-readable exception.

EDIT: logged as a new issue: https://github.com/readbeyond/aeneas/issues/57

Last edited by AlPe; 01-23-2016 at 04:31 PM.
AlPe is offline   Reply With Quote
Old 01-24-2016, 05:39 AM   #18
eclipse123
Member
eclipse123 began at the beginning.
 
Posts: 13
Karma: 12
Join Date: Feb 2010
Device: X51V
Thanks DiapDealer, AlPe,

I a have 8GB RAM, and windows usually knows to handle more then that if needed. However - it is not limitless - so memory management in Aeneas can definitely be the cause of this issues.

So - next problem :-):
I tried a different book, with smaller files per chapter. and this time I passed the Memory Hurdle and was able to progress. It started processing the first audio file properly, but crashed in segment 273: see the error below.

Any idea?

Another question - what happens if the audio file have slightly different text then the xhtml file? will it cause the software to exit? (I don't think this is the case, but just wondering)


[DEBU] ESPEAKWrapper: Calling with arguments 'espeak -v en -w c:\users\eyal\appdata\local\temp\tmpnph_hj.wav'
[DEBU] ESPEAKWrapper: Calling with text 'The small king suddenly smiled. Γא£Truth be told, I stopped thinking of it as a curse years ago, lad.Γא¥ '
[DEBU] ESPEAKWrapper: Calling espeak ... done
[DEBU] AudioFileMonoWAVE: Loading audio data...
[DEBU] AudioFileMonoWAVE: Sample length: 6.717914
[DEBU] AudioFileMonoWAVE: Sample rate: 22050.000000
[DEBU] AudioFileMonoWAVE: Audio format: pcm16
[DEBU] AudioFileMonoWAVE: Loading audio data... done
[DEBU] ESPEAKWrapper: Duration of 'c:\users\eyal\appdata\local\temp\tmpnph_hj.wav': 6.717914
[DEBU] ESPEAKWrapper: Synthesizing using pure Python... done
[DEBU] ESPEAKWrapper: Removing temporary file 'c:\users\eyal\appdata\local\temp\tmpnph_hj.wav'
[DEBU] ESPEAKWrapper: Synthesizing text... done
[DEBU] ESPEAKWrapper: Fragment 273 starts at: 2375.777098
[DEBU] ESPEAKWrapper: Fragment 273 duration: 6.717914
[DEBU] AudioFileMonoWAVE: Appending audio data...
[DEBU] AudioFileMonoWAVE: audio data is not None: returning True
[DEBU] ESPEAKWrapper: Synthesizing using pure Python... failed
[WARN] ESPEAKWrapper: An unexpected exception occurred while running pure Python code:
[WARN] ESPEAKWrapper:
[CRIT] ExecuteTask: STEP 3 FAILURE
[DEBU] ExecuteTask: Cleaning up...
[DEBU] ExecuteTask: Removing file 'c:\users\eyal\appdata\local\temp\tmpk2ro99.wav'
[DEBU] ExecuteTask: Cleaning up... done
[CRIT] ExecuteJob: Error while executing task 't000001.xhtml': The given log message is not a Unicode string
[CRIT] CLI: An unexpected Exception occurred while executing the job:
[ERRO] An unexpected Exception occurred while executing the job:
[CRIT] CLI: Error while executing task 't000001.xhtml': The given log message is not a Unicode string
[ERRO] Error while executing task 't000001.xhtml': The given log message is not a Unicode string
[DEBU] CLI: Execution completed with code 1
eclipse123 is offline   Reply With Quote
Old 01-24-2016, 07:27 AM   #19
AlPe
Digital Amanuensis
AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.
 
AlPe's Avatar
 
Posts: 727
Karma: 1446357
Join Date: Dec 2011
Location: Turin, Italy
Device: Several eReaders and tablets
Since you are running Windows, espeak is called via subprocess (since the cew C extension is not available), generating a temporary WAVE file for each fragment. Then, the contents of the latter are read and appended to create the global WAVE file.

From the log, it seems that the "append" operation failed, I cannot tell why from the log. It should not be a memory issue, as I see that your audio is below 1h of duration.

This string looks suspicious:

Quote:
'The small king suddenly smiled. Γא£Truth be told, I stopped thinking of it as a curse years ago, lad.Γא¥ '
as it seems to contain spurious characters ("Γא£", "Γא¥") --- albeit I cannot think a reason why it should make aeneas fail. In fact, from the log it seems that espeak synthesized it correctly. Anyway, you might want to clean that text and see if it helps.

===

As I wrote, Windows at the moment is not officially supported by aeneas, mainly because I am the only developer and I do not use Windows.

If you want to run aeneas on your Windows machine, I strongly recommend using VirtualBox + Vagrant: https://github.com/readbeyond/aeneas-vagrant
AlPe is offline   Reply With Quote
Old 01-24-2016, 08:00 AM   #20
AlPe
Digital Amanuensis
AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.
 
AlPe's Avatar
 
Posts: 727
Karma: 1446357
Join Date: Dec 2011
Location: Turin, Italy
Device: Several eReaders and tablets
Quote:
Originally Posted by eclipse123 View Post
Another question - what happens if the audio file have slightly different text then the xhtml file? will it cause the software to exit? (I don't think this is the case, but just wondering)
aeneas, doing forced alignment via MFCC+DTW and not doing speech recognition, is "robust against misspelled/mispronounced words, local rearrangements of words, background noise/sporadic spikes". (From: https://github.com/readbeyond/aeneas...orted-features )
AlPe is offline   Reply With Quote
Old 01-26-2016, 07:00 AM   #21
eclipse123
Member
eclipse123 began at the beginning.
 
Posts: 13
Karma: 12
Join Date: Feb 2010
Device: X51V
Thanks AlPe,
yes - I think the next step is to take this to linux...
I'll try that in a month or so when I have a bit more time. thanks a lot for your help and pointers so far.

BTW, I did try a few ebook readers with some sample books from your site, and as you noted, none of them support this format :-(

Thanks
Eyal
eclipse123 is offline   Reply With Quote
Old 01-28-2016, 02:54 PM   #22
AlPe
Digital Amanuensis
AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.
 
AlPe's Avatar
 
Posts: 727
Karma: 1446357
Join Date: Dec 2011
Location: Turin, Italy
Device: Several eReaders and tablets
You can install VirtualBox, create a Linux (Debian or Ubuntu) virtual machine, and run aeneas there.

aeneas-vagrant automates the setup, you just need to have VirtualBox and Vagrant installed.

=== === ===

BTW, I plan to release the next version of aeneas by the end of February 2016. I will rewrite the C extension that computes the MFCCs so that they will be computed in "stream mode", without requiring the whole WAVE file to be load in memory. This way aeneas should be able to process huge files.

(Note: in theory, for RIFF WAVE PCM 16bit 16 kHz --- the minimum setting for achieving good results --- the audio file cannot be longer than ~37.28 hours.)

=== === ===

Reading systems: yes, the app landscape is sad and disappointing, especially considering the fact that e.g. the DAISY Consortium voted for (reflowable) EPUB 3 + Media Overlays as the evolution of their DAISY Talking Book format.
AlPe is offline   Reply With Quote
Old 02-13-2016, 07:50 AM   #23
AlPe
Digital Amanuensis
AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.
 
AlPe's Avatar
 
Posts: 727
Karma: 1446357
Join Date: Dec 2011
Location: Turin, Italy
Device: Several eReaders and tablets
Added icarus 0.0.3. Please use Sigil 0.9.3 and ePub3-itizer 0.3.6 with it. (Otherwise stick with Sigil 0.9.2 and ePub3-itizer 0.3.5 and icarus 0.0.2)

Also note that today I released a new version of aeneas (1.4.1) and updated the http://aeneasweb.org Web App (0.0.8).
AlPe is offline   Reply With Quote
Old 09-23-2016, 08:03 AM   #24
AlPe
Digital Amanuensis
AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.
 
AlPe's Avatar
 
Posts: 727
Karma: 1446357
Join Date: Dec 2011
Location: Turin, Italy
Device: Several eReaders and tablets
Is someone still using the icarus plugin?
AlPe is offline   Reply With Quote
Old 12-08-2016, 04:30 PM   #25
AlPe
Digital Amanuensis
AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.AlPe ought to be getting tired of karma fortunes by now.
 
AlPe's Avatar
 
Posts: 727
Karma: 1446357
Join Date: Dec 2011
Location: Turin, Italy
Device: Several eReaders and tablets
In case someone is still using icarus, today I tested that the following triple works:

- Sigil 0.9.7
- icarus 0.0.3
- ePub3-itizer 0.3.6

Enjoy
AlPe is offline   Reply With Quote
Old 12-03-2017, 01:40 PM   #26
olavimmanuel
Junior Member
olavimmanuel began at the beginning.
 
Posts: 1
Karma: 10
Join Date: Dec 2017
Device: iPad Air
Smile Icarus still flies!

Thank you AlPe for the amazing Icarus and aeneas!

For anyone else using icarus; the following (latest versions atm) triple works:

- Sigil 0.9.8
- icarus 0.0.3
- ePub3-itizer 0.3.8

Enjoy
olavimmanuel is offline   Reply With Quote
Old 12-05-2017, 10:03 AM   #27
KevinH
Sigil Developer
KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.
 
Posts: 7,637
Karma: 5433388
Join Date: Nov 2009
Device: many
Glad to hear it!
KevinH is online now   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
epubcheck plugin for Sigil Doitsu Plugins 502 02-20-2024 06:58 AM
kindlegen plugin for Sigil Doitsu Plugins 169 02-16-2024 06:48 AM
[Plugin] KindleImport Sigil plugin DiapDealer Plugins 187 07-04-2022 10:11 AM
Sigil Plugin Index Thasaidon Plugins 0 10-04-2014 07:41 AM
Trying to load the Icarus firmware back on Icarus EXcel einkuser Onyx Boox 1 09-01-2014 11:34 PM


All times are GMT -4. The time now is 08:28 PM.


MobileRead.com is a privately owned, operated and funded community.