Quote:
Originally Posted by eclipse123
I tried a few audio books - each one is one very long file (6-12 hours) and they all get the same problem, when I try shorter audio files, they work fine.
|
That explains the issue.
Yes, at the moment aeneas is limited by the amount of RAM your machine has. From
https://github.com/readbeyond/aeneas...ssing-features :
Quote:
No protection against memory trashing if you feed extremely long audio files
|
On my 4 GB RAM laptop, the longest single file I was able to run was about ~1h30m --- the exact size depends on your OS consumption of RAM, other programs running concurrently, etc. (Also on aeneasweb.org there are 4 GB of RAM.)
So, the only thing you can do at the moment consists in splitting that audio file into several pieces (chapters?), and create a task (= audio/text pair) for each.
In theory this memory issue can be worked around --- instead of loading the entire WAVE file in memory, the MFCCs can be computed in "stream mode", and the DTW computed piece-wise --- but that would require a substantial amount of work that I cannot afford to do now.
Thanks for reporting this problem, I will clarify that in the docs, and add a check in aeneas to trigger a human-readable exception.
EDIT: logged as a new issue:
https://github.com/readbeyond/aeneas/issues/57