Quote:
Originally Posted by eclipse123
Another question - what happens if the audio file has slightly different text than the XHTML file? Will it cause the software to exit? (I don't think this is the case, but just wondering)
aeneas performs forced alignment via MFCC+DTW rather than speech recognition, so it is "robust against misspelled/mispronounced words, local rearrangements of words, background noise/sporadic spikes". (From:
https://github.com/readbeyond/aeneas...orted-features )
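
To give a feel for why a DTW-based aligner tolerates small text/audio mismatches, here is a toy sketch in plain Python. It is not aeneas's code: `dtw_path`, the 1-D "feature" lists, and the absolute-difference cost are all made up for illustration (real MFCC frames are vectors). The point is only that DTW always returns a complete alignment path; a local mismatch raises the cost around that spot instead of making the alignment fail.

```python
def dtw_path(a, b):
    """Classic DTW on two 1-D sequences (hypothetical helper, not aeneas's API).

    Returns (total_cost, path), where path is a list of (i, j) index pairs
    mapping positions in a to positions in b.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = cheapest way to align a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])  # toy local distance
            cost[i][j] = d + min(cost[i - 1][j - 1],  # match
                                 cost[i - 1][j],      # a advances alone
                                 cost[i][j - 1])      # b advances alone
    # Walk back from the end to recover the alignment path.
    i, j = n, m
    path = []
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        candidates = [
            (cost[i - 1][j - 1], i - 1, j - 1),
            (cost[i - 1][j], i - 1, j),
            (cost[i][j - 1], i, j - 1),
        ]
        _, i, j = min(candidates)
    path.reverse()
    return cost[n][m], path


# Assumed toy data: the "audio" is a stretched copy of the "text" features,
# with one locally different value (9 where the text has 4).
text_features = [1, 2, 3, 4, 5, 6]
audio_features = [1, 1, 2, 3, 9, 5, 6, 6]

total, path = dtw_path(text_features, audio_features)
print(total)
print(path)
```

The path still runs from the first pair to the last pair; the mismatched value just makes that stretch of the alignment more expensive. Larger differences (whole paragraphs missing or added) are a different story and can still throw the timings off.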