10-18-2024, 02:51 PM   #12
noodler
It looks like an audio clip is only getting sent to the device when the audio state is toggled.

These are the logs from the moment I cancel the settings dialog after 10 s without audio, which causes one sentence to be read out loud:

Code:
[10.45] Audio state: State.IdleState
[10.45] Utterance 1 audio output finished
[10.47] Audio sent to output: maxlen=16384 len(ans)=16384
[10.47] Audio state: State.ActiveState
At all other times, all I see is piper zooming through the synthesis and "Waiting for audio to finish playing..." messages, with no errors but also no sound and no "audio output finished" messages, e.g.:

Code:
[1.22] Utterance 3 synthesis started
[1.22] Synthesized data read: 36864 bytes
[1.22] [piper-debug] Phonemizing text: “I hold at your neck the gom jabbar,” she said.
[1.22] [piper-debug] Converting 50 phoneme(s) to ids: aɪ hˈoʊld æt jʊɹ nˈɛk ðə ɡˈɑːm dʒˈæbɑːɹ, ʃiː sˈɛd.
[1.22] [piper-debug] Converted 50 phoneme(s) to 103 phoneme id(s): xxx
[1.22] [piper-debug] Synthesizing audio for 103 phoneme id(s)
[1.50] [piper-debug] Synthesized 2.2639455782312927 second(s) of audio in 0.280647179 second(s)
[1.50] Synthesized data read: 65536 bytes
[1.50] [piper-info] Waiting for audio to finish playing...
[1.50] [piper-info] Real-time factor: 0.13556893212154525 (infer=0.9396494800000001 sec, audio=6.931156462585034 sec)
[1.50] Utterance 3 got 102400 bytes of audio data from piper
[1.50] Utterance 4 synthesis started
[1.50] Synthesized data read: 34304 bytes
[1.50] [piper-debug] Phonemizing text: “The gom jabbar, the highhanded enemy.
[1.50] [piper-debug] Converting 40 phoneme(s) to ids: ðə ɡˈɑːm dʒˈæbɑːɹ, ðə hˈaɪhændᵻd ˈɛnəmi.
[1.50] [piper-debug] Converted 40 phoneme(s) to 83 phoneme id(s): xxx
[1.50] [piper-debug] Synthesizing audio for 83 phoneme id(s)
[1.79] [piper-debug] Synthesized 2.345215419501134 second(s) of audio in 0.28609584 second(s)
[1.79] Synthesized data read: 65536 bytes
[1.79] [piper-info] Waiting for audio to finish playing...
[1.79] [piper-info] Real-time factor: 0.13213628513180542 (infer=1.2257453200000001 sec, audio=9.276371882086167 sec)
[1.79] Utterance 4 got 99840 bytes of audio data from piper
[1.79] Utterance 5 synthesis started
Please find the full log attached.
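
One way to check this pattern across a full log is a small script along the lines below. It is only a rough sketch and not part of the program that produced the logs: the two regexes assume the message wording seen in the excerpts above ("Utterance N got M bytes of audio data from piper" and "Utterance N audio output finished"), and `unplayed_utterances` is a made-up helper name. It lists utterances that received audio data from piper but never logged a finished playback.

```python
import re

# Assumption: these patterns mirror the log lines quoted in this post;
# the real log may use other wording elsewhere.
SYNTH_DONE = re.compile(r"Utterance (\d+) got (\d+) bytes of audio data from piper")
PLAYED = re.compile(r"Utterance (\d+) audio output finished")


def unplayed_utterances(log_text):
    """Map utterance number -> byte count for utterances that were
    synthesized but never reported as finished playing."""
    synthesized = {}
    played = set()
    for line in log_text.splitlines():
        match = SYNTH_DONE.search(line)
        if match:
            synthesized[int(match.group(1))] = int(match.group(2))
            continue
        match = PLAYED.search(line)
        if match:
            played.add(int(match.group(1)))
    return {n: size for n, size in synthesized.items() if n not in played}


# A few lines copied from the excerpts above.
sample = """\
[10.45] Utterance 1 audio output finished
[1.50] Utterance 3 got 102400 bytes of audio data from piper
[1.79] Utterance 4 got 99840 bytes of audio data from piper
"""
print(unplayed_utterances(sample))
```

If most utterances show up in the result, that would be consistent with audio being synthesized but not reaching the output until the audio state changes.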