Implement speaker diarization, voice activity detection, and/or conversation endpointing. #97

kaeladair · 2024-03-20T01:37:41Z

Implement speaker diarization and VAD. This will let the agent understand who is speaking, providing the user with better responses. This should also get rid of audio hallucinations when there is silence, very important as the wearable will be recording during silence often if worn all the time.

Potential implementations:
https://github.com/pyannote/pyannote-audio
Deepgram

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement speaker diarization, voice activity detection, and/or conversation endpointing. #97

Implement speaker diarization, voice activity detection, and/or conversation endpointing. #97

kaeladair commented Mar 20, 2024

Implement speaker diarization, voice activity detection, and/or conversation endpointing. #97

Implement speaker diarization, voice activity detection, and/or conversation endpointing. #97

Comments

kaeladair commented Mar 20, 2024