Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"
-
Updated
Jun 12, 2024 - Python
Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"
Speaker diarization service
✍ 🗣 A Text-To-Conversation natural language processing toolkit [WIP].
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
An official implementation for SSAMBA: Self-Supervised Audio Mamba
Speaker recognition app using machine learning with recording and import features with GUI.
Speaker identification on audio files using the pyannote/embedding model.
Champion at Brainhack TIL 2023: Team 10000SGDMRT
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
On-device speaker recognition engine powered by deep learning
This project performs speech recognition and diarization (speaker identification) on recordings of conversations. This is followed by sentiment analysis the transcription of each individual.
A Streamlit web application for Voice recognition using a pre-trained speech embedding model.
Backend service for extracting and processing voice data
Official implementation of the ICASSP 2024 paper: Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verification
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.
Speakerbox: Fine-tune Audio Transformers for speaker identification.
This project was done as part of a research teaser project on Speaker Recognition conducted with IIIT Hydrabad.
Deliverables relating to the Speech Technology University Unit (Notes Courtesy to Dr. Andrea De Marco)
Recognizing and identifying Quran reciters from audio recordings.
This repository contains a short introduction on the topic of audio and speech processing -- from basics to applications.
Add a description, image, and links to the speaker-identification topic page so that developers can more easily learn about it.
To associate your repository with the speaker-identification topic, visit your repo's landing page and select "manage topics."