speech-processing

Here are 567 public repositories matching this topic...

X-LANCE / SLAM-LLM

Speech, Language, Audio, Music Processing with Large Language Model

speech-processing audio-processing peft music-processing large-language-model multimodal-large-language-models

Updated Jun 12, 2024
Python

xmindflow / Awesome_Mamba

Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis

natural-language-processing computer-vision deep-learning time-series survey medical-imaging remote-sensing speech-processing mamba medical-image-processing image-enhancement medical-image-analysis state-space-model medical-image-segmentation gnn large-language-models llm mamba-state-space-models

Updated Jun 11, 2024

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Updated Jun 12, 2024
Python

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

pytorch pretrained-models speaker-recognition speaker-verification speech-processing speaker-diarization voice-activity-detection speech-activity-detection speaker-change-detection speaker-embedding overlapped-speech-detection

Updated Jun 11, 2024
Jupyter Notebook

DigitalPhonetics / IMS-Toucan

Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.

text-to-speech deep-learning toolkit speech pytorch tts speech-synthesis speech-processing

Updated Jun 12, 2024
Python

ictnlp / StreamSpeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

Updated Jun 11, 2024
Python

Voice-Lab / VoiceLab

Automated Reproducible Acoustical Analysis

python python3 open-science speech-processing voice-analysis acoustic-analysis voice-manipulation

Updated Jun 10, 2024
Python

Ryuk17 / SpeechAlgorithms

Speech Algorithms

speech-processing

Updated Jun 9, 2024
C

coqui-ai / open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

text-to-speech tts speech-synthesis voice-recognition speech-recognition speech-to-text stt speech-processing voice-activity-detection speech-separation speech-emotion-recognition voice-cloning

Updated Jun 6, 2024

hanifabd / voice-activity-detection-vad-realtime

Real-time Voice Activity Detection (VAD) with example use cases such as a simple voice bot and live (real-time) transcription

machine-learning websockets voice speech speech-recognition speech-to-text speech-processing web-service voice-assistant voice-bot live-transcript realtime-transcribe

Updated Jun 6, 2024
Python

abikaki / awesome-speech-emotion-recognition

😎 Awesome lists about Speech Emotion Recognition

machine-learning awesome deep-neural-networks deep-learning emotion artificial-intelligence awesome-list human-computer-interaction speech-processing affective-computing sentiment-classification emotion-detection emotion-recognition multimodal-sentiment-analysis speech-emotion-recognition expressive-speech-synthesis multimodal-emotion-recognition emotional-speech speech-emotion-classification

Updated Jun 6, 2024

chimechallenge / chime-utils

Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.

speech-recognition automatic-speech-recognition speech-processing speech-separation speech-enhancement far-field-speech-recognition diarization multi-speaker-asr meeting-transcription

Updated Jun 11, 2024
Python

pliang279 / awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

machine-learning natural-language-processing reinforcement-learning computer-vision deep-learning robotics healthcare reading-list representation-learning speech-processing multimodal-learning

Updated Jun 5, 2024

ddlBoJack / Speech-Resources

Speech-related labs, companies, resources, internships, and more; recommendations and self-nominations are welcome

speech speech-processing

Updated Jun 4, 2024

z3lx / speaker-identification

Speaker identification on audio files using the pyannote/embedding model.

python speech-processing speaker-identification speaker-embedding

Updated Jun 3, 2024
Python

gtreshchev / RuntimeSpeechRecognizer

Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on OpenAI's Whisper technology via whisper.cpp.

voice-recognition speech-recognition openai unreal-engine ue4 speech-to-text whisper speech-processing audio-processing unreal-engine-4 ue4-plugin speech-detection whis ue5 unreal-engine-5 ue5-plugin whisper-cpp whisper-ai

Updated Jun 2, 2024
C++

haoheliu / voicefixer

General Speech Restoration

speech tts speech-synthesis super-resolution speech-processing vocoder speech-analysis denoise mel speech-enhancement dereverberation declipping

Updated May 31, 2024
Python

raj-sutariya / indic-num2words

Python library for converting numbers to words for all Indian Languages.

python nlp preprocessing speech-processing indic indian-languages

Updated May 30, 2024
Python

SuperKogito / fastft

Implementation of a [Librosa](https://github.com/librosa/librosa)-like [STFT](https://en.wikipedia.org/wiki/Short-time_Fourier_transform) using [FFTW](https://www.fftw.org/)

audio signal-processing dsp audio-analysis stft speech-processing audio-processing

Updated May 30, 2024
C

SuperKogito / spafe

🔉 spafe: Simplified Python Audio Features Extraction

Updated May 30, 2024
Python
