speech-to-text

Here are 2,892 public repositories matching this topic...

OpenVoiceOS / status

Open Voice OS Status Page

status text-to-speech translator monitoring alerting cuda sam nvidia tts uptime stats speech-to-text stt piper ovos upptime openvoiceos fasterwhisper mimic3

Updated Jun 12, 2024
Markdown

MahmoudAshraf97 / whisper-diarization

Star

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

speech speech-recognition speech-to-text whisper asr speaker-diarization

Updated Jun 12, 2024
Jupyter Notebook

jianchang512 / pyvideotrans

Star

Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言，并添加配音

text-to-speech speech-to-text video-transition

Updated Jun 12, 2024
Python

ErcinDedeoglu / WhisperDock

Star

Dockerized Whisper C++ speech-to-text API for easy deployment and rapid integration. Offering the latest stable and nightly builds for efficient audio transcription.

api docker machine-learning speech-to-text audio-transcription whisper-cpp

Updated Jun 12, 2024
C++

deepgram / deepgram-go-sdk

Star

Go SDK for Deepgram's automated speech recognition APIs.

go speech-recognition speech-to-text hacktoberfest deepgram

Updated Jun 11, 2024
Go

occ-ai / obs-localvocal

Star

OBS plugin for local speech recognition and captioning using AI

plugin translation ai livestream live-streaming speech-recognition speech-to-text obs transcription obs-studio whisper realtime-translator obs-studio-plugin realtime-transcribe openai-whisper whisper-cpp real-time-transcription

Updated Jun 11, 2024
C++

barrylee111 / voicechat-LLM

Star

A chatbot with both prompt and voicechat capabilities. When using voicechat, the user can immerse themselves in the experience by selecting a narrator, like a pirate for instance.

react python text-to-speech websocket speech-to-text whisper fastapi largelanguagemodel

Updated Jun 11, 2024
Python

richardrigutins / my-transcripts

Star

Web app that converts speech to a text transcript and lets you save the generated transcripts to OneDrive using Microsoft Graph

graph dotnet azure dotnet-core speech-to-text cognitive-services microsoft-graph microsoft-graph-sdk blazor blazor-server hacktogether hack-together

Updated Jun 11, 2024
C#

speechbrain / speechbrain

Star

A PyTorch-based Speech Toolkit

Updated Jun 11, 2024
Python

KevKibe / African-Whisper

Star

🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.

speech speech-recognition speech-to-text whisper asr speech-translation speech-transcription

Updated Jun 11, 2024
Python

mkiol / dsnote

Star

Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.

text-to-speech translator translation offline machine-translation sailfishos tts speech-synthesis speech-recognition speech-to-text nmt linux-desktop stt asr flatpak-applications

Updated Jun 11, 2024
C++

aws-solutions / content-localization-on-aws

Star

Automatically generate multi-language subtitles using AWS AI/ML services. Machine generated subtitles can be edited to improve accuracy and downstream tracks will automatically be regenerated based on the edits. Built on Media Insights Engine (https://github.com/awslabs/aws-media-insights-engine)

audio nlp video localization vod media localisation captions subtitles speech-to-text amazon-polly nlp-machine-learning content-analysis mie video-on-demand amazon-comprehend amazon-translate amazon-transcribe aws-media-insights-engine

Updated Jun 11, 2024
Vue

baharudin-yusup / salingsapa

Star

A video call apps to enable deaf people to communicate with normal people using sign language recognition and speech-to-text

android ios text-to-speech firebase webrtc clean-architecture speech-to-text flutter bloc agora sign-language-recognition tensorflow-lite codemagic

Updated Jun 11, 2024
Dart

leon-ai / leon

Star

🧠 Leon is your open-source personal assistant.

Updated Jun 11, 2024
Python

ggerganov / whisper.cpp

Sponsor

Star

Port of OpenAI's Whisper model in C/C++

inference transformer speech-recognition openai speech-to-text whisper

Updated Jun 11, 2024
C

VladislavAntonyuk / MauiSamples

Star

.NET MAUI Samples

markdown ios paint xamarin dotnet azure sqlite ukraine xamarin-forms speech-to-text hacktoberfest kanban-board maui blazor ios-extensions dotnet-maui maui-blazor dotnet-maui-blazor

Updated Jun 11, 2024
C#

frank038 / gspeechread

Star

A simple speech-to-text and text-to-speech program/frontend.

linux text-to-speech python3 gtk3 speech-recognition speech-to-text

Updated Jun 11, 2024
Python

k2-fsa / sherpa-onnx

Star

Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript