Open Voice OS Status Page
Updated Jun 12, 2024 - Markdown
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Translate a video from one language to another and add dubbing.
Dockerized Whisper C++ speech-to-text API for easy deployment and rapid integration. Offering the latest stable and nightly builds for efficient audio transcription.
Go SDK for Deepgram's automated speech recognition APIs.
OBS plugin for local speech recognition and captioning using AI
A chatbot with both prompt and voice chat capabilities. In voice chat, the user can select a narrator, such as a pirate, for a more immersive experience.
Web app that converts speech to a text transcript and lets you save the generated transcripts to OneDrive using Microsoft Graph
A PyTorch-based Speech Toolkit
🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.
Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.
Automatically generate multi-language subtitles using AWS AI/ML services. Machine generated subtitles can be edited to improve accuracy and downstream tracks will automatically be regenerated based on the edits. Built on Media Insights Engine (https://github.com/awslabs/aws-media-insights-engine)
A video call app that enables deaf people to communicate with hearing people using sign language recognition and speech-to-text
🧠 Leon is your open-source personal assistant.
Port of OpenAI's Whisper model in C/C++
.NET MAUI Samples
A simple speech-to-text and text-to-speech program/frontend.
Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime, without an Internet connection. Supports embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript
Wyoming protocol server for Microsoft Azure speech-to-text
Open-source, accurate, and easy-to-use video speech recognition and clipping tool with integrated LLM-based AI clipping.