Port of OpenAI's Whisper model in C/C++
-
Updated
Jun 17, 2025 - C++
Port of OpenAI's Whisper model in C/C++
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Faster Whisper transcription with CTranslate2
🧠 Leon is your open-source personal assistant.
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
kaldi-asr/kaldi is the official location of the Kaldi project.
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,同时支持语音识别转录、语音合成、字幕翻译。
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
A PyTorch-based Speech Toolkit
Speech recognition module for Python, supporting several engines and APIs, online and offline.
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
💬 Speech recognition for your site
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, support 12 programming languages
Multilingual Voice Understanding Model
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
Speech To Speech: an effort for an open-sourced and modular GPT4-o
Add a description, image, and links to the speech-to-text topic page so that developers can more easily learn about it.
To associate your repository with the speech-to-text topic, visit your repo's landing page and select "manage topics."