Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
-
Updated
May 1, 2025 - Jupyter Notebook
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
A PyTorch-based Speech Toolkit
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
A nearly-live implementation of OpenAI's Whisper.
🇨🇳 功能全面的汉字工具库 (拼音 笔画 偏旁 成语 语音 可视化等) (Chinese character util)
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
🎤 React Native Voice Recognition library for iOS and Android (Online and Offline Support)
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not excluded that more models will be supported in the future. At the same time, this project also supports MelSpectrogram, Spectrogram data preprocessing methods
Python AI assistant 🧠
Easily select, start, and manage your preferred AI digital assistant
A lightweight, simple-to-use, RNN wake word listener
Jarvis.sh is a simple configurable multi-lang assistant.
Captains log and 3d star map for Elite Dangerous
可本地部署的AI语音工具箱 | A user-friendly audio toolkit for voice recognition, voice transcription, voice conversion etc.
On-device Speech-to-Intent engine powered by deep learning
speech to text benchmark framework
On-device voice assistant platform powered by deep learning
Add a description, image, and links to the voice-recognition topic page so that developers can more easily learn about it.
To associate your repository with the voice-recognition topic, visit your repo's landing page and select "manage topics."