Speech Recognition API

This is a speech recognition API service based on PaddleSpeech and built with FastAPI.

Features

Recognizes audio files in WAV format
Uses PaddleSpeech's deepspeech2online_wenetspeech model
Provides a RESTful API
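
Because the service accepts only WAV input, it can help to check a file locally before uploading it. The following is a minimal sketch using Python's standard `wave` module; the helper names `describe_wav` and `make_silent_wav` are hypothetical, and the 16 kHz mono default is an assumption for illustration, not a documented requirement of this service.

```python
import io
import wave


def describe_wav(data: bytes) -> dict:
    """Return basic properties of WAV audio held in memory.

    Raises wave.Error (or EOFError for truncated input) if the bytes
    are not a readable PCM WAV file.
    """
    with wave.open(io.BytesIO(data), "rb") as wav:
        frames = wav.getnframes()
        rate = wav.getframerate()
        return {
            "channels": wav.getnchannels(),
            "sample_width_bytes": wav.getsampwidth(),
            "sample_rate_hz": rate,
            "duration_s": frames / rate if rate else 0.0,
        }


def make_silent_wav(seconds: float = 1.0, rate: int = 16000) -> bytes:
    """Build a mono, 16-bit silent WAV clip (assumed format, for smoke tests)."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(1)   # mono
        wav.setsampwidth(2)   # 16-bit samples
        wav.setframerate(rate)
        wav.writeframes(b"\x00\x00" * int(seconds * rate))
    return buffer.getvalue()
```

A clip produced by `make_silent_wav` can be written to disk and used as a placeholder input when trying out the endpoints described later.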

Running locally

Install dependencies:

pip install -r requirements.txt

Start the service:

python -m uvicorn app:app --reload --host 0.0.0.0 --port 8011

Alternatively, start the service without auto-reload:

uvicorn app:app --host 0.0.0.0 --port 8011

Running with Docker

Build the image:

docker build -t speech-recognition-api .

Run the container:

docker run -p 8011:8011 speech-recognition-api

API usage

Speech recognition

curl -X POST "http://localhost:8011/recognize" \
  -H "accept: application/json" \
  -H "Content-Type: multipart/form-data" \
  -F "audio=@your_audio.wav"
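
The same request can be sent from Python. The sketch below uses only the standard library and mirrors the curl call: a `multipart/form-data` POST to `/recognize` with the file in a field named `audio`. The helper names `build_multipart` and `recognize` are hypothetical, and because the response schema is not documented here, the code only assumes the body is a JSON object.

```python
import json
import urllib.request
import uuid


def build_multipart(field: str, filename: str, payload: bytes,
                    content_type: str = "audio/wav") -> tuple[bytes, str]:
    """Encode one file as multipart/form-data; return (body, Content-Type header)."""
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field}"; filename="{filename}"\r\n'
        f"Content-Type: {content_type}\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return head + payload + tail, f"multipart/form-data; boundary={boundary}"


def recognize(wav_path: str, base_url: str = "http://localhost:8011") -> dict:
    """POST a WAV file to /recognize and return the decoded JSON response."""
    with open(wav_path, "rb") as f:
        body, header = build_multipart("audio", "audio.wav", f.read())
    request = urllib.request.Request(
        f"{base_url}/recognize",
        data=body,
        headers={"Content-Type": header, "Accept": "application/json"},
        method="POST",
    )
    # Assumption: the service replies with a JSON object.
    with urllib.request.urlopen(request, timeout=60) as response:
        return json.loads(response.read().decode("utf-8"))
```

With the service running, `print(recognize("your_audio.wav"))` would show whatever JSON the endpoint returns.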

Health check

curl "http://localhost:8011/health"
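
When starting the service from a script or a CI job, it can be useful to wait until `/health` responds before sending audio. The following is a minimal polling sketch with the standard library; `is_healthy` and `wait_until_healthy` are hypothetical helper names, and treating HTTP 200 as "healthy" is an assumption, since the response body of `/health` is not described here.

```python
import time
import urllib.error
import urllib.request


def is_healthy(base_url: str = "http://localhost:8011", timeout: float = 2.0) -> bool:
    """Return True if GET /health answers with HTTP 200 (assumed success signal)."""
    try:
        with urllib.request.urlopen(f"{base_url}/health", timeout=timeout) as response:
            return response.status == 200
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, or a non-2xx status all count as unhealthy.
        return False


def wait_until_healthy(base_url: str = "http://localhost:8011",
                       attempts: int = 30, delay: float = 1.0) -> bool:
    """Poll /health until it succeeds or the attempts run out."""
    for _ in range(attempts):
        if is_healthy(base_url):
            return True
        time.sleep(delay)
    return False
```

A startup script could call `wait_until_healthy()` right after `docker run` and exit with an error if it returns `False`.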

Docker image

The Docker image is available from GitHub Container Registry:

docker pull ghcr.io/your-username/speech-recognition-api:latest

Development

Clone the repository:

git clone https://github.com/your-username/speech-recognition-api.git
cd speech-recognition-api

Install development dependencies:

pip install -r requirements.txt

Run the tests:

pytest

speech_python

Speech-related services, wrapped with FastAPI.

Open-source model used: https://github.com/FunAudioLLM/SenseVoice
