ruzhila/voiceapi

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ruzhila/voiceapi)

ruzhila / voiceapi

Streaming ASR and TTS based on FastAPI+ sherpa-onnx

☆222

Alternatives and similar repositories for voiceapi

Users that are interested in voiceapi are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

k2-fsa / sherpa-onnx
View on GitHub
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime…
☆13,783Updated this week
mawwalker / stt-server
View on GitHub
stt websockect server using sherpa-onnx
☆57Feb 28, 2026Updated 4 months ago
hfyydd / sherpa-onnx-server
View on GitHub
☆48Jan 20, 2025Updated last year
jundaychan / funasr-fastapi
View on GitHub
funasr语音转文字的简单api版本，funasr+fastapi，方便部署在服务器上
☆13Aug 10, 2024Updated last year
pengzhendong / streaming-sensevoice
View on GitHub
Pseudo Streaming SenseVoice with Hotwords
☆467Jun 15, 2026Updated last month
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Jason-chen-coder / Flutter-EasySpeechRecognition
View on GitHub
基于 Sherpa-ONNX 实现在线下载模型的端侧实时语音识别应用（Implement speech recognition based on Sherpa-ONNX by downloading the model online.）
☆30Feb 27, 2025Updated last year
lovemefan / fsmn-vad
View on GitHub
A enterprise-grade Voice Activity Detector from modelscope and funasr.
☆139Apr 26, 2023Updated 3 years ago
lovemefan / SenseVoice.cpp
View on GitHub
Port of Funasr's Sense-voice model in C/C++
☆568Dec 19, 2025Updated 7 months ago
qkl9527 / voice-assistant
View on GitHub
基于Funasr的[实时]AI语音助手
☆25Dec 18, 2025Updated 7 months ago
bbeyondllove / asr_server
View on GitHub
一个基于 Sherpa-ONNX 的高性能语音识别服务，支持实时VAD（语音活动检测）、多语言语音识别和声纹识别功能。
☆115Jan 4, 2026Updated 6 months ago
xphh / fireredasr-streaming
View on GitHub
low-latency realtime ASR based on FireRedASR
☆62Jul 8, 2025Updated last year
0x5446 / api4sensevoice
View on GitHub
API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,…
☆538Oct 23, 2024Updated last year
HaujetZhao / SenseVoice-ONNX
View on GitHub
SenseVoice-Small 导出为 ONNX，支持热词注入，在 CTC 的输空间中通过路径匹配，1ms 内实现热词替换
☆28Jun 3, 2026Updated last month
wangzhaode / mnn-asr
View on GitHub
mnn asr demo.
☆27Mar 24, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
k2-fsa / ZipVoice
View on GitHub
Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching
☆1,019Dec 2, 2025Updated 7 months ago
HaujetZhao / FunASR-Online-Paraformer-Test
View on GitHub
☆52Nov 26, 2023Updated 2 years ago
jianchang512 / sense-api
View on GitHub
用于SenseVoice的api项目，输出带时间戳字幕
☆49Oct 28, 2024Updated last year
ABexit / ASR-LLM-TTS
View on GitHub
This is a speech interaction system built on an open-source model, integrating ASR, LLM, and TTS in sequence. The ASR model is SenceVoice…
☆1,262Jun 3, 2026Updated last month
xinhecuican / QSmartAssistant
View on GitHub
一个模块化，全过程可离线，低占用率的对话机器人/智能音箱
☆159Mar 25, 2026Updated 4 months ago
lukeewin / FunASR_API
View on GitHub
这是基于FunASR实现的区分说话人语音识别API | This is a speaker-diarization-based speech recognition API implemented using FunASR.
☆27Jun 16, 2026Updated last month
lovemefan / paraformer.cpp
View on GitHub
Port of Funasr's Paraformer model in C/C++
☆43Jun 19, 2024Updated 2 years ago
TEN-framework / ten-vad
View on GitHub
Voice Activity Detector (VAD) : low-latency, high-performance and lightweight
☆2,204Feb 2, 2026Updated 5 months ago
FireRedTeam / FireRedASR
View on GitHub
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR be…
☆1,940Feb 25, 2026Updated 5 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
k2-fsa / sherpa-mlx
View on GitHub
sherpa with mlx
☆15Aug 2, 2025Updated 11 months ago
pengzhendong / asr-decoder
View on GitHub
CTC decoder with hotwords for ASR.
☆38Jun 15, 2026Updated last month
modelscope / FunASR
View on GitHub
Open-source speech recognition toolkit for training, inference, streaming ASR, VAD, punctuation, speaker diarization pipelines, and OpenA…
☆19,467Updated this week
wwbin2017 / bailing
View on GitHub
百聆是一个类似GPT-4o的语音对话机器人，通过ASR+LLM+TTS实现，集成DeepSeek R1等优秀大模型，接入openClaw，真正的个人语音助手，时延低至800ms，Mac等低配置也可运行，支持打断
☆1,742Apr 6, 2026Updated 3 months ago
wxqwinner / silero-vad-ncnn
View on GitHub
Silero VAD(ncnn): pre-trained enterprise-grade Voice Activity Detector.
☆26Aug 21, 2024Updated last year
lovemefan / Paraformer-webserver
View on GitHub
paraformer web server build with sanic
☆28May 3, 2023Updated 3 years ago
lukeewin / AudioSeparationGUI
View on GitHub
这是一款基于FunASR实现的说话人分离的GUI程序
☆163Dec 14, 2025Updated 7 months ago
marcinmatys / whisper_streaming
View on GitHub
Whisper realtime streaming for long speech-to-text transcription and translation
☆22Nov 4, 2024Updated last year
k2-fsa / sherpa
View on GitHub
Speech-to-text server framework with next-gen Kaldi
☆962Updated this week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
patientx / F5-TTS-ONNX-gui
View on GitHub
Running the F5-TTS by ONNX Runtime standalone with GUI
☆27Dec 10, 2024Updated last year
AGENDD / RWKV-ASR
View on GitHub
This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …
☆54Dec 23, 2024Updated last year
season-studio / MeloTTS-ONNX
View on GitHub
An implementation of MeloTTS by onnxruntime
☆30Oct 27, 2024Updated last year
lovemefan / paraformer-python
View on GitHub
paraformer(chinense asr) online onnx runtime for python
☆54Mar 27, 2024Updated 2 years ago
SanHacks / AiGen
View on GitHub
Multi Model Personal Assistant Wrapper in Go: Interact with ChatGPT, Claude or Ollama Cross Platform (Speech & Image generation supported…
☆16Updated this week
RemSynch / SenseVoice-Real-Time
View on GitHub
简单实现VAD+声纹锁+SenseVoice完成类语音实时转录的小项目
☆42Sep 23, 2024Updated last year
k2-fsa / sherpa-ncnn
View on GitHub
Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, …
☆1,760Oct 20, 2025Updated 9 months ago