Streaming ASR and TTS based on FastAPI+ sherpa-onnx
☆189Nov 2, 2025Updated 4 months ago
Alternatives and similar repositories for voiceapi
Users that are interested in voiceapi are comparing it to the libraries listed below
Sorting:
- Pseudo Streaming SenseVoice with Hotwords☆429Mar 13, 2025Updated 11 months ago
- 自用,语音到文本用的sencevoice,llm部分基于ollama的API调用,文本到语音用的cosyvoice,实时语音输入参考的https://github.com/ABexit/ASR-LLM-TTS。☆12Dec 26, 2024Updated last year
- Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime…☆10,526Updated this week
- Silero VAD(ncnn): pre-trained enterprise-grade Voice Activity Detector.☆24Aug 21, 2024Updated last year
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆129Apr 26, 2023Updated 2 years ago
- CTC decoder with hotwords for ASR.☆34Apr 13, 2025Updated 10 months ago
- 这是基于FunASR实现的区分说话人语音识别API | This is a speaker-diarization-based speech recognition API implemented using FunASR.☆23Feb 12, 2026Updated 3 weeks ago
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …☆54Dec 23, 2024Updated last year
- Multi Model Personal Assistant Wrapper in Go: Interact with ChatGPT, Claude or Ollama Cross Platform (Speech & Image generation supported…☆16Jan 31, 2026Updated last month
- funasr语音转文字的简单api版本,funasr+fastapi,方便部署在服务器上☆13Aug 10, 2024Updated last year
- Port of Funasr's Sense-voice model in C/C++☆522Dec 19, 2025Updated 2 months ago
- ☆30Jun 12, 2025Updated 8 months ago
- 来自于文章Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition☆27Nov 20, 2024Updated last year
- API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,…☆538Oct 23, 2024Updated last year
- Running the F5-TTS by ONNX Runtime standalone with GUI☆24Dec 10, 2024Updated last year
- ☆23Oct 17, 2024Updated last year
- LLM-powered chatbot app to enhance accessibility to knowledge contained within PDFs☆15May 6, 2025Updated 10 months ago
- ☆18Feb 16, 2025Updated last year
- 用于SenseVoice的api项目,输出带时间戳字幕☆42Oct 28, 2024Updated last year
- A toolkit dedicate for speech evaluation.☆23Sep 26, 2024Updated last year
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆109Oct 6, 2025Updated 5 months ago
- 端到端语音唤醒工具箱,从模型训练到模型推理。☆155Aug 9, 2025Updated 6 months ago
- Simple Persian CAPTCHA generator☆11Feb 17, 2025Updated last year
- A python library which simplifies creating and exporting videos.☆11Oct 1, 2023Updated 2 years ago
- acnn for text-independent speaker recognition☆10Feb 8, 2022Updated 4 years ago
- ☆11Sep 1, 2024Updated last year
- pyrogram client bot project☆10Jun 12, 2022Updated 3 years ago
- Asynchronous Nekobin API Wrapper for Python3☆11Jul 23, 2023Updated 2 years ago
- 🤖🧭Creates google-like navigation menu using python-telegram-bot wrapper☆10Apr 8, 2022Updated 3 years ago
- Telegram Inline Files Search Bot made using JS by @Kunal-Diwan☆10Nov 9, 2021Updated 4 years ago
- A simple Telegram bot to download webpages as PDF☆11Feb 9, 2024Updated 2 years ago
- Create and manage advanced polls with this Telegram Bot which has many features available!☆10Feb 9, 2026Updated 3 weeks ago
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- pytorch+bert实现的意图识别与槽位填充☆11May 30, 2023Updated 2 years ago
- Example python scripts to evaluate various ASR methods☆11Dec 22, 2021Updated 4 years ago
- A Python client for https://Yun.ir URL shortener API.☆13Mar 24, 2025Updated 11 months ago
- ☆10Jul 9, 2025Updated 7 months ago
- Target speaker automatic speech recognition (TS-ASR)☆12Oct 14, 2023Updated 2 years ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Sep 30, 2024Updated last year