Streaming ASR and TTS based on FastAPI + sherpa-onnx
☆202 Nov 2, 2025 Updated 5 months ago
Alternatives and similar repositories for voiceapi
Users that are interested in voiceapi are comparing it to the libraries listed below.
- An on-device real-time speech recognition app based on Sherpa-ONNX that downloads its models online. ☆28 Feb 27, 2025 Updated last year
- Pseudo Streaming SenseVoice with Hotwords ☆443 Mar 13, 2025 Updated last year
- Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime… ☆11,483 Updated this week
- A speech recognition API with speaker diarization, implemented using FunASR. ☆24 Feb 12, 2026 Updated 2 months ago
- An enterprise-grade Voice Activity Detector from modelscope and funasr. ☆133 Apr 26, 2023 Updated 2 years ago
- A simple API version of FunASR speech-to-text (funasr + fastapi), easy to deploy on a server. ☆13 Aug 10, 2024 Updated last year
- CTC decoder with hotwords for ASR. ☆35 Apr 13, 2025 Updated last year
- A modular, fully offline-capable, low-resource-usage conversational robot / smart speaker. ☆148 Mar 25, 2026 Updated 3 weeks ago
- Port of Funasr's Sense-voice model in C/C++ ☆542 Dec 19, 2025 Updated 3 months ago
- Silero VAD (ncnn): pre-trained enterprise-grade Voice Activity Detector. ☆25 Aug 21, 2024 Updated last year
- Running F5-TTS standalone with ONNX Runtime, with a GUI ☆24 Dec 10, 2024 Updated last year
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the … ☆54 Dec 23, 2024 Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translation ☆22 Nov 4, 2024 Updated last year
- API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,… ☆539 Oct 23, 2024 Updated last year
- An implementation of MeloTTS with onnxruntime ☆29 Oct 27, 2024 Updated last year
- Multi Model Personal Assistant Wrapper in Go: Interact with ChatGPT, Claude or Ollama Cross Platform (Speech & Image generation supported… ☆16 Mar 30, 2026 Updated 2 weeks ago
- Some ncnn demos of FunASR ☆28 Sep 23, 2024 Updated last year
- Code for the paper: How Much Context Does My Attention-Based ASR System Need? ☆11 Mar 8, 2026 Updated last month
- ☆10 May 5, 2025 Updated 11 months ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition ☆12 Mar 14, 2025 Updated last year
- A high-performance speech recognition service based on Sherpa-ONNX, supporting real-time VAD (voice activity detection), multilingual speech recognition, and voiceprint recognition. ☆93 Jan 4, 2026 Updated 3 months ago
- ☆30 Jun 12, 2025 Updated 10 months ago
- 百聆 is a GPT-4o-like voice conversation robot implemented with ASR+LLM+TTS. It integrates strong large models such as DeepSeek R1, connects to openClaw, and acts as a true personal voice assistant, with latency as low as 800 ms. It runs on low-spec machines such as a Mac and supports interruption. ☆1,666 Apr 6, 2026 Updated last week
- From the paper "Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition" ☆27 Nov 20, 2024 Updated last year
- Paraformer (Chinese ASR) online ONNX runtime for Python ☆54 Mar 27, 2024 Updated 2 years ago
- ☆25 Mar 8, 2026 Updated last month
- Data manipulation and transformation for audio signal processing, powered by PyTorch ☆11 Sep 30, 2024 Updated last year
- Simple Persian CAPTCHA generator ☆11 Feb 17, 2025 Updated last year
- An API project for SenseVoice that outputs timestamped subtitles. ☆43 Oct 28, 2024 Updated last year
- Compute WER and SER for speech recognition evaluation ☆27 Mar 18, 2026 Updated 3 weeks ago
- ☆44 Jan 20, 2025 Updated last year
- A repo for building just the audio processing module of WebRTC using CMake ☆12 Oct 18, 2024 Updated last year
- Paraformer web server built with Sanic ☆28 May 3, 2023 Updated 2 years ago
- ☆23 Jul 17, 2024 Updated last year
- This is a web-based intelligent dialogue program built using ASR, LLM, and TTS. ☆24 Dec 3, 2024 Updated last year
- ☆50 Nov 26, 2023 Updated 2 years ago
- One command to start a streaming ASR server. ☆12 Oct 2, 2024 Updated last year
- ☆10 Jul 9, 2025 Updated 9 months ago
- ☆30 Feb 4, 2025 Updated last year