Pseudo Streaming SenseVoice with Hotwords
☆444Mar 13, 2025Updated last year
Alternatives and similar repositories for streaming-sensevoice
Users that are interested in streaming-sensevoice are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Port of Funasr's Sense-voice model in C/C++☆542Dec 19, 2025Updated 4 months ago
- CTC decoder with hotwords for ASR.☆35Apr 13, 2025Updated last year
- ☆23Oct 17, 2024Updated last year
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- Multilingual Voice Understanding Model☆7,957Dec 30, 2025Updated 3 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,…☆539Oct 23, 2024Updated last year
- faster inference☆28Jan 20, 2025Updated last year
- 使用vllm加速cosyvoice2的推理☆491Apr 26, 2025Updated 11 months ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆112Oct 6, 2025Updated 6 months ago
- Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR be…☆1,845Feb 25, 2026Updated last month
- ☆23Oct 30, 2024Updated last year
- One command to build TLG.fst for WeNet.☆30Oct 11, 2022Updated 3 years ago
- Streaming Vocos☆30Jun 10, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆134Apr 26, 2023Updated 2 years ago
- We Speech Transcript based on LLM, in 300 lines of code.☆185Jun 20, 2025Updated 9 months ago
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯☆890Dec 10, 2025Updated 4 months ago
- Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆202Nov 2, 2025Updated 5 months ago
- Python Wrapper of Silero VAD☆64May 8, 2025Updated 11 months ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆114Dec 2, 2025Updated 4 months ago
- Python runtime for WeTextProcessing (does not depend on Pynini)☆49Nov 28, 2025Updated 4 months ago
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity…☆15,643Mar 17, 2026Updated last month
- paraformer(chinense asr) online onnx runtime for python☆54Mar 27, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Compute WER and SER for speech recognition evaluation☆27Mar 18, 2026Updated last month
- Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime…☆11,658Updated this week
- Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆61Sep 5, 2025Updated 7 months ago
- low-latency realtime ASR based on FireRedASR☆60Jul 8, 2025Updated 9 months ago
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated last year
- A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization☆2,884Dec 8, 2025Updated 4 months ago
- noise reduction☆17Jul 3, 2024Updated last year
- Text Normalization & Inverse Text Normalization☆749Feb 27, 2026Updated last month
- GLM-4-Voice | 端到端中英语音对话模型☆3,176Dec 5, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆111Aug 16, 2024Updated last year
- 端到端语音唤醒工具箱,从模型训练到模型推理。☆157Aug 9, 2025Updated 8 months ago
- Text-To-Speech for NotebookLM☆39Jul 20, 2025Updated 8 months ago
- Streaming Text to Speech Web UI☆22May 6, 2024Updated last year
- ☆204Sep 24, 2024Updated last year
- ☆36Sep 6, 2025Updated 7 months ago
- MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction mode…☆218Jan 8, 2025Updated last year