Pseudo Streaming SenseVoice with Hotwords
☆449Mar 13, 2025Updated last year
Alternatives and similar repositories for streaming-sensevoice
Users that are interested in streaming-sensevoice are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Port of Funasr's Sense-voice model in C/C++☆550Dec 19, 2025Updated 5 months ago
- CTC decoder with hotwords for ASR.☆36Apr 13, 2025Updated last year
- ☆23Oct 17, 2024Updated last year
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- Multilingual Voice Understanding Model☆8,216May 19, 2026Updated last week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,…☆540Oct 23, 2024Updated last year
- faster inference☆28Jan 20, 2025Updated last year
- 使用vllm加速cosyvoice2的推理☆494Apr 26, 2025Updated last year
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆111Oct 6, 2025Updated 7 months ago
- Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR be…☆1,887Feb 25, 2026Updated 3 months ago
- ☆23Oct 30, 2024Updated last year
- One command to build TLG.fst for WeNet.☆30Oct 11, 2022Updated 3 years ago
- Streaming Vocos☆31Jun 10, 2025Updated 11 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆136Apr 26, 2023Updated 3 years ago
- We Speech Transcript based on LLM, in 300 lines of code.☆184Jun 20, 2025Updated 11 months ago
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯☆893Dec 10, 2025Updated 5 months ago
- Python Wrapper of Silero VAD☆64May 8, 2025Updated last year
- Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆209Nov 2, 2025Updated 6 months ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆115Dec 2, 2025Updated 5 months ago
- Python runtime for WeTextProcessing (does not depend on Pynini)☆51Nov 28, 2025Updated 6 months ago
- Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-…☆16,264Updated this week
- paraformer(chinense asr) online onnx runtime for python☆54Mar 27, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Compute WER and SER for speech recognition evaluation☆26Mar 18, 2026Updated 2 months ago
- Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆61Sep 5, 2025Updated 8 months ago
- Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime…☆12,416May 20, 2026Updated last week
- low-latency realtime ASR based on FireRedASR☆61Jul 8, 2025Updated 10 months ago
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated last year
- A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization☆2,954Dec 8, 2025Updated 5 months ago
- noise reduction☆17Jul 3, 2024Updated last year
- Text Normalization & Inverse Text Normalization☆774Feb 27, 2026Updated 3 months ago
- GLM-4-Voice | 端到端中英语音对话模型☆3,175Dec 5, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆115Aug 16, 2024Updated last year
- 端到端语音唤醒工具箱,从模型训练到模型推理。☆158Aug 9, 2025Updated 9 months ago
- Text-To-Speech for NotebookLM☆39Jul 20, 2025Updated 10 months ago
- Streaming Text to Speech Web UI☆22May 6, 2024Updated 2 years ago
- ☆203Sep 24, 2024Updated last year
- ☆36Sep 6, 2025Updated 8 months ago
- MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction mode…☆219Jan 8, 2025Updated last year