pengzhendong / streaming-sensevoiceView external linksLinks
Pseudo Streaming SenseVoice with Hotwords
☆429Mar 13, 2025Updated 11 months ago
Alternatives and similar repositories for streaming-sensevoice
Users that are interested in streaming-sensevoice are comparing it to the libraries listed below
Sorting:
- Port of Funasr's Sense-voice model in C/C++☆516Dec 19, 2025Updated last month
- CTC decoder with hotwords for ASR.☆34Apr 13, 2025Updated 10 months ago
- ☆23Oct 17, 2024Updated last year
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- Multilingual Voice Understanding Model☆7,497Dec 30, 2025Updated last month
- API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,…☆539Oct 23, 2024Updated last year
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆109Oct 6, 2025Updated 4 months ago
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- 使用vllm加速cosyvoice2的推理☆482Apr 26, 2025Updated 9 months ago
- faster inference☆28Jan 20, 2025Updated last year
- Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR be…☆1,766Updated this week
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆128Apr 26, 2023Updated 2 years ago
- One command to build TLG.fst for WeNet.☆30Oct 11, 2022Updated 3 years ago
- Python Wrapper of Silero VAD☆64May 8, 2025Updated 9 months ago
- We Speech Transcript based on LLM, in 300 lines of code.☆183Jun 20, 2025Updated 7 months ago
- Python runtime for WeTextProcessing (does not depend on Pynini)☆48Nov 28, 2025Updated 2 months ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆114Dec 2, 2025Updated 2 months ago
- Open-Source Turn-Taking Detection Model and Dataset for Full-Duplex Spoken Dialogue Systems☆75Jan 25, 2026Updated 3 weeks ago
- ☆23Oct 30, 2024Updated last year
- Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆188Nov 2, 2025Updated 3 months ago
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯☆887Dec 10, 2025Updated 2 months ago
- noise reduction☆17Jul 3, 2024Updated last year
- 端到端语音唤醒工具箱,从模型训练到模型推理。☆154Aug 9, 2025Updated 6 months ago
- Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆62Sep 5, 2025Updated 5 months ago
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated 10 months ago
- paraformer(chinense asr) online onnx runtime for python☆53Mar 27, 2024Updated last year
- Streaming Vocos☆29Jun 10, 2025Updated 8 months ago
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity…☆14,891Feb 4, 2026Updated last week
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆109Aug 16, 2024Updated last year
- ☆204Sep 24, 2024Updated last year
- ☆29Feb 4, 2025Updated last year
- A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization☆2,778Dec 8, 2025Updated 2 months ago
- Streaming Text to Speech Web UI☆22May 6, 2024Updated last year
- Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime…☆10,344Updated this week
- Text-To-Speech for NotebookLM☆39Jul 20, 2025Updated 6 months ago
- Colab notebooks for Next-gen Kaldi☆29Oct 12, 2025Updated 4 months ago
- Text Normalization & Inverse Text Normalization☆726Feb 3, 2026Updated 2 weeks ago
- A native-PyTorch library for large scale M-LLM (text/audio) training with tp/cp/dp.☆224Aug 6, 2025Updated 6 months ago
- ✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM☆365May 27, 2025Updated 8 months ago