Pseudo Streaming SenseVoice with Hotwords
☆434Mar 13, 2025Updated 11 months ago
Alternatives and similar repositories for streaming-sensevoice
Users that are interested in streaming-sensevoice are comparing it to the libraries listed below
Sorting:
- Port of Funasr's Sense-voice model in C/C++☆522Dec 19, 2025Updated 2 months ago
- CTC decoder with hotwords for ASR.☆34Apr 13, 2025Updated 10 months ago
- ☆23Oct 17, 2024Updated last year
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- Multilingual Voice Understanding Model☆7,669Dec 30, 2025Updated 2 months ago
- API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,…☆538Oct 23, 2024Updated last year
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆109Oct 6, 2025Updated 5 months ago
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- 使用vllm加速cosyvoice2的推理☆486Apr 26, 2025Updated 10 months ago
- faster inference☆28Jan 20, 2025Updated last year
- Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR be…☆1,788Feb 25, 2026Updated last week
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆129Apr 26, 2023Updated 2 years ago
- One command to build TLG.fst for WeNet.☆30Oct 11, 2022Updated 3 years ago
- Python Wrapper of Silero VAD☆64May 8, 2025Updated 10 months ago
- We Speech Transcript based on LLM, in 300 lines of code.☆185Jun 20, 2025Updated 8 months ago
- Python runtime for WeTextProcessing (does not depend on Pynini)☆48Nov 28, 2025Updated 3 months ago
- Streaming Vocos☆30Jun 10, 2025Updated 8 months ago
- Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆189Nov 2, 2025Updated 4 months ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆114Dec 2, 2025Updated 3 months ago
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯☆885Dec 10, 2025Updated 3 months ago
- ☆23Oct 30, 2024Updated last year
- 端到端语音唤醒工具箱,从模型训练到模型推理。☆155Aug 9, 2025Updated 7 months ago
- Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆62Sep 5, 2025Updated 6 months ago
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated 11 months ago
- Open-Source Turn-Taking Detection Model and Dataset for Full-Duplex Spoken Dialogue Systems☆82Jan 25, 2026Updated last month
- paraformer(chinense asr) online onnx runtime for python☆53Mar 27, 2024Updated last year
- noise reduction☆17Jul 3, 2024Updated last year
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity…☆15,036Feb 28, 2026Updated last week
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆109Aug 16, 2024Updated last year
- ☆205Sep 24, 2024Updated last year
- ☆29Feb 4, 2025Updated last year
- A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization☆2,811Dec 8, 2025Updated 3 months ago
- Streaming Text to Speech Web UI☆22May 6, 2024Updated last year
- Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime…☆10,662Updated this week
- Text-To-Speech for NotebookLM☆39Jul 20, 2025Updated 7 months ago
- Text Normalization & Inverse Text Normalization☆727Feb 27, 2026Updated last week
- A native-PyTorch library for large scale M-LLM (text/audio) training with tp/cp/dp.☆225Aug 6, 2025Updated 7 months ago
- ✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM☆368May 27, 2025Updated 9 months ago
- Colab notebooks for Next-gen Kaldi☆31Oct 12, 2025Updated 4 months ago