TEN-framework / ten-vadLinks
Voice Activity Detector(VAD) from TEN: low-latency, high-performance and lightweight
☆971Updated this week
Alternatives and similar repositories for ten-vad
Users that are interested in ten-vad are comparing it to the libraries listed below
Sorting:
- Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.☆554Updated last month
- ☆441Updated 2 months ago
- ☆426Updated 2 months ago
- OSUM: Open Speech Understanding Model, open-sourced by ASLP@NPU.☆376Updated 2 weeks ago
- Open source inference code for Rev's model☆412Updated 2 months ago
- A toolkit for speaker diarization.☆228Updated 3 weeks ago
- GPT-4o-level, real-time spoken dialogue system.☆345Updated 5 months ago
- Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR be…☆1,154Updated 3 months ago
- We Speech Transcript based on LLM, in 300 lines of code.☆171Updated last month
- Port of Funasr's Sense-voice model in C/C++☆398Updated 3 weeks ago
- Pseudo Streaming SenseVoice with Hotwords☆311Updated 4 months ago
- ☆274Updated 3 months ago
- MOSS-TTSD is a spoken dialogue generation model that enables expressive dialogue speech synthesis in both Chinese and English, supporting…☆430Updated last week
- Interface for OuteTTS models.☆1,335Updated 3 weeks ago
- ☆165Updated 7 months ago
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯☆854Updated 4 months ago
- StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.☆1,116Updated 3 weeks ago
- Local SRT/LLM/TTS Voicechat☆696Updated 9 months ago
- 使用vllm加速cosyvoice2的推理☆370Updated 2 months ago
- LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆588Updated 3 months ago
- Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".☆915Updated 8 months ago
- The world’s first real-time, distributed, cloud-edge collaborative multimodal AI Agent Framework that simultaneously supports C/C++/Go/Py…☆5Updated last month
- ☆201Updated 9 months ago
- A Fast TTS Engine☆526Updated 5 months ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆773Updated last month
- Speech-to-text server framework with next-gen Kaldi☆741Updated this week
- ☆505Updated 3 weeks ago
- ☆734Updated last year
- Whisper with Medusa heads☆849Updated last week
- ☆787Updated this week