TEN-framework/ten-turn-detection

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/TEN-framework/ten-turn-detection)

TEN-framework / ten-turn-detection

Turn detection for full-duplex dialogue communication

☆592

Alternatives and similar repositories for ten-turn-detection

Users that are interested in ten-turn-detection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

TEN-framework / ten-vad
View on GitHub
Voice Activity Detector (VAD) : low-latency, high-performance and lightweight
☆2,198Feb 2, 2026Updated 5 months ago
ASLP-lab / Easy-Turn
View on GitHub
Open-Source Turn-Taking Detection Model and Dataset for Full-Duplex Spoken Dialogue Systems
☆121Jan 25, 2026Updated 5 months ago
pipecat-ai / smart-turn
View on GitHub
☆1,475Jan 29, 2026Updated 5 months ago
TEN-framework / ten-framework
View on GitHub
Open-source framework for conversational voice AI agents
☆10,916Updated this week
Soul-AILab / SoulX-Duplug
View on GitHub
Plug-and-play streaming semantic VAD for real-time full-duplex spoken dialogue systems.
☆269Updated this week
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
wenet-e2e / west
View on GitHub
We Speech Toolkit, LLM based Speech Toolkit for Speech Understanding, Generation, and Interaction
☆206Updated this week
FireRedTeam / FireRedVAD
View on GitHub
A SOTA Industrial-Grade Voice Activity Detection & Audio Event Detection, supporting 100+ languages, outperforming Silero-VAD, TEN-VAD, F…
☆469May 6, 2026Updated 2 months ago
latishab / turnsense
View on GitHub
A lightweight end-of-utterance detection model fine-tuned on SmolLM2-135M, optimized for Raspberry Pi and low-power devices.
☆59Mar 20, 2026Updated 4 months ago
FireRedTeam / FireRedChat
View on GitHub
A Fully Self-Hosted Solution for Full-Duplex Voice Interaction
☆568Sep 28, 2025Updated 9 months ago
stepfun-ai / Step-Audio2
View on GitHub
Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation…
☆1,482Mar 16, 2026Updated 4 months ago
vogent / vogent-turn
View on GitHub
Vogent Turn: fast, open-source turn-detection for Voice AI applications
☆52Oct 28, 2025Updated 8 months ago
keating666 / yzcbbs
View on GitHub
A Knowledge Base on Pre-made Dishes
☆105Jul 6, 2026Updated 2 weeks ago
exercise-book-yq / Supercodec
View on GitHub
☆51Mar 5, 2026Updated 4 months ago
xcc-zach / xtalk
View on GitHub
X-Talk is an open-source full-duplex cascaded spoken dialogue system framework enabling low-latency, interruptible, and human-like speech…
☆226Updated this week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
snakers4 / silero-vad
View on GitHub
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
☆9,615Updated this week
videosdk-live / NAMO-Turn-Detector-v1
View on GitHub
High-performance, semantic turn detection for conversational AI
☆44Oct 1, 2025Updated 9 months ago
k2-fsa / ZipVoice
View on GitHub
Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching
☆1,014Dec 2, 2025Updated 7 months ago
xingchensong / FlashCosyVoice
View on GitHub
FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice.
☆250Feb 25, 2026Updated 4 months ago
XiaomiMiMo / MiMo-Audio
View on GitHub
MiMo-Audio: Audio Language Models are Few-Shot Learners
☆1,063Jun 17, 2026Updated last month
ex3ndr / supervoice-hybrid
View on GitHub
My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26Aug 5, 2024Updated last year
JusperLee / AudioTrust
View on GitHub
AudioTrust: Benchmarking the Multi-faceted Trustworthiness of Audio Large Language Models
☆215Jan 28, 2026Updated 5 months ago
Mddct / transformer-vocos
View on GitHub
☆35Sep 6, 2025Updated 10 months ago
ASLP-lab / OSUM
View on GitHub
OSUM & OSUM-EChat, open speech understanding model and empathetic spoken chatbot based on it, open-sourced by ASLP@NPU.
☆494Nov 23, 2025Updated 7 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
semanticVAD / testsets
View on GitHub
Testing sets for semanticVAD
☆20Feb 18, 2025Updated last year
xingchensong / TouchNet
View on GitHub
A native-PyTorch library for large scale M-LLM (text/audio) training with tp/cp/dp.
☆232Jul 2, 2026Updated 2 weeks ago
pengzhendong / torchfa
View on GitHub
Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.
☆61Sep 5, 2025Updated 10 months ago
xiaomi-research / tts-prism
View on GitHub
☆47Apr 27, 2026Updated 2 months ago
JusperLee / TIGER
View on GitHub
TIGER: Time-frequency Interleaved Gain Extraction and Reconstruction for Efficient Speech Separation
☆433Apr 20, 2026Updated 3 months ago
FireRedTeam / FireRedASR
View on GitHub
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR be…
☆1,936Feb 25, 2026Updated 4 months ago
inclusionAI / Ming-UniAudio
View on GitHub
Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing with Unified Representation
☆450Nov 27, 2025Updated 7 months ago
ASLP-lab / HumDial-FDBench
View on GitHub
The Full-Duplex Interaction Track of the ICASSP 2026 Human-like Spoken Dialogue Systems Challenge aims to advance the evaluation of full-…
☆36Apr 27, 2026Updated 2 months ago
suimuc / VIRES
View on GitHub
☆342Jul 4, 2025Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
DanielLin94144 / Full-Duplex-Bench
View on GitHub
A Benchmark for Evaluating Turn-Taking and Overlap Handling in Full-Duplex Spoken Dialogue Models
☆235May 20, 2026Updated 2 months ago
xkx-hub / KALL-E
View on GitHub
[AAAI 2026 oral] KALL-E:Autoregressive Speech Synthesis with Next-Distribution Prediction
☆41Sep 25, 2025Updated 9 months ago
greatInvoker / 2025-full-stack-tech-sharing
View on GitHub
2025技术分享（FullStack Frontend Focus），分享常用知识点。代码纯手打+AI验证，只做精品！！！
☆153Jul 2, 2025Updated last year
GenerTeam / GENERanno
View on GitHub
GENERanno: A Genomic Foundation Model for Metagenomic Annotation
☆314Jun 15, 2026Updated last month
AmphionTeam / Emilia-NV
View on GitHub
Official Repository of Paper: "Emilia-NV: A Non-Verbal Speech Dataset with Word-Level Annotation for Human-Like Speech Modeling"
☆91Sep 18, 2025Updated 10 months ago
ByteDance-Seed / EvaLearn
View on GitHub
EvaLearn is a pioneering benchmark designed to evaluate large language models (LLMs) on their learning capability and efficiency in chall…
☆431May 12, 2026Updated 2 months ago
gpt-omni / mini-omni
View on GitHub
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming…
☆3,562Nov 5, 2024Updated last year