pipecat-ai/smart-turn

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/pipecat-ai/smart-turn)

pipecat-ai / smart-turn

☆1,488

Alternatives and similar repositories for smart-turn

Users that are interested in smart-turn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

TEN-framework / ten-turn-detection
View on GitHub
Turn detection for full-duplex dialogue communication
☆597Dec 26, 2025Updated 7 months ago
pipecat-ai / pipecat-flows
View on GitHub
Open source conversation framework for structured Pipecat dialogues
☆620Jul 5, 2026Updated 3 weeks ago
ASLP-lab / Easy-Turn
View on GitHub
Open-Source Turn-Taking Detection Model and Dataset for Full-Duplex Spoken Dialogue Systems
☆122Jan 25, 2026Updated 6 months ago
pipecat-ai / pipecat
View on GitHub
Open Source framework for voice and multimodal conversational AI
☆13,774Updated this week
snakers4 / silero-vad
View on GitHub
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
☆9,764Jul 16, 2026Updated last week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
latishab / turnsense
View on GitHub
A lightweight end-of-utterance detection model fine-tuned on SmolLM2-135M, optimized for Raspberry Pi and low-power devices.
☆60Mar 20, 2026Updated 4 months ago
TEN-framework / ten-vad
View on GitHub
Voice Activity Detector (VAD) : low-latency, high-performance and lightweight
☆2,210Feb 2, 2026Updated 5 months ago
FireRedTeam / FireRedVAD
View on GitHub
A SOTA Industrial-Grade Voice Activity Detection & Audio Event Detection, supporting 100+ languages, outperforming Silero-VAD, TEN-VAD, F…
☆472May 6, 2026Updated 2 months ago
abb128 / turndetection
View on GitHub
☆21Mar 7, 2025Updated last year
kyutai-labs / moshi
View on GitHub
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…
☆10,747May 16, 2026Updated 2 months ago
kyutai-labs / delayed-streams-modeling
View on GitHub
Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.
☆2,991Jan 26, 2026Updated 6 months ago
k2-fsa / ZipVoice
View on GitHub
Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching
☆1,021Dec 2, 2025Updated 7 months ago
pipecat-ai / whisker
View on GitHub
A low-level Pipecat debugger.
☆128Jul 20, 2026Updated last week
canopyai / Orpheus-TTS
View on GitHub
Towards Human-Sounding Speech
☆6,264Dec 5, 2025Updated 7 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
xingchensong / FlashCosyVoice
View on GitHub
FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice.
☆250Feb 25, 2026Updated 5 months ago
fixie-ai / ultravox
View on GitHub
A fast multimodal LLM for real-time voice
☆4,499Dec 12, 2025Updated 7 months ago
pipecat-ai / pipecat-client-web
View on GitHub
Real-Time Voice Inference Web SDK
☆321Updated this week
Soul-AILab / SoulX-Duplug
View on GitHub
Plug-and-play streaming semantic VAD for real-time full-duplex spoken dialogue systems.
☆278Jul 17, 2026Updated last week
huggingface / speech-to-speech
View on GitHub
Build local voice agents with open-source models
☆7,223Updated this week
livekit / agents
View on GitHub
A framework for building realtime voice AI agents 🤖🎙️📹
☆11,538Updated this week
wenet-e2e / west
View on GitHub
We Speech Toolkit, LLM based Speech Toolkit for Speech Understanding, Generation, and Interaction
☆206Jul 17, 2026Updated last week
stepfun-ai / Step-Audio2
View on GitHub
Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation…
☆1,485Mar 16, 2026Updated 4 months ago
vogent / vogent-turn
View on GitHub
Vogent Turn: fast, open-source turn-detection for Voice AI applications
☆53Oct 28, 2025Updated 9 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
SesameAILabs / csm
View on GitHub
A Conversational Speech Generation Model
☆14,696May 27, 2025Updated last year
xcc-zach / xtalk
View on GitHub
X-Talk is an open-source full-duplex cascaded spoken dialogue system framework enabling low-latency, interruptible, and human-like speech…
☆233Updated this week
videosdk-live / NAMO-Turn-Detector-v1
View on GitHub
High-performance, semantic turn detection for conversational AI
☆44Oct 1, 2025Updated 9 months ago
kyutai-labs / moshi-finetune
View on GitHub
☆475Oct 3, 2025Updated 9 months ago
Marvis-Labs / marvis-tts
View on GitHub
☆365Aug 28, 2025Updated 11 months ago
modelscope / ClearerVoice-Studio
View on GitHub
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Spe…
☆4,343Aug 14, 2025Updated 11 months ago
yl4579 / StyleTTS2
View on GitHub
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
☆6,320Aug 10, 2024Updated last year
MatthewCYM / VoiceBench
View on GitHub
[TACL'26] VoiceBench: Benchmarking LLM-Based Voice Assistants
☆378Jun 11, 2026Updated last month
VITA-MLLM / Freeze-Omni
View on GitHub
✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM
☆388May 27, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
pipecat-ai / docs
View on GitHub
Official Pipecat docs repo
☆38Updated this week
DanielLin94144 / Full-Duplex-Bench
View on GitHub
A Benchmark for Evaluating Turn-Taking and Overlap Handling in Full-Duplex Spoken Dialogue Models
☆245May 20, 2026Updated 2 months ago
Vyvo-Labs / VyvoTTS
View on GitHub
VyvoTTS: LLM-Based Text-to-Speech Training Framework
☆257Apr 8, 2026Updated 3 months ago
kwindla / aiewf-eval
View on GitHub
A long-context eval
☆144Jun 15, 2026Updated last month
moonshine-ai / moonshine
View on GitHub
Very low latency speech to text, intent recognition, and text to speech, for building voice agents and interfaces
☆10,504Updated this week
Standard-Intelligence / hertz-dev
View on GitHub
first base model for full-duplex conversational audio
☆1,794Jan 5, 2025Updated last year
maitrix-org / Voila
View on GitHub
☆496May 6, 2025Updated last year