latishab/turnsense

latishab / turnsense

A lightweight end-of-utterance detection model fine-tuned on SmolLM2-135M, optimized for Raspberry Pi and low-power devices.

☆60

Alternatives and similar repositories for turnsense

Users who are interested in turnsense are comparing it to the repositories listed below.

aaronng91 / semantic-turn-detection
Script to demonstrate how to use a Language Model for Semantic Turn Detection. Refer to blog post for full details.
☆19 · May 9, 2025 · Updated last year
ASLP-lab / Easy-Turn
Open-Source Turn-Taking Detection Model and Dataset for Full-Duplex Spoken Dialogue Systems
☆122 · Jan 25, 2026 · Updated 6 months ago
abb128 / turndetection
☆21 · Mar 7, 2025 · Updated last year
vogent / vogent-turn
Vogent Turn: fast, open-source turn-detection for Voice AI applications
☆53 · Oct 28, 2025 · Updated 9 months ago
videosdk-live / NAMO-Turn-Detector-v1
High-performance, semantic turn detection for conversational AI
☆44 · Oct 1, 2025 · Updated 9 months ago
dangvansam / livekit-plugins-tenvad
TEN VAD low-latency voice activity detection for real-time streaming, integrated with livekit-agents
☆26 · Nov 13, 2025 · Updated 8 months ago
daanzu / py-silero-vad-lite
Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no Python package dependencies
☆17 · Nov 25, 2024 · Updated last year
Picovoice / voice-activity-benchmark
Voice activity engine benchmark framework
☆23 · Jan 14, 2026 · Updated 6 months ago
MiuLab / Lattice-Transformer-SLU
Source code for ASRU 2019 paper "Adapting Pretrained Transformer to Lattices for Spoken Language Understanding"
☆10 · Jul 8, 2020 · Updated 6 years ago
TEN-framework / ten-turn-detection
Turn detection for full-duplex dialogue communication
☆597 · Dec 26, 2025 · Updated 7 months ago
BUTSpeechFIT / SOT-DiCoW
Multi-talker ASR based on DiCoW with Serialized Output Training
☆21 · Sep 18, 2025 · Updated 10 months ago
MuyeHuang / DuplexOmni
☆44 · Updated this week
Jazzcharles / AuroLA
☆28 · Feb 23, 2026 · Updated 5 months ago
FireRedTeam / FireRedVAD
A SOTA Industrial-Grade Voice Activity Detection & Audio Event Detection, supporting 100+ languages, outperforming Silero-VAD, TEN-VAD, F…
☆472 · May 6, 2026 · Updated 2 months ago
mtreviso / deepbond
Deep neural approach to Boundary and Disfluency Detection - Based on my Master's work
☆20 · Jul 25, 2024 · Updated 2 years ago
modal-projects / modal-nvidia-asr
☆42 · Mar 31, 2026 · Updated 3 months ago
ASLP-lab / Hum-Dial
ICASSP2026 HumDial Challenge
☆51 · May 28, 2026 · Updated 2 months ago
yxduir / m2m-70
☆18 · Jun 25, 2026 · Updated last month
khfs / DuplexMamba
☆18 · Mar 6, 2026 · Updated 4 months ago
MaikeZuefle / f-actor
☆28 · Jul 17, 2026 · Updated last week
FireRedTeam / FireRedChat
A Fully Self-Hosted Solution for Full-Duplex Voice Interaction
☆571 · Sep 28, 2025 · Updated 10 months ago
fanlu / wenet
Transformer based ASR Engine.
☆13 · Aug 23, 2021 · Updated 4 years ago
soonsoon2 / copilot-dev-day-ewha-king
GitHub Copilot Dev Day × Ewha Womans University KING game club | April 13, 2025 (Mon) 19:00~21:00 | Microsoft Seoul Gwanghwamun office
☆26 · Apr 6, 2026 · Updated 3 months ago
DanielLin94144 / Full-Duplex-Bench
A Benchmark for Evaluating Turn-Taking and Overlap Handling in Full-Duplex Spoken Dialogue Models
☆245 · May 20, 2026 · Updated 2 months ago
Soul-AILab / SoulX-Duplug
Plug-and-play streaming semantic VAD for real-time full-duplex spoken dialogue systems.
☆278 · Jul 17, 2026 · Updated last week
renyuanL / ry-Speech-commands
☆19 · Jan 5, 2020 · Updated 6 years ago
isadrtdinov / kws-attention
Attention-based model for keywords spotting
☆19 · Aug 9, 2021 · Updated 4 years ago
Kirili4ik / kws-attention-pytorch
Keyword spotting for audio with attention (KWS model for audio)
☆18 · Jul 15, 2021 · Updated 5 years ago
ddxsg24 / Personalized-Speech-Enhancement
ASLP Summer Inter@NPU
☆13 · Jul 30, 2024 · Updated last year
RTE-Dev / Conversational-AI-for-the-Curious
Conversational AI cookbook for developers: exploring real-time voice agents, streaming, orchestration, and engineering practice.
☆23 · Nov 13, 2025 · Updated 8 months ago
xcc-zach / xtalk
X-Talk is an open-source full-duplex cascaded spoken dialogue system framework enabling low-latency, interruptible, and human-like speech…
☆233 · Updated this week
yu-haoyuan / fd-badcat
fd-sds
☆21 · Apr 8, 2026 · Updated 3 months ago
SalesforceAIResearch / enterprise-realtime-voice-agent
☆47 · Jun 2, 2026 · Updated last month
ASLP-lab / FastTurn
☆35 · May 19, 2026 · Updated 2 months ago
Alittleegg / Eureka-Audio
Eureka-Audio: A 1.7B lightweight audio–language model that matches 7B–30B models on ASR, audio understanding, and paralinguistic reasonin…
☆40 · Apr 11, 2026 · Updated 3 months ago
zxzhao0 / C2SER
We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…
☆49 · Mar 3, 2025 · Updated last year
Picovoice / cobra
On-device voice activity detection (VAD) powered by deep learning
☆266 · Updated this week
mbzuai-oryx / LLMVoX
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM
☆308 · May 16, 2025 · Updated last year
tzyll / KeSpeech
The repo provides information about KeSpeech Mandarin dialect dataset.
☆184 · Oct 13, 2022 · Updated 3 years ago