ErikEkstedt / TurnGPTView external linksLinks
TurnGPT: a Transformer-based Language Model for Predicting Turn-taking in Spoken Dialog
☆64May 18, 2024Updated last year
Alternatives and similar repositories for TurnGPT
Users that are interested in TurnGPT are comparing it to the libraries listed below
Sorting:
- Datasets for turn-taking research☆17Dec 21, 2023Updated 2 years ago
- Voice Activity Projection Models: Self-supervised learning of Turn-taking Events☆93May 29, 2024Updated last year
- vad☆25Apr 3, 2023Updated 2 years ago
- ☆14Feb 9, 2023Updated 3 years ago
- ☆17Apr 12, 2021Updated 4 years ago
- A real-time implementation of Voice Activity Projection (VAP) is aimed at controlling behaviors of spoken dialogue systems, such as turn-…☆93Jul 24, 2025Updated 6 months ago
- ☆15Aug 19, 2023Updated 2 years ago
- Joint speech-language model - respond directly to audio!☆30May 13, 2024Updated last year
- This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language M…☆20Jan 3, 2023Updated 3 years ago
- ☆43Aug 17, 2024Updated last year
- Crowdsourced and Automatic Speech Prominence Estimation☆24Apr 12, 2024Updated last year
- Awesome paper lists for "A Desideratum for Conversational Agents: Capabilities, Challenges, and Future Directions""☆30Apr 25, 2025Updated 9 months ago
- The official code for the SALMon🍣 benchmark (ICASSP 2025 - Oral)☆48Aug 15, 2025Updated 6 months ago
- [ACL 2025] Predicting Turn-Taking and Backchannel in Human-Machine Conversations Using Linguistic, Acoustic, and Visual Signals☆30Aug 11, 2025Updated 6 months ago
- ☆49Nov 24, 2022Updated 3 years ago
- ☆22Jan 17, 2025Updated last year
- The official codebase of FineAction dataset. We will update the data and code of our FineAction.☆22Apr 10, 2025Updated 10 months ago
- Speaker change detection using SincNet and an LSTM/Transformer☆56May 26, 2025Updated 8 months ago
- ☆23Apr 18, 2022Updated 3 years ago
- KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci…☆24Nov 15, 2023Updated 2 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆26Sep 23, 2020Updated 5 years ago
- This repository contains the baseline system for CHiME-8 MMCSG challenge focusing on transcribing both sides of a conversation where one …☆40Mar 13, 2024Updated last year
- TASU: A New Style of Alignment of Speech LLM with only Text Training Data, zero-shot on ASR and Other SU tasks☆21Jan 19, 2026Updated 3 weeks ago
- SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems☆85Jan 9, 2024Updated 2 years ago
- Predict and recommend the news articles, user is most likely to click in real time.☆32Apr 3, 2018Updated 7 years ago
- Online hotel booking system.☆11Apr 15, 2013Updated 12 years ago
- Some tools I was used in my project☆12Jun 20, 2017Updated 8 years ago
- Lifelong machine learning is a novel machine learning paradigm which continually learns tasks and accumulates knowledge for reusing. The …☆10Sep 15, 2019Updated 6 years ago
- end-to-end voicebot that answers open domain questions.☆10Oct 23, 2021Updated 4 years ago
- Project Gold ✨☆11Jan 29, 2026Updated 2 weeks ago
- Terrier's desktop search demo product☆13Aug 2, 2018Updated 7 years ago
- Collaborative Discourse Manager☆11Nov 6, 2016Updated 9 years ago
- ☆10Oct 20, 2022Updated 3 years ago
- PicWish T2I, Photo Enhancer and Background Remover for Python☆24Jul 6, 2025Updated 7 months ago
- A Multi-Format Transfer Learning Model for Event Argument Extraction via Variational Information Bottleneck☆10Sep 9, 2022Updated 3 years ago
- A way to run Python Faded Parsons problems entirely in the browser.☆11Aug 30, 2023Updated 2 years ago
- Implementation of Siamese CBOW using keras whose backend is tensorflow.☆12Feb 2, 2023Updated 3 years ago
- ☆14May 20, 2025Updated 8 months ago
- node.js based CMS , built on top of grapejs framework☆12Sep 8, 2016Updated 9 years ago