[ACL 2025] Predicting Turn-Taking and Backchannel in Human-Machine Conversations Using Linguistic, Acoustic, and Visual Signals
☆32 · Aug 11, 2025 · Updated 9 months ago
Alternatives and similar repositories for MM-F2F
Users that are interested in MM-F2F are comparing it to the libraries listed below.
- Menagerie of video models trained on various video datasets ☆10 · Oct 13, 2024 · Updated last year
- Behavioral probing of language acquisition models at the lexical and syntactic level ☆20 · Jul 17, 2023 · Updated 2 years ago
- ☆17 · Apr 12, 2021 · Updated 5 years ago
- [ICLR 2025] Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning (SASR) ☆11 · Aug 26, 2025 · Updated 9 months ago
- ☆14 · May 20, 2025 · Updated last year
- StimulerVoiceX is a denoising and speech enhancement system. It uses deep learning techniques to remove noise from speech signals and imp… ☆13 · Jul 19, 2023 · Updated 2 years ago
- ☆12 · Jan 28, 2022 · Updated 4 years ago
- ☆22 · Apr 5, 2026 · Updated last month
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax ☆16 · Jun 16, 2024 · Updated last year
- Active Noise Control Headphone Implementation using Deep Learning-based GFANC ☆13 · Aug 28, 2025 · Updated 9 months ago
- Unsupervised deep convolutional neural network model for the ventral visual stream. ☆29 · Mar 24, 2023 · Updated 3 years ago
- This is the code for EEGDnet: Fusing non-local and local self-similarity for EEG signal denoising with transformer ☆17 · Jun 3, 2024 · Updated last year
- Delayless Generative Fixed-filter Active Noise Control based on Deep Learning and Bayesian Filter ☆15 · Aug 28, 2025 · Updated 9 months ago
- An official implementation of Style-Talker for Spoken Dialogue Generation ☆23 · Jan 12, 2025 · Updated last year
- Voice Activity Projection Models: Self-supervised learning of Turn-taking Events ☆102 · May 29, 2024 · Updated 2 years ago
- The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions" ☆50 · Apr 7, 2025 · Updated last year
- ☆22 · Feb 3, 2023 · Updated 3 years ago
- Real-time Implementation of CNN-based selective fixed-filter active noise control and effectiveness analysis using explainable AI ☆29 · Apr 12, 2024 · Updated 2 years ago
- Soniox Compare. Compare real-time voice AI side by side. No glossy charts, just results. ☆23 · Jul 15, 2025 · Updated 10 months ago
- ☆13 · Mar 25, 2021 · Updated 5 years ago
- [NeurIPS 2023] Official pytorch implementation of "Domain Re-Modulation for Few-Shot Generative Domain Adaption" ☆13 · Aug 2, 2024 · Updated last year
- ☆16 · May 14, 2020 · Updated 6 years ago
- StyleTTS 2 Optimized Training Fork ☆32 · Feb 2, 2025 · Updated last year
- ☆41 · May 15, 2023 · Updated 3 years ago
- Python Client Library for Health Graph API (http://developer.runkeeper.com/healthgraph). The API is used for accessing RunKeeper.com (htt… ☆27 · Aug 10, 2016 · Updated 9 years ago
- TurnGPT: a Transformer-based Language Model for Predicting Turn-taking in Spoken Dialog ☆68 · May 18, 2024 · Updated 2 years ago
- Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs ☆13 · Feb 13, 2024 · Updated 2 years ago
- Text-To-Speech for NotebookLM ☆39 · Jul 20, 2025 · Updated 10 months ago
- A real-time implementation of Voice Activity Projection (VAP) is aimed at controlling behaviors of spoken dialogue systems, such as turn-… ☆101 · Jul 24, 2025 · Updated 10 months ago
- TimeGPT forecaster example using streamlit ☆44 · Aug 5, 2023 · Updated 2 years ago
- ☆34 · Jun 15, 2021 · Updated 4 years ago
- ☆39 · Apr 3, 2025 · Updated last year
- a simple system for 2-way interruptible voice interactions between human and LLM ☆30 · Feb 18, 2024 · Updated 2 years ago
- To overcome the limitation and obtain more appropriate control filters, a generative fixed-filter active noise control (GFANC) approach i… ☆37 · Aug 28, 2025 · Updated 9 months ago
- ☆15 · Oct 24, 2023 · Updated 2 years ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability. ☆13 · Jul 15, 2024 · Updated last year
- ☆12 · Jan 4, 2022 · Updated 4 years ago
- The code used to create the ARCA23K and ARCA23K-FSD datasets ☆16 · Nov 9, 2021 · Updated 4 years ago
- Neural Network based Sound Source Localization Models ☆51 · Aug 29, 2023 · Updated 2 years ago