[ACL 2025] Predicting Turn-Taking and Backchannel in Human-Machine Conversations Using Linguistic, Acoustic, and Visual Signals
☆32Aug 11, 2025Updated 8 months ago
Alternatives and similar repositories for MM-F2F
Users who are interested in MM-F2F are comparing it to the libraries listed below.
- Menagerie of video models trained on various video datasets☆10Oct 13, 2024Updated last year
- Behavioral probing of language acquisition models at the lexical and syntactic level☆19Jul 17, 2023Updated 2 years ago
- ☆17Apr 12, 2021Updated 5 years ago
- [ICLR 2025] Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning (SASR)☆11Aug 26, 2025Updated 7 months ago
- ☆14May 20, 2025Updated 10 months ago
- ☆20Apr 5, 2026Updated last week
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆16Jun 16, 2024Updated last year
- Active Noise Control Headphone Implementation using Deep Learning-based GFANC☆13Aug 28, 2025Updated 7 months ago
- Unsupervised deep convolutional neural network model for the ventral visual stream.☆29Mar 24, 2023Updated 3 years ago
- This is the code for EEGDnet: Fusing non-local and local self-similarity for EEG signal denoising with transformer☆17Jun 3, 2024Updated last year
- An official implementation of Style-Talker for Spoken Dialogue Generation☆23Jan 12, 2025Updated last year
- Voice Activity Projection Models: Self-supervised learning of Turn-taking Events☆98May 29, 2024Updated last year
- Extrinsic calibration for the color and depth cameras of Pepper robot☆10Dec 1, 2017Updated 8 years ago
- An addon for the Godot-Engine, that allows runtime, scene-wide texture painting on the GPU.☆53Mar 13, 2026Updated last month
- The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions"☆50Apr 7, 2025Updated last year
- Real-time Implementation of CNN-based selective fixed-filter active noise control and effectiveness analysis using explainable AI☆29Apr 12, 2024Updated 2 years ago
- Simple interactive scene changer☆15Jun 14, 2023Updated 2 years ago
- Simulation and Visualization tool for the Robot Raconteur robotics middleware☆11May 6, 2020Updated 5 years ago
- ☆13Mar 25, 2021Updated 5 years ago
- Object recognition with Pepper using a deep learning model☆10Sep 16, 2021Updated 4 years ago
- A server to receive facial motion capture data from Live Link Face: https://apps.apple.com/us/app/live-link-face/id1495370836☆14Jan 4, 2023Updated 3 years ago
- [NeurIPS 2023] Official pytorch implementation of "Domain Re-Modulation for Few-Shot Generative Domain Adaption"☆13Aug 2, 2024Updated last year
- Real-time MIDI fuzzy chord and scale identification☆15Nov 8, 2023Updated 2 years ago
- StyleTTS 2 Optimized Training Fork☆33Feb 2, 2025Updated last year
- ☆41May 15, 2023Updated 2 years ago
- Python Client Library for Health Graph API (http://developer.runkeeper.com/healthgraph). The API is used for accessing RunKeeper.com (htt…☆27Aug 10, 2016Updated 9 years ago
- TurnGPT: a Transformer-based Language Model for Predicting Turn-taking in Spoken Dialog☆68May 18, 2024Updated last year
- A simple parser library for URDF files that returns a robot object which can be used to access links, joints, transformation matrices, e…☆15Jan 19, 2022Updated 4 years ago
- Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs☆13Feb 13, 2024Updated 2 years ago
- A real-time implementation of Voice Activity Projection (VAP), aimed at controlling behaviors of spoken dialogue systems, such as turn-…☆98Jul 24, 2025Updated 8 months ago
- ☆34Jun 15, 2021Updated 4 years ago
- ☆38Apr 3, 2025Updated last year
- a simple system for 2-way interruptible voice interactions between human and LLM☆30Feb 18, 2024Updated 2 years ago
- The code used to create the ARCA23K and ARCA23K-FSD datasets☆15Nov 9, 2021Updated 4 years ago
- A virtual musical instrument built using Google MediaPipe.☆12Oct 10, 2022Updated 3 years ago
- ☆15Oct 24, 2023Updated 2 years ago
- Create your own autonomous agent to help with the development of Godot Engine games.☆28Dec 23, 2025Updated 3 months ago
- ☆19Jan 30, 2023Updated 3 years ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆13Jul 15, 2024Updated last year