[ACL 2025] Predicting Turn-Taking and Backchannel in Human-Machine Conversations Using Linguistic, Acoustic, and Visual Signals
☆32Aug 11, 2025Updated 10 months ago
Alternatives and similar repositories for MM-F2F
Users who are interested in MM-F2F are comparing it to the libraries listed below.
- Menagerie of video models trained on various video datasets☆10Oct 13, 2024Updated last year
- ☆17Apr 12, 2021Updated 5 years ago
- [ICLR 2025] Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning (SASR)☆11Aug 26, 2025Updated 9 months ago
- ☆14May 20, 2025Updated last year
- StimulerVoiceX is a denoising and speech enhancement system. It uses deep learning techniques to remove noise from speech signals and imp…☆13Jul 19, 2023Updated 2 years ago
- ☆12Jan 28, 2022Updated 4 years ago
- ☆22Apr 5, 2026Updated 2 months ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆16Jun 16, 2024Updated 2 years ago
- Active Noise Control Headphone Implementation using Deep Learning-based GFANC☆13Aug 28, 2025Updated 9 months ago
- Unsupervised deep convolutional neural network model for the ventral visual stream.☆29Mar 24, 2023Updated 3 years ago
- This is the code for EEGDnet: Fusing non-local and local self-similarity for EEG signal denoising with transformer☆17Jun 3, 2024Updated 2 years ago
- An official implementation of Style-Talker for Spoken Dialogue Generation☆23Jan 12, 2025Updated last year
- Voice Activity Projection Models: Self-supervised learning of Turn-taking Events☆103May 29, 2024Updated 2 years ago
- The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions"☆50Apr 7, 2025Updated last year
- ☆22Feb 3, 2023Updated 3 years ago
- Real-time Implementation of CNN-based selective fixed-filter active noise control and effectiveness analysis using explainable AI☆30Apr 12, 2024Updated 2 years ago
- This work provides MATLAB code for the McFxLMS algorithm, which can be used for systems with an arbitrary number of channels.☆30Feb 18, 2024Updated 2 years ago
- ☆13Mar 25, 2021Updated 5 years ago
- [NeurIPS 2023] Official pytorch implementation of "Domain Re-Modulation for Few-Shot Generative Domain Adaption"☆13Aug 2, 2024Updated last year
- StyleTTS 2 Optimized Training Fork☆32Feb 2, 2025Updated last year
- ☆41May 15, 2023Updated 3 years ago
- Python Client Library for Health Graph API (http://developer.runkeeper.com/healthgraph). The API is used for accessing RunKeeper.com (htt…☆27Aug 10, 2016Updated 9 years ago
- TurnGPT: a Transformer-based Language Model for Predicting Turn-taking in Spoken Dialog☆69May 18, 2024Updated 2 years ago
- Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs☆13Feb 13, 2024Updated 2 years ago
- Text-To-Speech for NotebookLM☆39Jul 20, 2025Updated 10 months ago
- A real-time implementation of Voice Activity Projection (VAP) is aimed at controlling behaviors of spoken dialogue systems, such as turn-…☆101Jul 24, 2025Updated 10 months ago
- ☆34Jun 15, 2021Updated 5 years ago
- ☆39Apr 3, 2025Updated last year
- A simple system for 2-way interruptible voice interactions between human and LLM☆30Feb 18, 2024Updated 2 years ago
- To overcome the limitation and obtain more appropriate control filters, a generative fixed-filter active noise control (GFANC) approach i…☆38Aug 28, 2025Updated 9 months ago
- ☆15Oct 24, 2023Updated 2 years ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆13Jul 15, 2024Updated last year
- The code used to create the ARCA23K and ARCA23K-FSD datasets☆16Nov 9, 2021Updated 4 years ago
- Node.js Client for RunKeeper Health Graph API; based originally off an existing fork of node-runkeeper that appears to no longer be activ…☆29May 18, 2019Updated 7 years ago
- Neural Network based Sound Source Localization Models☆51Aug 29, 2023Updated 2 years ago
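- A knowledge base built on native browser bookmarks: it syncs bookmarks across browsers with GitHub Gist, uses AI to generate summaries, tags, and cover images for them, and provides a clean web browsing experience.☆32May 25, 2026Updated 3 weeks ago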
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated last year
- Open-source text-to-speech model from KRAFTON trained exclusively on public speech data, with curated datasets and reproducible training …☆69May 21, 2026Updated 3 weeks ago
- ☆10Oct 17, 2021Updated 4 years ago