Foundation Models and Data for Human-Human and Human-AI interactions.
☆356Dec 13, 2025Updated 2 months ago
Alternatives and similar repositories for seamless_interaction
Users that are interested in seamless_interaction are comparing it to the libraries listed below
Sorting:
- The official SpeakerVid-5M data curation code.☆68Jul 23, 2025Updated 7 months ago
- [ECCV 2024] Dyadic Interaction Modeling for Social Behavior Generation☆62Apr 23, 2025Updated 10 months ago
- Pytorch implementation of Unimotion: Unifying 3D Human Motion Synthesis and Understanding.☆94Apr 13, 2025Updated 10 months ago
- [EMNLP 2025 Findings] A complete cross-modal RAG system for end-to-end speech-to-speech large models, including ASR-based Retrieval and E…☆28Jul 11, 2025Updated 7 months ago
- Official release of StyleTalk dataset.☆72Jul 1, 2024Updated last year
- ARTalk generates realistic 3D head motions (lip sync, blinking, expressions, head poses) from audio in ⚡ real-time ⚡.☆121Jun 12, 2025Updated 8 months ago
- Goliath Dataset and Official PyTorch Implementation of RelightableHands, Relightable Gaussian Codec Avatars, and Driving-Signal Aware Ful…☆343Dec 4, 2024Updated last year
- Convert bvh files to smplx parameters, with mesh and joint renderer.☆25May 6, 2025Updated 10 months ago
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆77Dec 3, 2025Updated 3 months ago
- [Open-source Project] UniMoCap: community implementation to unify the text-motion datasets (HumanML3D, KIT-ML, and BABEL) and whole-body …☆197Jan 4, 2024Updated 2 years ago
- [CVPR 2025] HumanMM: Global Human Motion Recovery from Multi-shot Videos☆118Mar 20, 2025Updated 11 months ago
- Official Implementation of the Paper:Motion-example-controlled Co-speech Gesture Generation Leveraging Large Language Models (Siggraph 20…☆24Dec 18, 2025Updated 2 months ago
- [ICCV 2025] MotionStreamer: Streaming Motion Generation via Diffusion-based Autoregressive Model in Causal Latent Space☆259Oct 28, 2025Updated 4 months ago
- SynShot - Synthetic Prior for Few-Shot Drivable Head Avatar Inversion [CVPR 2025]☆44May 18, 2025Updated 9 months ago
- JATTS: A modern, research-oriented Japanese Text-to-speech Open-sourced Toolkit☆44May 26, 2025Updated 9 months ago
- [CVPR 2024] Official implementation of the paper "Towards Versatile Human-Human Interaction Analysis"☆212Aug 11, 2024Updated last year
- MotionGPT3: Human Motion as a Second Modality, a MoT-based framework for unified motion understanding and generation☆176Jan 14, 2026Updated last month
- Official Code Base of the Paper: "Joker: Conditional 3D Head Synthesis with Extreme Facial Expressions"☆52Jan 31, 2025Updated last year
- ☆21Apr 17, 2024Updated last year
- Code Repository for MeshAvatar: Learning High-quality Triangular Human Avatars from Multi-view Videos (ECCV 2024)☆126Nov 10, 2024Updated last year
- [CVPR 2022] Code for "Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation"☆144Mar 16, 2023Updated 2 years ago
- ☆61Aug 3, 2024Updated last year
- [AAAI 2025] The official repository of UniMuMo☆129Sep 14, 2025Updated 5 months ago
- LHM Video Dataset processing☆64Apr 8, 2025Updated 11 months ago
- This repository is the official implementation of Human4DiT: 360-degree Human Video Generation with 4D Diffusion Transformer.☆107Nov 18, 2025Updated 3 months ago
- [NeurIPS 2025] Benchmark data and code for MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix☆197Feb 25, 2026Updated last week
- Train universal codec avatars☆211Jun 17, 2025Updated 8 months ago
- Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM☆17Nov 7, 2024Updated last year
- The official implementation of the paper "Affective Faces for Goal-Driven Dyadic Communication."☆15Jan 27, 2023Updated 3 years ago
- Large-Vocabulary Continuous Sign Language Recognition, 2024☆15May 30, 2024Updated last year
- Code Repository for Paper "SphereHead: Stable 3D Full-head Synthesis with Spherical Tri-plane Representation"☆107Mar 17, 2025Updated 11 months ago
- List of Podcast Feeds using iTunes API and script to download 6,000,000~ hours of English speech.☆31Apr 13, 2023Updated 2 years ago
- MotionFix: Text-Driven 3D Human Motion Editing [SIGGRAPH ASIA 2024]☆155Feb 27, 2026Updated last week
- [NeurIPS 2024] Generalizable and Animatable Gaussian Head Avatar☆70Mar 13, 2025Updated 11 months ago
- Character Animation Tools for Python.☆262Dec 19, 2023Updated 2 years ago
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts☆21Dec 22, 2025Updated 2 months ago
- Official Implementation of AQ-GT: a Temporally Aligned and Quantized GRU-Transformer for Co-Speech Gesture Synthesis with the extension (…☆21Apr 19, 2024Updated last year
- Official code for the paper "Understanding Co-speech Gestures in-the-wild"☆20Oct 31, 2025Updated 4 months ago
- [CVPR 2025] MG-MotionLLM: A Unified Framework for Motion Comprehension and Generation across Multiple Granularities☆32Apr 6, 2025Updated 11 months ago