Foundation Models and Data for Human-Human and Human-AI interactions.
☆384Dec 13, 2025Updated 5 months ago
Alternatives and similar repositories for seamless_interaction
Users that are interested in seamless_interaction are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Implementation of the Paper:Motion-example-controlled Co-speech Gesture Generation Leveraging Large Language Models (Siggraph 20…☆32Mar 29, 2026Updated 2 months ago
- [ECCV 2024] Dyadic Interaction Modeling for Social Behavior Generation☆65Apr 23, 2025Updated last year
- ARTalk generates realistic 3D head motions (lip sync, blinking, expressions, head poses) from audio in ⚡ real-time ⚡.☆132May 19, 2026Updated 3 weeks ago
- The official SpeakerVid-5M data curation code.☆75Jul 23, 2025Updated 10 months ago
- Goliath Dataset and Official PyTorch Implementation of RelightableHands, Relightable Gaussian Codec Avatars, and Driving-Signal Aware Ful…☆357Dec 4, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- This repository is the official implementation of Human4DiT: 360-degree Human Video Generation with 4D Diffusion Transformer.☆113Nov 18, 2025Updated 6 months ago
- [EMNLP 2025 Findings] A complete cross-modal RAG system for end-to-end speech-to-speech large models, including ASR-based Retrieval and E…☆31Jul 11, 2025Updated 10 months ago
- Official release of StyleTalk dataset.☆74Jul 1, 2024Updated last year
- Code Repository for Paper "SphereHead: Stable 3D Full-head Synthesis with Spherical Tri-plane Representation"☆112Mar 10, 2026Updated 2 months ago
- Pytorch implementation of Unimotion: Unifying 3D Human Motion Synthesis and Understanding.☆99Apr 13, 2025Updated last year
- Official Code Base of the Paper: "Joker: Conditional 3D Head Synthesis with Extreme Facial Expressions"☆53Jan 31, 2025Updated last year
- Train universal codec avatars☆218Jun 17, 2025Updated 11 months ago
- Code for CVPR 2024 paper: ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis☆36Apr 29, 2025Updated last year
- Code Repository for MeshAvatar: Learning High-quality Triangular Human Avatars from Multi-view Videos (ECCV 2024)☆129Nov 10, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- SynShot - Synthetic Prior for Few-Shot Drivable Head Avatar Inversion [CVPR 2025]☆44May 18, 2025Updated last year
- A complete head tracking pipeline from videos to NeRF/3DGS-ready datasets.☆368Jul 19, 2025Updated 10 months ago
- [Open-source Project] UniMoCap: community implementation to unify the text-motion datasets (HumanML3D, KIT-ML, and BABEL) and whole-body …☆202Jan 4, 2024Updated 2 years ago
- The official implementation of the paper "Affective Faces for Goal-Driven Dyadic Communication."☆15Jan 27, 2023Updated 3 years ago
- Dataset for paper "OmniMotion-X: Versatile Multimodal Whole-Body Motion Generation"☆23Dec 22, 2025Updated 5 months ago
- [CVPR 2025] HumanMM: Global Human Motion Recovery from Multi-shot Videos☆118Mar 20, 2025Updated last year
- Convert bvh files to smplx parameters, with mesh and joint renderer.☆28May 6, 2025Updated last year
- [AAAI 2025] The official repository of UniMuMo☆130Sep 14, 2025Updated 8 months ago
- [NeurIPS 2024] Generalizable and Animatable Gaussian Head Avatar☆75Mar 13, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆76Dec 3, 2025Updated 6 months ago
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts☆23Apr 10, 2026Updated last month
- Official Implementation of AQ-GT: a Temporally Aligned and Quantized GRU-Transformer for Co-Speech Gesture Synthesis with the extension (…☆21Apr 19, 2024Updated 2 years ago
- ☆22Apr 17, 2024Updated 2 years ago
- [CVPR 2024] Official implementation of the paper "Towards Versatile Human-Human Interaction Analysis"☆222Aug 11, 2024Updated last year
- ☆89Dec 19, 2025Updated 5 months ago
- ☆447Dec 11, 2024Updated last year
- [CVPR 2025] Official repo for "FATE: Full-head Gaussian Avatar with Textural Editing from Monocular Video"☆118Sep 7, 2025Updated 9 months ago
- Pytorch implementation of "Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-training and Multi-modal T…☆12Apr 29, 2026Updated last month
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- LHM Video Dataset processing☆69Apr 8, 2025Updated last year
- Character Animation Tools for Python.☆266Dec 19, 2023Updated 2 years ago
- [CVPR 2022] Code for "Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation"☆144Mar 16, 2023Updated 3 years ago
- LUCY: Linguistic Understanding and Control Yielding Early Stage of Her☆60Apr 14, 2025Updated last year
- ☆35Jul 25, 2023Updated 2 years ago
- R3-Avatar: Record and Retrieve Temporal Codebook for Reconstructing Photorealistic Human Avatars☆23Nov 23, 2025Updated 6 months ago
- ☆61Aug 3, 2024Updated last year