Foundation Models and Data for Human-Human and Human-AI interactions.
☆372Dec 13, 2025Updated 4 months ago
Alternatives and similar repositories for seamless_interaction
Users that are interested in seamless_interaction are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Implementation of the Paper:Motion-example-controlled Co-speech Gesture Generation Leveraging Large Language Models (Siggraph 20…☆27Mar 29, 2026Updated last month
- [ECCV 2024] Dyadic Interaction Modeling for Social Behavior Generation☆64Apr 23, 2025Updated last year
- ARTalk generates realistic 3D head motions (lip sync, blinking, expressions, head poses) from audio in ⚡ real-time ⚡.☆129Mar 11, 2026Updated last month
- The official SpeakerVid-5M data curation code.☆73Jul 23, 2025Updated 9 months ago
- Goliath Dataset and Official PyTorch Implementation of RelightableHands, Relightable Gaussian Codec Avatars, and Driving-Signal Aware Ful…☆351Dec 4, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This repository is the official implementation of Human4DiT: 360-degree Human Video Generation with 4D Diffusion Transformer.☆111Nov 18, 2025Updated 5 months ago
- [EMNLP 2025 Findings] A complete cross-modal RAG system for end-to-end speech-to-speech large models, including ASR-based Retrieval and E…☆31Jul 11, 2025Updated 9 months ago
- Official release of StyleTalk dataset.☆72Jul 1, 2024Updated last year
- Code Repository for Paper "SphereHead: Stable 3D Full-head Synthesis with Spherical Tri-plane Representation"☆125Mar 10, 2026Updated last month
- Pytorch implementation of Unimotion: Unifying 3D Human Motion Synthesis and Understanding.☆99Apr 13, 2025Updated last year
- Official Code Base of the Paper: "Joker: Conditional 3D Head Synthesis with Extreme Facial Expressions"☆53Jan 31, 2025Updated last year
- Train universal codec avatars☆215Jun 17, 2025Updated 10 months ago
- Code for CVPR 2024 paper: ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis☆36Apr 29, 2025Updated last year
- Code Repository for MeshAvatar: Learning High-quality Triangular Human Avatars from Multi-view Videos (ECCV 2024)☆128Nov 10, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- SynShot - Synthetic Prior for Few-Shot Drivable Head Avatar Inversion [CVPR 2025]☆44May 18, 2025Updated 11 months ago
- A complete head tracking pipeline from videos to NeRF/3DGS-ready datasets.☆358Jul 19, 2025Updated 9 months ago
- [Open-source Project] UniMoCap: community implementation to unify the text-motion datasets (HumanML3D, KIT-ML, and BABEL) and whole-body …☆201Jan 4, 2024Updated 2 years ago
- Dataset for paper "OmniMotion-X: Versatile Multimodal Whole-Body Motion Generation"☆23Dec 22, 2025Updated 4 months ago
- The official implementation of the paper "Affective Faces for Goal-Driven Dyadic Communication."☆15Jan 27, 2023Updated 3 years ago
- [CVPR 2025] HumanMM: Global Human Motion Recovery from Multi-shot Videos☆118Mar 20, 2025Updated last year
- Convert bvh files to smplx parameters, with mesh and joint renderer.☆27May 6, 2025Updated 11 months ago
- [AAAI 2025] The official repository of UniMuMo☆129Sep 14, 2025Updated 7 months ago
- [NeurIPS 2024] Generalizable and Animatable Gaussian Head Avatar☆74Mar 13, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆76Dec 3, 2025Updated 4 months ago
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts☆22Apr 10, 2026Updated 2 weeks ago
- Official Implementation of AQ-GT: a Temporally Aligned and Quantized GRU-Transformer for Co-Speech Gesture Synthesis with the extension (…☆21Apr 19, 2024Updated 2 years ago
- ☆22Apr 17, 2024Updated 2 years ago
- [CVPR 2024] Official implementation of the paper "Towards Versatile Human-Human Interaction Analysis"☆221Aug 11, 2024Updated last year
- ☆88Dec 19, 2025Updated 4 months ago
- ☆439Dec 11, 2024Updated last year
- [CVPR 2025] Official repo for "FATE: Full-head Gaussian Avatar with Textural Editing from Monocular Video"☆116Sep 7, 2025Updated 7 months ago
- Pytorch implementation of "Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-training and Multi-modal T…☆12Mar 9, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- LHM Video Dataset processing☆67Apr 8, 2025Updated last year
- Character Animation Tools for Python.☆264Dec 19, 2023Updated 2 years ago
- [CVPR 2022] Code for "Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation"☆144Mar 16, 2023Updated 3 years ago
- LUCY: Linguistic Understanding and Control Yielding Early Stage of Her☆60Apr 14, 2025Updated last year
- ☆35Jul 25, 2023Updated 2 years ago
- R3-Avatar: Record and Retrieve Temporal Codebook for Reconstructing Photorealistic Human Avatars☆23Nov 23, 2025Updated 5 months ago
- [ICCV 2025] MotionStreamer: Streaming Motion Generation via Diffusion-based Autoregressive Model in Causal Latent Space☆268Oct 28, 2025Updated 6 months ago