kaist-ami / SMILE-DatasetLinks
[NAACL'24] Repository for "SMILE: Multimodal Dataset for Understanding Laughter in Video with Language Models"
☆13Updated last year
Alternatives and similar repositories for SMILE-Dataset
Users that are interested in SMILE-Dataset are comparing it to the libraries listed below
Sorting:
- Official code for the paper "Understanding Co-speech Gestures in-the-wild"☆14Updated 3 weeks ago
- ☆36Updated 5 months ago
- YAI 11 x @POZAlabs : Improving & Evaluating Music Generation with ComMU☆13Updated 2 years ago
- Sound-guided Semantic Image Manipulation - Official Pytorch Code (CVPR 2022)☆80Updated 2 years ago
- Data and Pytorch implementation of IEEE TMM "EmotionGesture: Audio-Driven Diverse Emotional Co-Speech 3D Gesture Generation"☆28Updated last year
- [INTERSPEECH'24] Official repository for "Enhancing Speech-Driven 3D Facial Animation with Audio-Visual Guidance from Lip Reading Expert"☆16Updated 2 months ago
- Official code release of "DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding" [ICCV2025 Highlight]☆38Updated last month
- Codebase for the Paper: Learning Visual Styles from Audio-Visual Associations (ECCV 2022, in PyTorch)☆15Updated 2 years ago
- Official pytorch implementation for Learning to Listen: Modeling Non-Deterministic Dyadic Facial Motion (CVPR 2022)☆118Updated last year
- ☆17Updated last year
- Foundation Models and Data for Human-Human and Human-AI interactions.☆271Updated last month
- [ECCV 2024] - ScanTalk: 3D Talking Heads from Unregistered Scans☆50Updated 5 months ago
- [CVPR 2025] UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing☆48Updated 5 months ago
- PATS Dataset. Aligned Pose-Audio-Transcripts and Style for co-speech gesture research☆62Updated 2 years ago
- This repository is for The Power of Sound(TPoS): Audio Reactive Video Generation with Stable Diffusion (ICCV2023)☆23Updated last year
- Studio-YAIVERSE : Text-guided 3D synthesis by GET3D + NADA☆23Updated 2 years ago
- [CVPR'25] Official repository for "Perceptually Accurate 3D Talking Head Generation: New Definitions, Speech-Mesh Representation, and Eva…☆32Updated last month
- Official implementation of the paper "FLAME: Free-form Language-based Motion Synthesis & Editing"☆117Updated last year
- [AAAI 2023 Summer Symposium, Best Paper Award] Taming Diffusion Models for Music-driven Conducting Motion Generation☆26Updated last year
- Official Repository for CVPR 2024 paper PEGASUS: Personalized Generative 3D Avatars with Composable Attributes☆58Updated 8 months ago
- Official implementation of "MoST: Motion Style Transformer between Diverse Action Contents"☆34Updated last year
- ☆35Updated 10 months ago
- [AAAI2025] Official repo for paper "MotionCraft: Crafting Whole-Body Motion with Plug-and-Play Multimodal Controls"☆106Updated 8 months ago
- [CVPR 2025] SALAD: Skeleton-aware Latent Diffusion for Text-driven Motion Generation and Editing☆71Updated 3 months ago
- Towards Variable and Coordinated Holistic Co-Speech Motion Generation, CVPR 2024☆57Updated last year
- [ICCV23] BallGAN: 3D-aware Image Synthesis with a Spherical Background☆39Updated last year
- Official repository for Muti-human Interactive Talking Dataset☆46Updated last month
- [CVPR 2024] AMUSE: Emotional Speech-driven 3D Body Animation via Disentangled Latent Diffusion☆132Updated last year
- Debiasing Scores and Prompts of 2D Diffusion for View-consistent Text-to-3D Generation (D-SDS) | NeurIPS 2023☆46Updated last year
- Code for Novel View Acoustic Synthesis paper☆51Updated 2 years ago