YAIxPOZAlabs / Improving-TrXL-for-ComMULinks
YAI 11 x @POZAlabs : Improving & Evaluating Music Generation with ComMU
☆13Updated 2 years ago
Alternatives and similar repositories for Improving-TrXL-for-ComMU
Users that are interested in Improving-TrXL-for-ComMU are comparing it to the libraries listed below
Sorting:
- Official repository of Yonsei university AI society☆24Updated 3 months ago
- [NAACL'24] Repository for "SMILE: Multimodal Dataset for Understanding Laughter in Video with Language Models"☆13Updated last year
- YAI 11 x @POZAlabs : Music generation & modification from Unclear midi SEquence with Diffusion model☆26Updated last year
- Official code for the paper "Understanding Co-speech Gestures in-the-wild"☆16Updated 2 months ago
- Studio-YAIVERSE : Text-guided 3D synthesis by GET3D + NADA☆23Updated 2 years ago
- [INTERSPEECH'24] Official repository for "Enhancing Speech-Driven 3D Facial Animation with Audio-Visual Guidance from Lip Reading Expert"☆17Updated 4 months ago
- Official code release of "DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding" [ICCV2025 Highlight]☆40Updated last month
- [NeurIPS 2023] AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis☆31Updated last year
- Download scripts and tools for Replay dataset.☆35Updated 2 years ago
- Foundation Models and Data for Human-Human and Human-AI interactions.☆300Updated 2 months ago
- Official implementation of the paper "FLAME: Free-form Language-based Motion Synthesis & Editing"☆118Updated last year
- ☆36Updated 6 months ago
- Toy Project: Classification and Detection of representative lung diseases, Lung Opacity and COVID-19, from X-Ray Radiography.☆11Updated 4 years ago
- ICCV 2025☆55Updated last month
- Code for Novel View Acoustic Synthesis paper☆51Updated 2 years ago
- Official PyTorch implementation of the paper "A Brand New Dance Partner:Music-Conditioned Pluralistic Dancing Synthesized by Multiple Dan…☆37Updated 3 years ago
- Sound-guided Semantic Image Manipulation - Official Pytorch Code (CVPR 2022)☆80Updated 2 years ago
- Data and Pytorch implementation of IEEE TMM "EmotionGesture: Audio-Driven Diverse Emotional Co-Speech 3D Gesture Generation"☆29Updated last year
- ☆14Updated 2 months ago
- ☆16Updated 3 months ago
- Motion to Dance Music Generation using Latent Diffusion Model☆19Updated last year
- [CVPR 2025] SALAD: Skeleton-aware Latent Diffusion for Text-driven Motion Generation and Editing☆83Updated 4 months ago
- [CVPR 2024] AMUSE: Emotional Speech-driven 3D Body Animation via Disentangled Latent Diffusion☆133Updated last year
- Official implementation of "MoST: Motion Style Transformer between Diverse Action Contents"☆35Updated last year
- Official pytorch implementation for Learning to Listen: Modeling Non-Deterministic Dyadic Facial Motion (CVPR 2022)☆123Updated last year
- [CVPR 2024] Arbitrary Motion Style Transfer with Multi-condition Motion Latent Diffusion Model☆74Updated last year
- Official implementation of "ViSAGe: Video-to-Spatial AUdio Generation" (ICLR 2025)☆33Updated last month
- [ECCV 2024 Oral] Audio-Synchronized Visual Animation☆56Updated last year
- YAI 10th x Alchera : Blur Face Detection☆19Updated 2 years ago
- Freetalker: Controllable Speech and Text-Driven Gesture Generation Based on Diffusion Models for Enhanced Speaker Naturalness (ICASSP 202…☆71Updated last year