YAIxPOZAlabs / Improving-TrXL-for-ComMU
YAI 11 x @POZAlabs : Improving & Evaluating Music Generation with ComMU
☆13 · Updated 2 years ago
Alternatives and similar repositories for Improving-TrXL-for-ComMU
Users interested in Improving-TrXL-for-ComMU are comparing it to the repositories listed below.
- Official repository of Yonsei university AI society ☆24 · Updated 2 months ago
- [NAACL'24] Repository for "SMILE: Multimodal Dataset for Understanding Laughter in Video with Language Models" ☆13 · Updated last year
- YAI 11 x @POZAlabs : Music generation & modification from Unclear midi SEquence with Diffusion model ☆27 · Updated last year
- Studio-YAIVERSE : Text-guided 3D synthesis by GET3D + NADA ☆23 · Updated 2 years ago
- ☆15 · Updated last month
- [INTERSPEECH'24] Official repository for "Enhancing Speech-Driven 3D Facial Animation with Audio-Visual Guidance from Lip Reading Expert" ☆16 · Updated 2 months ago
- Foundation Models and Data for Human-Human and Human-AI interactions. ☆261 · Updated 2 weeks ago
- Download scripts and tools for Replay dataset. ☆35 · Updated 2 years ago
- Official code release of "DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding" [ICCV2025 Highlight] ☆35 · Updated last month
- Motion to Dance Music Generation using Latent Diffusion Model ☆19 · Updated last year
- ☆36 · Updated 4 months ago
- Official implementation of "MoST: Motion Style Transformer between Diverse Action Contents" ☆34 · Updated last year
- Code for Novel View Acoustic Synthesis paper ☆50 · Updated 2 years ago
- ☆29 · Updated 2 years ago
- [NeurIPS 2023] AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis ☆29 · Updated last year
- YAI 10th x Alchera : Blur Face Detection ☆19 · Updated 2 years ago
- Official implementation of the paper "FLAME: Free-form Language-based Motion Synthesis & Editing" ☆117 · Updated last year
- Official code for the paper "Understanding Co-speech Gestures in-the-wild" ☆14 · Updated last week
- [CVPR 2025] SALAD: Skeleton-aware Latent Diffusion for Text-driven Motion Generation and Editing ☆70 · Updated 3 months ago
- Data and Pytorch implementation of IEEE TMM "EmotionGesture: Audio-Driven Diverse Emotional Co-Speech 3D Gesture Generation" ☆26 · Updated last year
- Official PyTorch implementation of the paper "A Brand New Dance Partner: Music-Conditioned Pluralistic Dancing Synthesized by Multiple Dan… ☆36 · Updated 3 years ago
- [CVPR 2024] AMUSE: Emotional Speech-driven 3D Body Animation via Disentangled Latent Diffusion ☆130 · Updated last year
- [CVPR 2024] Arbitrary Motion Style Transfer with Multi-condition Motion Latent Diffusion Model ☆73 · Updated 10 months ago
- Freetalker: Controllable Speech and Text-Driven Gesture Generation Based on Diffusion Models for Enhanced Speaker Naturalness (ICASSP 202… ☆70 · Updated last year
- ☆18 · Updated last year
- [NeurIPS 2024] HOI-Swap: Swapping Objects in Videos with Hand-Object Interaction Awareness ☆25 · Updated 10 months ago
- Sound-guided Semantic Image Manipulation - Official Pytorch Code (CVPR 2022) ☆80 · Updated 2 years ago
- ☆47 · Updated last year
- Official implement of "AMD: Autoregressive Motion Diffusion" ☆19 · Updated 9 months ago
- Code release for PianoMotion10M ☆90 · Updated 5 months ago