YAIxPOZAlabs / Improving-TrXL-for-ComMULinks
YAI 11 x @POZAlabs : Improving & Evaluating Music Generation with ComMU
☆13Updated 2 years ago
Alternatives and similar repositories for Improving-TrXL-for-ComMU
Users that are interested in Improving-TrXL-for-ComMU are comparing it to the libraries listed below
Sorting:
- Official repository of Yonsei university AI society☆24Updated 5 months ago
- YAI 11 x @POZAlabs : Music generation & modification from Unclear midi SEquence with Diffusion model☆26Updated last year
- [NAACL'24] Repository for "SMILE: Multimodal Dataset for Understanding Laughter in Video with Language Models"☆15Updated last year
- Studio-YAIVERSE : Text-guided 3D synthesis by GET3D + NADA☆23Updated 2 years ago
- Download scripts and tools for Replay dataset.☆36Updated 2 years ago
- [NeurIPS 2023] AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis☆32Updated last year
- Official code for the paper "Understanding Co-speech Gestures in-the-wild"☆20Updated 2 months ago
- Official code release of "DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding" [ICCV2025 Highlight]☆44Updated 3 months ago
- ☆17Updated 5 months ago
- [CVPR 2023] iQuery: Instruments as Queries for Audio-Visual Sound Separation☆70Updated 2 years ago
- ☆40Updated 8 months ago
- YAI 10th x Alchera : Blur Face Detection☆19Updated 3 years ago
- Toy Project: Classification and Detection of representative lung diseases, Lung Opacity and COVID-19, from X-Ray Radiography.☆11Updated 4 years ago
- [🏆 IJCV 2025 & ACCV 2024 Best Paper Honorable Mention] Official pytorch implementation of the paper "High-Quality Visually-Guided Sound …☆25Updated 2 months ago
- Foundation Models and Data for Human-Human and Human-AI interactions.☆328Updated 2 weeks ago
- Motion to Dance Music Generation using Latent Diffusion Model☆23Updated 2 years ago
- Official PyTorch implementation of the paper "A Brand New Dance Partner:Music-Conditioned Pluralistic Dancing Synthesized by Multiple Dan…☆38Updated 3 years ago
- [INTERSPEECH'24] Official repository for "Enhancing Speech-Driven 3D Facial Animation with Audio-Visual Guidance from Lip Reading Expert"☆19Updated 6 months ago
- Code for Novel View Acoustic Synthesis paper☆51Updated 2 years ago
- ContactGen: Contact-Guided Interactive 3D Human Generation for Partners (AAAI 2024)☆18Updated last year
- ☆49Updated last year
- [BMVC'25] Official repository for "Learning Correlation-aware Aleatoric Uncertainty for 3D Hand Pose Estimation"☆23Updated 3 weeks ago
- MotionChain: Conversational Motion Controllers via Multimodal Prompts☆68Updated last year
- ☆33Updated 2 years ago
- Code release for PianoMotion10M☆100Updated 9 months ago
- M3GPT: An advanced multimodal, multitask framework for motion comprehension and generation.☆19Updated last year
- ☆16Updated 2 weeks ago
- [NeurIPS'25] Automated Model Discovery via Multi-modal & Multi-step Pipeline☆21Updated 3 weeks ago
- ☆47Updated last year
- Official repo of "Motion-Agent: A Conversational Framework for Human Motion Generation with LLMs"☆106Updated last week