YAIxPOZAlabs / Improving-TrXL-for-ComMU
YAI 11 x @POZAlabs : Improving & Evaluating Music Generation with ComMU
☆14 · Updated last year
Alternatives and similar repositories for Improving-TrXL-for-ComMU:
Users who are interested in Improving-TrXL-for-ComMU are comparing it to the repositories listed below.
- YAI 11 x @POZAlabs : Music generation & modification from Unclear midi SEquence with Diffusion model · ☆28 · Updated last year
- Official repository of Yonsei university AI society · ☆24 · Updated 2 months ago
- Studio-YAIVERSE : Text-guided 3D synthesis by GET3D + NADA · ☆24 · Updated last year
- Toy Project: Classification and Detection of representative lung diseases, Lung Opacity and COVID-19, from X-Ray Radiography. · ☆11 · Updated 3 years ago
- Official code release of "DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding" · ☆18 · Updated last month
- YAI 10th x Alchera : Blur Face Detection · ☆19 · Updated 2 years ago
- Code for Novel View Acoustic Synthesis paper · ☆44 · Updated last year
- ☆32 · Updated 4 months ago
- Official Implementation (Pytorch) of "Constant Acceleration Flow", NeurIPS 2024 · ☆30 · Updated 2 weeks ago
- ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer · ☆25 · Updated last month
- ☆45 · Updated 7 months ago
- Official code implementation for the work Preference Alignment with Flow Matching (NeurIPS 2024) · ☆41 · Updated 3 months ago
- Data and Pytorch implementation of IEEE TMM "EmotionGesture: Audio-Driven Diverse Emotional Co-Speech 3D Gesture Generation" · ☆23 · Updated 11 months ago
- This repository is for The Power of Sound (TPoS): Audio Reactive Video Generation with Stable Diffusion (ICCV 2023) · ☆23 · Updated last year
- [NAACL'24] Repository for "SMILE: Multimodal Dataset for Understanding Laughter in Video with Language Models" · ☆12 · Updated 8 months ago
- Official implementation of "Retrieval-Augmented Score Distillation for Text-to-3D Generation" · ☆50 · Updated 2 months ago
- ☆15 · Updated 4 months ago
- Official implementation of "Safeguard Text-to-Image Diffusion Models with Human Feedback Inversion" (ECCV 2024) · ☆10 · Updated 5 months ago
- [ICLR 2025] NeRAF jointly learns acoustic and radiance fields, enabling realistic audio-visual generation. · ☆17 · Updated this week
- ☆24 · Updated last year
- Codebase for the Paper: Learning Visual Styles from Audio-Visual Associations (ECCV 2022, in PyTorch) · ☆14 · Updated 2 years ago
- Official implementation of the paper "FLAME: Free-form Language-based Motion Synthesis & Editing" · ☆111 · Updated last year
- Code for paper "RapVerse: Coherent Vocals and Whole-Body Motions Generations from Text" · ☆14 · Updated 8 months ago
- ☆14 · Updated 9 months ago
- Official Repository for CVPR 2024 paper PEGASUS: Personalized Generative 3D Avatars with Composable Attributes · ☆58 · Updated last month
- Implementation of Multi-Source Music Generation with Latent Diffusion. · ☆22 · Updated 5 months ago
- ☆47 · Updated last year
- Sound-guided Semantic Image Manipulation - Official Pytorch Code (CVPR 2022) · ☆82 · Updated last year
- Implementation of the paper, T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis, ac… · ☆28 · Updated 8 months ago