YAIxPOZAlabs / Improving-TrXL-for-ComMU
YAI 11 x @POZAlabs : Improving & Evaluating Music Generation with ComMU
☆14 · Updated last year
Related projects
Alternatives and complementary repositories for Improving-TrXL-for-ComMU
- YAI 11 x @POZAlabs : Music generation & modification from Unclear midi SEquence with Diffusion model ☆27 · Updated 9 months ago
- Official repository of Yonsei university AI society ☆23 · Updated 2 months ago
- YAI 10th x Alchera : Blur Face Detection ☆17 · Updated 2 years ago
- Toy Project: Classification and Detection of representative lung diseases, Lung Opacity and COVID-19, from X-Ray Radiography. ☆10 · Updated 3 years ago
- Korean Streaming ASR (with Denoiser and Conformer CTC) ☆19 · Updated 6 months ago
- Studio-YAIVERSE : Text-guided 3D synthesis by GET3D + NADA ☆23 · Updated last year
- Official code release of "DEEPTalk: Dynamic Emotion Embedding for Probabilistic Speech-Driven 3D Face Animation" ☆13 · Updated 2 months ago
- Sound-guided Semantic Image Manipulation - Official Pytorch Code (CVPR 2022) ☆81 · Updated last year
- Code for Novel View Acoustic Synthesis paper ☆44 · Updated last year
- [NAACL'24] Repository for "SMILE: Multimodal Dataset for Understanding Laughter in Video with Language Models" ☆10 · Updated 5 months ago
- Official repository for the paper Multimodal Transformer Distillation for Audio-Visual Synchronization (ICASSP 2024). ☆23 · Updated 7 months ago
- Codebase for the Paper: Learning Visual Styles from Audio-Visual Associations (ECCV 2022, in PyTorch) ☆14 · Updated last year
- Official implementation of the paper "FLAME: Free-form Language-based Motion Synthesis & Editing" ☆108 · Updated 10 months ago
- Efficient synchronization from sparse cues ☆28 · Updated 6 months ago
- Official Code Repository for the paper "Grid Diffusion Models for Text-to-Video Generation", CVPR 2024 ☆18 · Updated 2 months ago
- Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis ☆37 · Updated last year
- Official Repository of IJCAI 2024 Paper: "BATON: Aligning Text-to-Audio Model with Human Preference Feedback" ☆18 · Updated 3 months ago
- Official implementation of "Safeguard Text-to-Image Diffusion Models with Human Feedback Inversion" (ECCV 2024) ☆9 · Updated 2 months ago
- The Introduction of the OLKAVS Dataset ☆30 · Updated 5 months ago
- Official implementation of "Retrieval-Augmented Score Distillation for Text-to-3D Generation" ☆48 · Updated 6 months ago
- Code for "SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking Faces" ACM MM 2023 ☆30 · Updated last year
- Official Implementation (Pytorch) of "Constant Acceleration Flow", NeurIPS 2024 ☆20 · Updated this week