kuai-lab / sound-guided-semantic-image-manipulation
Sound-guided Semantic Image Manipulation - Official Pytorch Code (CVPR 2022)
☆81 · Updated last year
Related projects:
- ☆128 · Updated last year
- ☆25 · Updated 8 months ago
- [NAACL'24] Repository for "SMILE: Multimodal Dataset for Understanding Laughter in Video with Language Models" ☆9 · Updated 3 months ago
- Official Codebase of "Localizing Visual Sounds the Easy Way" (ECCV 2022) ☆29 · Updated last year
- ☆51 · Updated 3 years ago
- PseudoDiffusers: paper/code review and experimental findings related to computer vision generation and diffusion-based models ☆42 · Updated last week
- [NeurIPS'22] Official code of "ComMU: Dataset for Combinatorial Music Generation" ☆139 · Updated last year
- [AAAI-24] VVS : Video-to-Video Retrieval With Irrelevant Frame Suppression ☆18 · Updated 4 months ago
- 2023 Spring SNU Computer Vision Project ☆14 · Updated last year
- YAI 11 x @POZAlabs : Music generation & modification from Unclear midi SEquence with Diffusion model ☆27 · Updated 7 months ago
- Official repository of PanoAVQA: Grounded Audio-Visual Question Answering in 360° Videos (ICCV 2021) ☆13 · Updated 2 years ago
- ☆10 · Updated 2 months ago
- Official Code Repository for the paper "Grid Diffusion Models for Text-to-Video Generation", CVPR 2024 ☆13 · Updated 3 weeks ago
- Official PyTorch implementation of Generating Videos with Dynamics-aware Implicit Generative Adversarial Networks (ICLR 2022). ☆182 · Updated last year
- Official implementation of the paper "FLAME: Free-form Language-based Motion Synthesis & Editing" ☆108 · Updated 8 months ago
- Simple Tensorflow implementation of "Toward Spatially Unbiased Generative Models" (ICCV 2021) ☆15 · Updated 2 years ago
- Official Pytorch implementation of GGDR (ECCV 2022) ☆102 · Updated 2 years ago
- Codebase for the Paper: Learning Visual Styles from Audio-Visual Associations (ECCV 2022, in PyTorch) ☆15 · Updated last year
- This repository is for The Power of Sound (TPoS): Audio Reactive Video Generation with Stable Diffusion (ICCV 2023) ☆19 · Updated 9 months ago
- [ICCV23] BallGAN: 3D-aware Image Synthesis with a Spherical Background ☆38 · Updated 2 weeks ago
- Efficient synchronization from sparse cues ☆25 · Updated 4 months ago
- [IJCAI-2022] Can We Find Neurons that Cause Unrealistic Images in Deep Generative Networks? ☆24 · Updated 2 years ago
- Toward Spatially Unbiased Generative Models (ICCV 2021) ☆90 · Updated 3 years ago
- Cyclic Co-Learning of Sounding Object Visual Grounding and Sound Separation ☆23 · Updated 2 years ago
- 2023 Korean AI Competition ☆12 · Updated 10 months ago
- [ICLR-2023] Rarity Score : A New Metric to Evaluate the Uncommonness of Synthesized Images ☆59 · Updated 2 years ago
- An official implementation of "Diffusion Video Autoencoders: Toward Temporally Consistent Face Video Editing via Disentangled Video Encod… ☆144 · Updated 11 months ago
- Official repository of Yonsei university AI society ☆23 · Updated 3 weeks ago
- Official codebase for "Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling". ☆16 · Updated last month
- The app for visualizing allocated GPUs by SLURM ☆10 · Updated 7 months ago