kuai-lab / soundini-official
We are committing code.
☆44Updated last year
Alternatives and similar repositories for soundini-official:
Users that are interested in soundini-official are comparing it to the libraries listed below
- ☆63Updated 2 months ago
- This repository is for The Power of Sound(TPoS): Audio Reactive Video Generation with Stable Diffusion (ICCV2023)☆23Updated last year
- ☆20Updated last month
- Official PyTorch implementation of "Learning to Generate Semantic Layouts for Higher Text-Image Correspondence in Text-to-Image Synthesis…☆44Updated last year
- official implementation of the paper: Towards End-to-End Generative Modeling of Long Videos with Memory-Efficient Bidirectional Transform…☆29Updated last year
- Dense Interspecies Face Embedding (NeurIPS 2022)☆24Updated last year
- Codebase for the Paper: Learning Visual Styles from Audio-Visual Associations (ECCV 2022, in PyTorch)☆15Updated 2 years ago
- Sound-guided Semantic Image Manipulation - Official Pytorch Code (CVPR 2022)☆81Updated last year
- SMILE: A Multimodal Dataset for Understanding Laughter☆14Updated last year
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data☆33Updated last year
- Code for "DreamEdit: Subject-driven Image Editing" (TMLR2023)☆106Updated last year
- Evaluation benchmark for the task of Semantic Image Translation. Contains code to run FlexIT (CVPR 2022)☆35Updated 3 years ago
- [ICLR'24] Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition☆37Updated 10 months ago
- [ECCV 2024] Official Pytorch Implementation for "Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing"☆25Updated last month
- Unofficial Implementation of E-LatentLPIPS(Ensembled-LatentLPIPS) of Diffusion2GAN☆40Updated 8 months ago
- Generate videos that interpolate between two given images☆97Updated last year
- Official PyTorch implementation of "Generalized Consistency Trajectory Models for Image Manipulation"☆37Updated last year
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Layout Control with Cross-Attention Guidance".☆41Updated last year
- ☆65Updated last year
- ☆45Updated 7 months ago
- ☆13Updated 6 months ago
- DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models☆46Updated last year
- Official implementation for "pOps: Photo-Inspired Diffusion Operators"☆80Updated 8 months ago
- Code for Novel View Acoustic Synthesis paper☆45Updated last year
- ☆30Updated 4 months ago
- Website source files for Diffusion2GAN Project.☆78Updated 6 months ago
- The official implementation of the paper "Affective Faces for Goal-Driven Dyadic Communication."☆15Updated 2 years ago
- ☆14Updated 3 weeks ago
- Official code for SeMani (CVPR 2020 oral and Journal extension)☆23Updated last year
- An implementation of simple diffusion in PyTorch (and JAX)☆35Updated 2 years ago