kuai-lab / soundini-officialLinks
We are committing code.
☆44Updated 2 years ago
Alternatives and similar repositories for soundini-official
Users that are interested in soundini-official are comparing it to the libraries listed below
Sorting:
- ☆65Updated last month
- ☆46Updated 11 months ago
- Generate videos that interpolate between two given images☆102Updated last year
- This repository is for The Power of Sound(TPoS): Audio Reactive Video Generation with Stable Diffusion (ICCV2023)☆23Updated last year
- ☆64Updated 2 years ago
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Layout Control with Cross-Attention Guidance".☆42Updated 2 years ago
- ☆16Updated 2 months ago
- Implementation of Transframer, Deepmind's U-net + Transformer architecture for up to 30 seconds video generation, in Pytorch☆71Updated 2 years ago
- Official PyTorch implementation of "Learning to Generate Semantic Layouts for Higher Text-Image Correspondence in Text-to-Image Synthesis…☆44Updated last year
- Sound-guided Semantic Image Manipulation - Official Pytorch Code (CVPR 2022)☆80Updated last year
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image …☆84Updated last year
- Implementation of InstructEdit☆75Updated last year
- ☆20Updated 4 months ago
- This repository provides utilities to a minimal dataset for InstructPix2Pix like training for Diffusion models.☆47Updated 2 years ago
- Codebase for the Paper: Learning Visual Styles from Audio-Visual Associations (ECCV 2022, in PyTorch)☆15Updated 2 years ago
- [NeurIPS 2024] Empirical Lessons Toward Memory-Efficient and Fast Diffusion Models for Text-to-Image Synthesis☆146Updated 7 months ago
- DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models☆46Updated last year
- [NeurIPS 2022: Score-Based Modeling Workshop] Multiresolution Textual Inversion☆99Updated 2 years ago
- ☆21Updated 2 years ago
- ☆33Updated 8 months ago
- My Implementation of " Structure and Content-Guided Video Synthesis with Diffusion Models" by RunwayML☆26Updated last year
- Official implementation of "Art-Free Generative Models: Art Creation Without Graphic Art Knowledge"☆31Updated 3 months ago
- Official implementation of "Perturbed-Attention Guidance"☆57Updated last year
- Code for Novel View Acoustic Synthesis paper☆48Updated last year
- Code for "DreamEdit: Subject-driven Image Editing" (TMLR2023)☆108Updated last year
- ☆4Updated 9 months ago
- Official code for SeMani (CVPR 2020 oral and Journal extension)☆23Updated last year
- [ECCV 2024] Official Pytorch Implementation for "Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing"☆27Updated last month
- SMILE: A Multimodal Dataset for Understanding Laughter☆13Updated 2 years ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆34Updated last year