sail-sg / BindDiffusion
BindDiffusion: One Diffusion Model to Bind Them All
☆166Updated last year
Alternatives and similar repositories for BindDiffusion:
Users that are interested in BindDiffusion are comparing it to the libraries listed below
- EILeV: Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties☆123Updated 5 months ago
- Generate image from anything with ImageBind and Stable Diffusion☆198Updated last year
- ☆171Updated last year
- Fine-tuning "ImageBind One Embedding Space to Bind Them All" with LoRA☆180Updated last year
- Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models☆354Updated last year
- Code for "DreamEdit: Subject-driven Image Editing" (TMLR2023)☆107Updated last year
- Let's make a video clip☆93Updated 2 years ago
- [IEEE TVCG 2024] Customized Video Generation Using Textual and Structural Guidance☆191Updated last year
- ☆65Updated last year
- An in-context conditioning version of MUSE with pre-trained checkpoints.☆111Updated last year
- ☆147Updated last year
- Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis☆315Updated last year
- Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning☆296Updated 9 months ago
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions☆129Updated last year
- Official Pytorch Implementation of Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models☆198Updated last year
- Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models"☆402Updated last year
- Unofficial implementation of Tune-A-Video☆192Updated 2 years ago
- Supercharged BLIP-2 that can handle videos☆117Updated last year
- Better Aligning Text-to-Image Models with Human Preference. ICCV 2023☆281Updated last year
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Structured Diffusion Guidance for Compositional Text…☆120Updated 2 years ago
- [WACV 2024] Training-Free Layout Control with Cross-Attention Guidance☆257Updated last year
- LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation☆477Updated 5 months ago
- ☆82Updated last year
- [NeurIPS 2023 Spotlight] Real-World Image Variation by Aligning Diffusion Inversion Chain☆149Updated last year
- [SIGGRAPH Asia 2024] ReVersion: Diffusion-Based Relation Inversion from Images☆503Updated 4 months ago
- [IJCV 2025] Paragraph-to-Image Generation with Information-Enriched Diffusion Model☆103Updated 3 weeks ago
- [NeurIPS 2024] VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models☆143Updated 6 months ago
- ☆173Updated last year
- ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation (ICCV 2023, Oral)☆534Updated last year
- Retrieval-Augmented Video Generation for Telling a Story☆256Updated last year