ali-vilab / iv-vae
☆11Updated 2 months ago
Alternatives and similar repositories for iv-vae
Users who are interested in iv-vae are comparing it to the libraries listed below
- ☆30Updated 2 months ago
- Codebase for the paper "Elucidating the design space of language models for image generation"☆45Updated 5 months ago
- HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation☆57Updated 2 months ago
- ☆19Updated 2 years ago
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆27Updated last year
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆80Updated last year
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆83Updated 9 months ago
- A curated list of papers and resources for text-to-image evaluation.☆29Updated last year
- code for paper "Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models"☆42Updated last year
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024]☆81Updated 5 months ago
- MaskFlow: Discrete Flows For Flexible and Efficient Long Video Generation☆21Updated 2 months ago
- [Reward is all you need for few-step diffusion model] Rewards Are Enough for Fast Photo-Realistic Text-to-image Generation☆31Updated last month
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.☆47Updated 7 months ago
- Implementation of MDP: A Generalized Framework for Text-Guided Image Editing by Manipulating the Diffusion Path☆68Updated last year
- ☆28Updated 2 months ago
- [ NeurIPS 2024 D&B Track ] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"☆68Updated 4 months ago
- The official implementation of "Neighboring Autoregressive Modeling for Efficient Visual Generation"☆48Updated last month
- [ICCV 2023 Oral, Best Paper Finalist] ITI-GEN: Inclusive Text-to-Image Generation☆67Updated last year
- No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselves☆23Updated last week
- ☆25Updated 9 months ago
- [ACM MM 2024] Training-free Cross-domain Image Composition via Adaptive Latent Manipulation and Energy-guided Optimization☆14Updated 4 months ago
- We propose to generate a series of geometric shapes with target colors to disentangle (or peel off) the target colors from the shapes. B…☆58Updated 7 months ago
- [ICLR2025] ClassDiffusion: Official impl. of Paper "ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance"☆41Updated 2 months ago
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing☆57Updated 2 months ago
- code for "TVG: A Training-free Transition Video Generation Method with Diffusion Models"☆41Updated 8 months ago
- 🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"☆107Updated 11 months ago
- ☆36Updated 2 years ago
- [ICLR 2025] You Only Sample Once: Taming One-Step Text-To-Image Synthesis by Self-Cooperative Diffusion GANs☆55Updated 2 months ago
- [ICLR 24] MaGIC: Multi-modality Guided Image Completion☆49Updated last year
- Official pytorch implementation for SingleInsert☆26Updated last year