ali-vilab / iv-vae
☆11Updated last month
Alternatives and similar repositories for iv-vae:
Users who are interested in iv-vae are comparing it to the libraries listed below:
- Codebase for the paper "Elucidating the Design Space of Language Models for Image Generation"☆45Updated 5 months ago
- ☆31Updated last month
- HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation☆54Updated 2 months ago
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024]☆79Updated 5 months ago
- Official implementation of LaVin-DiT☆30Updated 2 months ago
- Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers☆49Updated this week
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆26Updated 11 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆83Updated 9 months ago
- We propose to generate a series of geometric shapes with target colors to disentangle (or peel off) the target colors from the shapes. B…☆58Updated 6 months ago
- The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign)☆69Updated 6 months ago
- ICCV2023-Diffusion-Papers☆109Updated last year
- Code for the paper "Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models"☆42Updated last year
- The official repository of paper "ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection" (N…☆50Updated last year
- The official implementation of "Neighboring Autoregressive Modeling for Efficient Visual Generation"☆47Updated 2 weeks ago
- [NeurIPS 2024 D&B Track] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"☆68Updated 3 months ago
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing☆52Updated last month
- ☆21Updated last year
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers☆50Updated 6 months ago
- [ICLR2025] ClassDiffusion: Official impl. of Paper "ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance"☆40Updated last month
- [NeurIPS 24] Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models☆37Updated 6 months ago
- The repository for AP-LDM☆14Updated 6 months ago
- Official pytorch implementation for SingleInsert☆26Updated last year
- DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models☆46Updated last year
- Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editing☆27Updated 4 months ago
- Official GitHub repository for the Text-Guided Video Editing (TGVE) competition of LOVEU Workshop @ CVPR'23.☆75Updated last year
- [ICLR 2025] You Only Sample Once: Taming One-Step Text-To-Image Synthesis by Self-Cooperative Diffusion GANs☆55Updated last month
- Stable Consistency Tuning: Understanding and Improving Consistency models☆16Updated 5 months ago
- Official implementation of Aurora☆82Updated last year
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.☆47Updated 6 months ago
- [TMLR] Official PyTorch implementation of "λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent…☆51Updated 4 months ago