ali-vilab / iv-vaeLinks
☆13Updated 3 months ago
Alternatives and similar repositories for iv-vae
Users that are interested in iv-vae are comparing it to the libraries listed below
Sorting:
- AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model☆29Updated this week
- [Arxiv 2025] ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions☆28Updated 2 weeks ago
- ☆34Updated last week
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆85Updated 11 months ago
- [ NeurIPS 2024 D&B Track ] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"☆70Updated 5 months ago
- Implementation of MDP: A Generalized Framework for Text-Guided Image Editing by Manipulating the Diffusion Path☆68Updated 2 years ago
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆27Updated last year
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆80Updated last year
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers☆52Updated 8 months ago
- [ICLR 2025] You Only Sample Once: Taming One-Step Text-To-Image Synthesis by Self-Cooperative Diffusion GANs☆61Updated 3 months ago
- code for paper "Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models"☆42Updated last year
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing☆64Updated 2 weeks ago
- Codebase for the paper-Elucidating the design space of language models for image generation☆45Updated 7 months ago
- we propose to generate a series of geometric shapes with target colors to disentangle (or peel off ) the target colors from the shapes. B…☆64Updated 8 months ago
- ☆25Updated 10 months ago
- [ICLR2025] ClassDiffusion: Official impl. of Paper "ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance"☆41Updated 3 months ago
- Official code for Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing☆57Updated 3 weeks ago
- [NeurIPS 24] Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models☆39Updated 8 months ago
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024]☆81Updated 7 months ago
- Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation☆38Updated last year
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.☆49Updated 8 months ago
- Code Release for the paper "Make-A-Story: Visual Memory Conditioned Consistent Story Generation" in CVPR 2023☆39Updated last year
- Extend BoxDiff to SDXL (SDXL-based layout-to-image generation)☆24Updated last year
- Unofficial implementation of Layer Diffuse in diffusers☆27Updated last year
- [TMLR] Official PyTorch implementation of "λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent…☆51Updated 6 months ago
- ☆53Updated last year
- ☆21Updated last year
- The repository for AP-LDM☆14Updated 8 months ago
- Official code for CustAny: Customizing Anything from A Single Example. Accepted by CVPR2025 (Oral)☆46Updated 2 months ago
- Official implementation of "Divide & Bind Your Attention for Improved Generative Semantic Nursing" (BMVC 2023 Oral)☆36Updated last year