ali-vilab / iv-vaeLinks
☆22Updated 6 months ago
Alternatives and similar repositories for iv-vae
Users that are interested in iv-vae are comparing it to the libraries listed below
Sorting:
- ☆38Updated 3 months ago
- [ NeurIPS 2024 D&B Track ] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"☆71Updated 8 months ago
- The official implementation of "Neighboring Autoregressive Modeling for Efficient Visual Generation"☆57Updated 5 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆85Updated last year
- [Arxiv 2025] ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions☆38Updated 3 months ago
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations☆146Updated 7 months ago
- Training-Free Text-Guided Image Editing Using Visual Autoregressive Model☆59Updated 5 months ago
- Transition Models☆125Updated last week
- [ICLR 2024] Official PyTorch/Diffusers implementation of "Object-aware Inversion and Reassembly for Image Editing"☆88Updated last year
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers☆57Updated 11 months ago
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024]☆83Updated 10 months ago
- DC-Gen: Accelerating Diffusion Models with Compressed Latent Space☆84Updated 3 weeks ago
- we propose to generate a series of geometric shapes with target colors to disentangle (or peel off ) the target colors from the shapes. B…☆67Updated 11 months ago
- ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models☆77Updated 2 weeks ago
- [ICML 2025] EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM☆67Updated 2 months ago
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing☆69Updated 2 months ago
- Implementation of paper EditCLIP: Representation Learning for Image Editing (ICCV 2025)☆28Updated 2 months ago
- Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model (SIGGRAPH 2024)☆37Updated last year
- Official PyTorch implementation for ICLR2024 paper "The Blessing of Randomness: SDE Beats ODE in General Diffusion-based Image Editing"☆110Updated last year
- [NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator☆95Updated last year
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project☆173Updated 6 months ago
- Code for ICLR 2024 paper "Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators"☆106Updated last year
- Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization☆43Updated last week
- [NeurIPS 2024] Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation☆67Updated 11 months ago
- Official model implementation and benchmark evaluation repository of <AnyEdit: Unified High-Quality Image Edit with Any Idea>☆28Updated 2 months ago
- Subjects200K dataset☆118Updated 8 months ago
- PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT☆129Updated 5 months ago
- PixNerd: Pixel Neural Field Diffusion☆117Updated last week
- Official source codes of "TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation" (ICLR 2025)☆57Updated 8 months ago
- [ICCV 2025] FreeFlux: Understanding and Exploiting Layer-Specific Roles in RoPE-Based MMDiT for Versatile Image Editing☆59Updated 3 weeks ago