o0o0o00o0 / AlphaVAE
☆61 Updated last week
Alternatives and similar repositories for AlphaVAE
Users who are interested in AlphaVAE are comparing it to the libraries listed below.
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers ☆64 Updated last year
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing ☆71 Updated 5 months ago
- ☆106 Updated last year
- Conceptrol: Concept Control of Zero-shot Personalized Image Generation ☆44 Updated 9 months ago
- [ICCV 2025] FreeFlux: Understanding and Exploiting Layer-Specific Roles in RoPE-Based MMDiT for Versatile Image Editing ☆70 Updated 3 months ago
- Subjects200K dataset ☆128 Updated 11 months ago
- Pytorch Implementation of "SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation" (CVPR 2024) ☆128 Updated last year
- ☆34 Updated last year
- Official implementation of "Make It Count: Text-to-Image Generation with an Accurate Number of Objects" (CVPR 2025) ☆95 Updated 9 months ago
- Code for ICLR 2024 paper "Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators" ☆106 Updated last year
- ☆41 Updated 11 months ago
- [ICCV 2025] Edicho: Consistent Image Editing in the Wild ☆123 Updated 2 months ago
- ☆91 Updated last year
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025] ☆137 Updated 11 months ago
- Official repository of IDEA-Bench ☆37 Updated 11 months ago
- [NeurIPS 2024] RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models ☆119 Updated last year
- Official code of "Edit Transfer: Learning Image Editing via Vision In-Context Relations" ☆87 Updated 6 months ago
- The public source code of "FreCaS: Efficient Higher-Resolution Image Generation via Frequency-aware Cascaded Sampling" ☆27 Updated 5 months ago
- LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models ☆66 Updated last year
- ☆104 Updated last year
- [ICCV2025] Official implementation of "IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation". ☆61 Updated 6 months ago
- [CVPR 2025] PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation ☆44 Updated 6 months ago
- Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model (SIGGRAPH 2024) ☆37 Updated last year
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis ☆65 Updated 7 months ago
- This is an official repository for the paper, NoiseCollage, which is a revolutionary extension of text-to-image diffusion models for layo… ☆63 Updated last year
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method ☆27 Updated 2 months ago
- [ICML 2025] EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM ☆71 Updated 5 months ago
- PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation ☆36 Updated last year
- [ECCV 2024] Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models ☆88 Updated last year
- [WACV 2025] MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning ☆96 Updated 8 months ago