Huage001 / CLEAR
Official PyTorch implementation of the paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".
☆199 Updated last month
Alternatives and similar repositories for CLEAR:
Users who are interested in CLEAR are comparing it to the libraries listed below:
- Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image" ☆294 Updated 2 months ago
- Code for FreeScale, a tuning-free method for higher-resolution visual generation ☆118 Updated 2 weeks ago
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations ☆137 Updated last month
- [ICLR 2025] FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality ☆198 Updated 2 months ago
- [NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising ☆191 Updated last month
- The code of our work "Golden Noise for Diffusion Models: A Learning Framework". ☆144 Updated last month
- ☆191 Updated last month
- Subjects200K dataset ☆103 Updated 2 months ago
- ☆113 Updated 5 months ago
- Official code for VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control ☆182 Updated 2 months ago
- ☆49 Updated 2 months ago
- Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers ☆110 Updated 2 months ago
- Adaptive Caching for Faster Video Generation with Diffusion Transformers ☆142 Updated 4 months ago
- Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization ☆186 Updated 3 months ago
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025] ☆90 Updated last month
- [ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation ☆172 Updated last month
- ☆87 Updated 8 months ago
- ☆80 Updated 3 months ago
- The official implementation of the paper titled "StableV2V: Stablizing Shape Consistency in Video-to-Video Editing". ☆145 Updated 3 months ago
- [AAAI 2025] Official pytorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion … ☆158 Updated 11 months ago
- [Arxiv 2024] Edicho: Consistent Image Editing in the Wild ☆114 Updated 2 months ago
- GenEval: An object-focused framework for evaluating text-to-image alignment ☆197 Updated 2 weeks ago
- [CVPR2025] PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Framework for Cross-Modality Evol … ☆142 Updated last week
- VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE ☆297 Updated 2 months ago
- [ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user-provided control signals. A model that supports free user input of control… ☆123 Updated 8 months ago
- [NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching ☆149 Updated 4 months ago
- [ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation ☆254 Updated 3 weeks ago