(CVPR 2025) Scailing Down Text Encoders of Text-to-Image Diffusion Models
☆52Sep 10, 2025Updated 5 months ago
Alternatives and similar repositories for DistillT5
Users that are interested in DistillT5 are comparing it to the libraries listed below
Sorting:
- Video Diffusion Transformers are In-Context Learners☆35Jan 6, 2025Updated last year
- TPDiff: Temporal Pyramid Video Diffusion Model☆25Mar 13, 2025Updated 11 months ago
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening☆69May 18, 2025Updated 9 months ago
- Official implementation of Aurora☆85Sep 20, 2023Updated 2 years ago
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆16Sep 7, 2024Updated last year
- ☆16Sep 1, 2025Updated 6 months ago
- [ICCV 2025] Official repository of DiffSim: Taming Diffusion Models for Evaluating Visual Similarity☆30Jul 14, 2025Updated 7 months ago
- ☆31Sep 1, 2025Updated 6 months ago
- [CVPR 2025 GMCV] Test-Time Frequency Scaling: Instant Frequency Control for Any Diffusion Model☆55May 31, 2025Updated 9 months ago
- Official implementation of Add-SD: Rational Generation without Manual Reference.☆28Aug 19, 2024Updated last year
- The official implementation of "Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers" (arXiv …☆51Jun 6, 2025Updated 9 months ago
- Code for Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning☆36Jun 16, 2024Updated last year
- ☆16May 13, 2025Updated 9 months ago
- ☆36Updated this week
- ☆28Sep 4, 2025Updated 6 months ago
- Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing"☆311Sep 28, 2025Updated 5 months ago
- An open-source implementation of Regional Adaptive Sampling (RAS), a novel diffusion model sampling strategy that introduces regional var…☆150Jun 25, 2025Updated 8 months ago
- Official repository for CVPR 2025 paper: OpenSDI: Spotting Diffusion-Generated Images in the Open World☆41Jul 8, 2025Updated 8 months ago
- [ICLR 2025] Trajectory Attention For Fine-grained Video Motion Control☆99May 13, 2025Updated 9 months ago
- Visualize and manipulate the latent space in ComfyUI☆25Jan 14, 2026Updated last month
- [ICCV 2025] The Curse of Conditions: Analyzing and Improving Optimal Transport for Conditional Flow-Based Generation☆22Oct 12, 2025Updated 4 months ago
- dinov2 features aligned with CLIP☆21Jul 9, 2024Updated last year
- OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing☆41Jan 9, 2026Updated 2 months ago
- Official PyTorch Implementation for Readout Guidance, CVPR 2024☆152Jun 26, 2025Updated 8 months ago
- ☆49Feb 9, 2026Updated last month
- stochastic bfloat16 based optimizer library☆21Dec 4, 2024Updated last year
- Exploring Representation-Aligned Latent Space for Better Generation☆17Feb 4, 2025Updated last year
- [CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis☆131May 16, 2025Updated 9 months ago
- 🔥 Official impl. of "DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction"☆166Jul 10, 2025Updated 8 months ago
- Official implementation of Inductive Moment Matching☆576Jul 11, 2025Updated 7 months ago
- ☆20Jan 1, 2026Updated 2 months ago
- [CVPR 2024] BerfScene: Bev-conditioned Equivariant Radiance Fields for Infinite 3D Scene Generation☆45May 7, 2024Updated last year
- [ICCV 2025] Edicho: Consistent Image Editing in the Wild☆124Oct 22, 2025Updated 4 months ago
- https://github.com/xie-lab-ml/Golden-Noise-for-Diffusion-Models for ComfyUI☆17Dec 10, 2024Updated last year
- ☆17Feb 20, 2025Updated last year
- Official implementation of MTM☆21Aug 30, 2023Updated 2 years ago
- Video Diffusion State Space Models☆19Mar 27, 2024Updated last year
- ☆18Oct 23, 2024Updated last year
- DALL-E for Detection: Language-driven Compositional Image Synthesis for Object Detection☆21Oct 5, 2023Updated 2 years ago