CVPR 2026-Guiding a Diffusion Transformer with the Internal Dynamics of Itself (IG)
☆60Feb 26, 2026Updated last week
Alternatives and similar repositories for Internal-Guidance
Users that are interested in Internal-Guidance are comparing it to the libraries listed below
Sorting:
- ICLR 2026-MVAR: Visual Autoregressive Modeling with Scale and Spatial Markovian Conditioning☆35Feb 9, 2026Updated 3 weeks ago
- [CVPR 2025] Uncertainty-guided Perturbation for Image Super-Resolution Diffusion Model☆61Jun 22, 2025Updated 8 months ago
- (CVPR2025) Learned Image Compression with Dictionary-based Entropy Model☆77Oct 23, 2025Updated 4 months ago
- [NeurIPS2024] Causal Context Adjustment Loss for Learned Image Compression☆66Apr 4, 2025Updated 11 months ago
- ☆23Oct 15, 2024Updated last year
- Official repository of SoftREPA: Aligning Text to Image in Diffusion Models is Easier Than You Think☆19Jun 5, 2025Updated 8 months ago
- Minute-long video generation at 24FPS.☆50Feb 2, 2026Updated last month
- ICCV 2025-PerLDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models☆53Jan 5, 2026Updated last month
- The official repo for the DanQing dataset.☆30Jan 16, 2026Updated last month
- ☆42Jan 19, 2026Updated last month
- The official code of FeRA: Frequency–Energy Constrained Routing for Effective Diffusion Adaptation Fine-Tuning☆28Dec 27, 2025Updated 2 months ago
- [ECCV 2024] BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion☆21Jul 2, 2024Updated last year
- [ICLR 2026] Self-Representation Alignment for Diffusion Transformers (SRA)☆111Feb 22, 2026Updated last week
- CVPR2025-Progressive Focused Transformer for Single Image Super-Resolution☆160Jan 17, 2026Updated last month
- ☆23Mar 25, 2024Updated last year
- [CVPR 2024] Transcending the Limit of Local Window: Advanced Super-Resolution Transformer with Adaptive Token Dictionary☆192Mar 24, 2025Updated 11 months ago
- A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis"☆47Jun 13, 2024Updated last year
- [NeurIPS 2024] Official implementation of "Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation Models."☆55Aug 14, 2025Updated 6 months ago
- [ICLR 2026] Official implementation of DiCache: Let Diffusion Model Determine Its Own Cache☆55Jan 26, 2026Updated last month
- [CVPR 2025] Official PyTorch implementation of Not All Parameters Matter: Masking Diffusion Models for Enhancing Generation Ability☆32Jul 1, 2025Updated 8 months ago
- Official codes for the paper "GARDO: Reinforcing Diffusion Models without Reward Hacking"☆56Feb 2, 2026Updated last month
- ☆75Dec 8, 2025Updated 2 months ago
- Official Repo for Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics☆71Jan 13, 2026Updated last month
- [ICML 2024] Matrix Information Theory for Self-supervised Learning (https://arxiv.org/abs/2305.17326)☆31Sep 21, 2025Updated 5 months ago
- A modular graph based DataSet implementation for Pytorch☆37Updated this week
- [SIGGRAPH Asia 2025] Official Implementation of "ConsistEdit: Highly Consistent and Precise Training-free Visual Editing"☆69Dec 2, 2025Updated 3 months ago
- Official repository of "FMA-Net++: Motion- and Exposure-Aware Real-World Joint Video Super-Resolution and Deblurring"☆77Dec 5, 2025Updated 3 months ago
- ☆53Jan 5, 2026Updated last month
- Terminal Velocity Matching☆67Feb 14, 2026Updated 2 weeks ago
- ☆21Dec 14, 2025Updated 2 months ago
- The official repo of continuous speculative decoding☆31Mar 28, 2025Updated 11 months ago
- Minimal multi-gpu implementation of EDM2: "Analyzing and Improving the Training Dynamics of Diffusion Models"☆40Mar 5, 2024Updated last year
- [CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models☆1,402Dec 16, 2025Updated 2 months ago
- [CVPR2026] Official repository for “DeCo: Frequency-Decoupled Pixel Diffusion for End-to-End Image Generation”☆176Updated this week
- [Arxiv'25] MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization☆55Sep 16, 2025Updated 5 months ago
- [ICLR 2026] UniEdit-Flow: Unleashing Inversion and Editing in the Era of Flow Models☆43Aug 4, 2025Updated 7 months ago
- Adapting LLaMA Decoder to Vision Transformer☆30May 20, 2024Updated last year
- [NeurIPS 2024] Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution☆34Dec 23, 2024Updated last year
- [NeurIPS 2025 Oral] Representation Entanglement for Generation: Training Diffusion Transformers Is Much Easier Than You Think☆252Oct 4, 2025Updated 5 months ago