[NeurIPS 2025 Oral] Official Code for Exploring Diffusion Transformer Designs via Grafting
☆72Jan 9, 2026Updated 2 months ago
Alternatives and similar repositories for grafting
Users that are interested in grafting are comparing it to the libraries listed below
Sorting:
- [ICLR26] LumiTex: Towards High-Fidelity PBR Texture Generation with Illumination Context☆40Feb 27, 2026Updated last week
- MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning [NeurIPS 2025 Poster]☆23Dec 10, 2025Updated 3 months ago
- Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation, ECCV 2024☆22Feb 15, 2024Updated 2 years ago
- Official codebase for our paper "Do Language Models Use Their Depth Efficiently?"☆29Jun 25, 2025Updated 8 months ago
- [NeurIPS 2025, Spotlight]: Ambient-o: Training Good models with Bad Data.☆31Jan 21, 2026Updated last month
- [NeurIPS 2025] ViewPoint: Panoramic Video Generation with Pretrained Diffusion Models☆31Jul 1, 2025Updated 8 months ago
- Source code for Activated LoRA☆24Nov 22, 2025Updated 3 months ago
- Code for ICML 2025 Paper "Highly Compressed Tokenizer Can Generate Without Training"☆203Jun 10, 2025Updated 8 months ago
- ☆25Aug 12, 2025Updated 6 months ago
- [CVPR2025] Official Implementations "One-Way Ticket : Time-Independent Unified Encoder for Distilling Text-to-Image Diffusion Models"☆28Jul 28, 2025Updated 7 months ago
- Single-pass Adaptive Image Tokenization for Minimum Program Search | What's the Kolmogorov Complexity of an Image?☆42Jul 26, 2025Updated 7 months ago
- A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis"☆47Jun 13, 2024Updated last year
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆86Jul 16, 2024Updated last year
- Official repository for LLaVA-Reward (ICCV 2025): Multimodal LLMs as Customized Reward Models for Text-to-Image Generation☆23Jul 30, 2025Updated 7 months ago
- Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"☆27Jul 10, 2023Updated 2 years ago
- [ICLR 2026] Official implementation of DiCache: Let Diffusion Model Determine Its Own Cache☆55Jan 26, 2026Updated last month
- MonetGPT: Solving Puzzles Enhances MLLMs' Image Retouching Skills [SIGGRAPH 2025]☆75Jan 21, 2026Updated last month
- [CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis☆62Apr 27, 2025Updated 10 months ago
- [NeurIPS 2024, spotlight] Multivariate Learned Adaptive Noise for Diffusion Models☆31Dec 11, 2024Updated last year
- Code of ImageNet training and evaluation for the paper: RENAS: Reinforced Evolutionary Neural Architecture Search☆20May 15, 2019Updated 6 years ago
- ☆27Feb 9, 2023Updated 3 years ago
- Official Code Release of SAGE: Scalable Agentic 3D Scene Generation for Embodied AI☆150Feb 20, 2026Updated 2 weeks ago
- Official code for "Accelerating Feedforward Computation via Parallel Nonlinear Equation Solving", ICML 2021☆29Sep 25, 2021Updated 4 years ago
- ☆39May 20, 2025Updated 9 months ago
- Evaluating text-to-image/video/3D models with VQAScore☆377Sep 22, 2025Updated 5 months ago
- A simple script to see how my ideas evolve over time☆44Jun 4, 2025Updated 9 months ago
- ☆128Nov 27, 2025Updated 3 months ago
- Official Implementation of Paper Transfer between Modalities with MetaQueries☆309Oct 12, 2025Updated 4 months ago
- [CVPR 2026] Training-free Mixed-Resolution Latent Upsampling for Spatially Accelerated Diffusion Transformers☆54Feb 22, 2026Updated 2 weeks ago
- ☆47Jan 26, 2026Updated last month
- StableWorld: Towards Stable and Consistent Long Interactive Video Generation☆81Feb 3, 2026Updated last month
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data☆35Mar 12, 2024Updated last year
- [NeurIPS'25 Spotlight] Boosting Generative Image Modeling via Joint Image-Feature Synthesis☆116Nov 3, 2025Updated 4 months ago
- [ACM MM 2025] HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation☆157Sep 4, 2025Updated 6 months ago
- Official Implementation of "LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis"☆78Aug 25, 2025Updated 6 months ago
- ☆58Oct 15, 2025Updated 4 months ago
- ☆65Jan 6, 2026Updated 2 months ago
- [ICLR26] GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning☆104Jan 27, 2026Updated last month
- ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback☆121Sep 20, 2025Updated 5 months ago