Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers
β66Oct 16, 2024Updated last year
Alternatives and similar repositories for I-Max
Users that are interested in I-Max are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π₯π₯π₯A curated list of papers on recent diffusion-based high-resolution image and video synthesis works.β166Dec 26, 2024Updated last year
- Official implementation of DiffuseHigh, *Younghyun Kim, *Geunmin Hwang, Junyu Zhang, Eunbyung Park.β70Dec 13, 2024Updated last year
- [BMVC 2023 (Oral)] Official pytorch implementation of the paper: "Unsupervised Hashing with Similarity Distribution Calibration"β23Sep 17, 2023Updated 2 years ago
- handy tools for user studyβ21May 21, 2024Updated last year
- Code repo for "SketchODE: Learning neural sketch representation in continuous time" published in ICLR 2022β11Apr 19, 2022Updated 3 years ago
- DreamDance: Personalized Text-to-video Generation by Combining Text-to-Image Synthesis and Motion Transferβ14Dec 16, 2022Updated 3 years ago
- PyTorch code and model checkpoints for Score identity Distillation (SiD) and its adversarial version (SiDA)β151Mar 29, 2025Updated 11 months ago
- [ICCV 2025] Code for FreeScale, a tuning-free method for higher-resolution visual generationβ149Oct 9, 2025Updated 5 months ago
- Official pytorch implementation of "AlphaFlow: Understanding and Improving MeanFlow Models"β111Oct 24, 2025Updated 4 months ago
- β11Jun 28, 2024Updated last year
- Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretrainiβ¦β643Oct 16, 2025Updated 5 months ago
- β109Nov 27, 2024Updated last year
- Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformerβ444Jul 5, 2024Updated last year
- Fine-Grained Visual Classification via Simultaneously Learning of Multi-regional Multi-grained Featuresβ12Mar 2, 2021Updated 5 years ago
- [TPAMI 2026] Enhancing MMDiT-Based Text-to-Image Models for Similar Subject Generationβ11Mar 7, 2026Updated 2 weeks ago
- Official repo for Detecting, Explaining, and Mitigating Memorization in Diffusion Models (ICLR 2024)β78Apr 3, 2024Updated last year
- Chinese-native image generation while compatible with SD eco-system, 1st-gen, AAAI2025β13Jun 25, 2024Updated last year
- PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)β534Sep 8, 2025Updated 6 months ago
- β18Aug 23, 2022Updated 3 years ago
- Official code for paper: Text-to-Image Rectified Flow as Plug-and-Play Priors [ICLR 2025]β140Apr 16, 2025Updated 11 months ago
- Code for FreeTraj, a tuning-free method for trajectory-controllable video generationβ111Sep 19, 2025Updated 6 months ago
- Lumina-T2X is a unified framework for Text to Any Modality Generationβ2,253Feb 16, 2025Updated last year
- ChiroDiff: Modelling chirographic data with Diffusion Modelsβ19Apr 11, 2023Updated 2 years ago
- The dataset CoLan-150K and the concept decomposition in the paper Concept Lancet (CVPR 2025)β20Jan 18, 2026Updated 2 months ago
- Official code repository for "Self-transcendence: Is External Feature Guidance Indispensable for Accelerating Diffusion Transformer Trainβ¦β28Updated this week
- [NeurIPS 2024 Spotlight] The official implement of research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"β138Oct 8, 2024Updated last year
- β415Mar 10, 2025Updated last year
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Methodβ27Oct 9, 2025Updated 5 months ago
- [DEPRECATED] Attempts to convert a Flux lora to a Chroma loraβ20Nov 9, 2025Updated 4 months ago
- [BMVC 2023 (Oral)] SketchDreamer: Interactive Text-Augmented Creative Sketch Ideationβ27Jun 8, 2025Updated 9 months ago
- [Neurips 2024] Video Diffusion Models are Training-free Motion Interpreter and Controllerβ50Aug 5, 2025Updated 7 months ago
- Visualization of DiT self attention featuresβ237Aug 12, 2024Updated last year
- [ICLR2026] "OFTSR: One-Step Flow for Image Super-Resolution with Tunable Fidelity-Realism Trade-offs"β39Feb 7, 2026Updated last month
- [CVPR 2023] SketchXAI: A First Look at Explainability for Human Sketchesβ26Mar 21, 2024Updated 2 years ago
- [NeurIPS 2024] π«CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matchingβ169Nov 18, 2024Updated last year
- [ICCV 2025] CHORDS: Diffusion Sampling Accelerator with Multi-core Hierarchical ODE Solversβ16Mar 3, 2026Updated 2 weeks ago
- [CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesisβ131May 16, 2025Updated 10 months ago
- VideoGPA is a self-supervised framework that enhances 3D consistency in Video Diffusion Models.β42Updated this week
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"β37Jan 21, 2025Updated last year