kabachuha / OpenMMDiT
Open(MM)DiT: An Easy, Fast and Memory-Efficient System for (MM)DiT Training and Inference
☆21Updated 8 months ago
Related projects
Alternatives and complementary repositories for OpenMMDiT
- ☆40Updated this week
- Official PyTorch implementation of the paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stitching"☆96Updated 8 months ago
- Rectified Diffusion: Straightness Is Not Your Need☆128Updated last week
- FORA introduces a simple yet effective caching mechanism in the Diffusion Transformer architecture for faster inference sampling.☆28Updated 4 months ago
- A big_vision-inspired repo that implements a generic Auto-Encoder class capable of representation learning and generative modeling.☆30Updated 4 months ago
- Adaptive Caching for Faster Video Generation with Diffusion Transformers☆96Updated 2 weeks ago
- TerDiT: Ternary Diffusion Models with Transformers☆62Updated 5 months ago
- ☆26Updated 6 months ago
- Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding☆23Updated 2 weeks ago
- Implementation of the proposed MaskBit from Bytedance AI☆62Updated last week
- ☆35Updated 5 months ago
- Official implementation of "Is This Loss Informative? Faster Text-to-Image Customization by Tracking Objective Dynamics" (NeurIPS 2023)☆37Updated last year
- ☆105Updated 8 months ago
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆26Updated 6 months ago
- "SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow", Yuanzhi Zhu, Xingchao Liu, Qiang Liu☆40Updated 3 weeks ago
- Code Release for the paper "Make-A-Story: Visual Memory Conditioned Consistent Story Generation" in CVPR 2023☆37Updated last year
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers☆34Updated last month
- DREAM: Diffusion Rectification and Estimation-Adaptive Models (CVPR 2024)☆34Updated 5 months ago
- Gradient-Free Textual Inversion for Personalized Text-to-Image Generation☆38Updated last year
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆78Updated 7 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆84Updated 4 months ago
- The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign)☆57Updated last month
- ☆57Updated 4 months ago
- [NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching☆75Updated 4 months ago
- HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing☆75Updated 7 months ago
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).☆61Updated 5 months ago
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NeurIPS 2024]☆36Updated this week
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"☆60Updated 6 months ago
- Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner"☆60Updated last month
- [WACV 2025] MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning☆66Updated 3 weeks ago