SwayStar123 / SpeedrunDiT
SR-DiT Speedrunning ImageNet Diffusion
☆65Updated this week
Alternatives and similar repositories for SpeedrunDiT
Users who are interested in SpeedrunDiT are comparing it to the libraries listed below.
- Inference-time scaling of diffusion-based image and video generation models.☆172Updated 5 months ago
- Official implementation of Inductive Moment Matching☆566Updated 5 months ago
- UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, a…☆132Updated 8 months ago
- Train VAE like a boss☆307Updated last year
- [NeurIPS 2024] ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization☆159Updated 3 months ago
- DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space☆310Updated 2 months ago
- Official Implementation of weights2weights☆152Updated 9 months ago
- 🦾 EvalGIM (pronounced as "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automatic…☆89Updated 11 months ago
- ☆162Updated 2 months ago
- PixNerd: Pixel Neural Field Diffusion☆138Updated last week
- [CVPR2025] PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Noise-Free Framework for Cross-Mo…☆324Updated 6 months ago
- ☆211Updated 10 months ago
- DDT: Decoupled Diffusion Transformer☆333Updated 3 months ago
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆304Updated 9 months ago
- FlexTok: Resampling Images into 1D Token Sequences of Flexible Length☆277Updated 6 months ago
- [ECCV 2024, Oral] FMBoost: Boosting Latent Diffusion with Flow Matching☆254Updated 2 months ago
- This repo provides a working re-implementation of Latent Adversarial Diffusion Distillation by AMD☆120Updated 5 months ago
- (CVPR 2025) Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis☆200Updated 5 months ago
- Krea Realtime 14B. An open-source realtime AI video model.☆423Updated last month
- Minimal Differentiable Image Reward Functions☆106Updated 4 months ago
- AlignProp uses direct reward backpropagation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆307Updated last year
- ☆172Updated 3 months ago
- Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"☆310Updated 11 months ago
- A Video Tokenizer Evaluation Dataset☆141Updated 11 months ago
- [NeurIPS 2025] Training-Free Efficient Video Generation via Dynamic Token Carving☆257Updated 4 months ago
- [ICML 2025] Official PyTorch Implementation of "History-Guided Video Diffusion"☆574Updated 5 months ago
- PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)☆529Updated 3 months ago
- [AAAI 2026] VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation☆353Updated 8 months ago
- HART: Efficient Visual Generation with Hybrid Autoregressive Transformer☆645Updated last year
- Official GitHub repo for the NeurIPS 2024 paper "Immiscible Diffusion: Accelerating Diffusion Training with Noise Assignment"☆61Updated 6 months ago