NUS-HPC-AI-Lab / SpeeD
SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training
☆177Updated last month
Alternatives and similar repositories for SpeeD:
Users that are interested in SpeeD are comparing it to the libraries listed below
- [ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation☆253Updated 2 weeks ago
- [NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching☆97Updated 8 months ago
- The collection of awesome papers on alignment of diffusion models.☆138Updated 2 weeks ago
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations☆137Updated last month
- PyTorch code and model checkpoints for Score identity Distillation (SiD) and its adversarial version (SiDA)☆104Updated this week
- This is a repo to track the latest autoregressive visual generation papers.☆164Updated this week
- Adaptive Caching for Faster Video Generation with Diffusion Transformers☆142Updated 4 months ago
- MoVQGAN - model for the image encoding and reconstruction☆223Updated last year
- Scaling Diffusion Transformers with Mixture of Experts☆293Updated 6 months ago
- Official implementation of "Controlling Text-to-Image Diffusion by Orthogonal Finetuning".☆287Updated 4 months ago
- official code for Diff-Instruct algorithm for one-step diffusion distillation☆70Updated 2 months ago
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression"☆84Updated 5 months ago
- FORA introduces simple yet effective caching mechanism in Diffusion Transformer Architecture for faster inference sampling.☆39Updated 8 months ago
- [ICLR 2025] Rectified Diffusion: Straightness Is Not Your Need☆195Updated last week
- [CVPR 2024] On the Content Bias in Fréchet Video Distance☆103Updated 5 months ago
- My Implementation of Adversarial Diffusion Distillation https://arxiv.org/pdf/2311.17042.pdf☆64Updated 3 months ago
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]☆79Updated last month
- [CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models☆161Updated 5 months ago
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆276Updated 4 months ago
- VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation☆176Updated last month
- An in-context conditioning version of MUSE with pre-trained checkpoints.☆111Updated last year
- Score identity Distillation with Long and Short Guidance for One-Step Text-to-Image Generation☆52Updated this week
- ☆189Updated last month
- Implementation of "DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-Tuning"☆85Updated last year
- ☆144Updated 3 months ago
- GenEval: An object-focused framework for evaluating text-to-image alignment☆195Updated 2 weeks ago