Vchitect / LiteGen
A lightweight, highly efficient training framework for accelerating diffusion tasks.
☆50 Updated last year
Alternatives and similar repositories for LiteGen
Users who are interested in LiteGen are comparing it to the libraries listed below.
- Code for our ICCV 2025 paper "Adaptive Caching for Faster Video Generation with Diffusion Transformers" ☆161 Updated last year
- ☆130 Updated 5 months ago
- An Efficient Text-to-Image Generation Pretrain Pipeline ☆120 Updated 7 months ago
- minisora-DiT, a DiT reproduction based on XTuner from the open source community MiniSora ☆40 Updated last year
- Implementation of SmoothCache, a project aimed at speeding up Diffusion Transformer (DiT) based GenAI models with error-guided caching. ☆46 Updated 4 months ago
- ☆108 Updated last year
- Finetuning and inference tools for the CogView4 and CogVideoX model series. ☆108 Updated 6 months ago
- DiT for VAE (and Video Generation) ☆35 Updated last year
- Code for Draft Attention ☆94 Updated 6 months ago
- An official implementation of EvoSearch: Scaling Image and Video Generation via Test-Time Evolutionary Search ☆100 Updated 2 months ago
- Official implementation of our paper: "Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing" … ☆74 Updated 6 months ago
- [ICML 2025] Official Implementation of Hierarchical Masked Autoregressive Models with Low-Resolution Token Pivots ☆29 Updated 6 months ago
- [ICLR 2025] Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching ☆52 Updated 7 months ago
- The official implementation of our paper "Cockatiel: Ensembling Synthetic and Human Preferenced Training for Detailed Video Caption" ☆38 Updated 6 months ago
- ☆67 Updated last year
- Official implementation of DiCache: Let Diffusion Model Determine Its Own Cache ☆52 Updated 2 months ago
- Official Implementation of "LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis" ☆75 Updated 3 months ago
- [CVPR 2025 AI4CC Workshop] Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editin… ☆35 Updated 7 months ago
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think! ☆120 Updated 9 months ago
- The official implementation of "Neighboring Autoregressive Modeling for Efficient Visual Generation" ☆58 Updated 8 months ago
- Official implementation of paper "VMoBA: Mixture-of-Block Attention for Video Diffusion Models" ☆55 Updated 5 months ago
- [NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching ☆116 Updated last year
- [ICLR 2025] FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality ☆254 Updated 11 months ago
- Video-Infinity generates long videos quickly using multiple GPUs without extra training. ☆188 Updated last year
- ☆75 Updated last month
- TerDiT: Ternary Diffusion Models with Transformers ☆72 Updated last year
- ☆135 Updated last month
- [NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising ☆208 Updated 2 months ago
- The official implementation of Distribution Backtracking Distillation for One-step Diffusion Models ☆32 Updated 10 months ago
- FORA introduces a simple yet effective caching mechanism in the Diffusion Transformer architecture for faster inference sampling. ☆52 Updated last year