Vchitect / LiteGen
A lightweight and highly efficient training framework for accelerating diffusion tasks.
☆50 Updated last year
Alternatives and similar repositories for LiteGen
Users that are interested in LiteGen are comparing it to the libraries listed below
- Code for our ICCV 2025 paper "Adaptive Caching for Faster Video Generation with Diffusion Transformers" ☆160 Updated last year
- ☆129 Updated 4 months ago
- minisora-DiT, a DiT reproduction based on XTuner from the open source community MiniSora ☆40 Updated last year
- An Efficient Text-to-Image Generation Pretrain Pipeline ☆119 Updated 7 months ago
- Implementation of SmoothCache, a project aimed at speeding up Diffusion Transformer (DiT) based GenAI models with error-guided caching. ☆45 Updated 4 months ago
- ☆108 Updated 11 months ago
- DiT for VAE (and Video Generation) ☆35 Updated last year
- Finetuning and inference tools for the CogView4 and CogVideoX model series. ☆104 Updated 6 months ago
- Official implementation of DiCache: Let Diffusion Model Determine Its Own Cache ☆52 Updated last month
- An official implementation of EvoSearch: Scaling Image and Video Generation via Test-Time Evolutionary Search ☆99 Updated last month
- [ICML 2025] Official Implementation of Hierarchical Masked Autoregressive Models with Low-Resolution Token Pivots ☆29 Updated 5 months ago
- Official implementation of our paper: "Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing" … ☆71 Updated 5 months ago
- [ICLR 2025] Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching ☆52 Updated 6 months ago
- The official implementation of our paper "Cockatiel: Ensembling Synthetic and Human Preferenced Training for Detailed Video Caption" ☆38 Updated 5 months ago
- ☆132 Updated last month
- T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation (ICCV'25) ☆34 Updated last month
- The official implementation of "Neighboring Autoregressive Modeling for Efficient Visual Generation" ☆58 Updated 7 months ago
- Image Tokenizer Needs Post-Training ☆24 Updated last month
- Code for Draft Attention ☆92 Updated 5 months ago
- 🐻 Uniform Discrete Diffusion with Metric Path for Video Generation ☆73 Updated last week
- This is the official repository for the paper "FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehe… ☆107 Updated 2 months ago
- Video-Infinity generates long videos quickly using multiple GPUs without extra training. ☆186 Updated last year
- OpenVideo specializes in the domain of text-to-video generation, with the goal of providing high-quality and diverse video datasets to AI… ☆109 Updated 5 months ago
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think! ☆120 Updated 8 months ago
- [CVPR 2025 AI4CC Workshop] Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editin… ☆35 Updated 6 months ago
- ☆66 Updated last year
- TerDiT: Ternary Diffusion Models with Transformers ☆71 Updated last year
- ☆56 Updated 6 months ago
- [NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching ☆116 Updated last year
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis ☆86 Updated last year