A light-weight and high-efficient training framework for accelerating diffusion tasks.
☆51Sep 14, 2024Updated last year
Alternatives and similar repositories for LiteGen
Users that are interested in LiteGen are comparing it to the libraries listed below
Sorting:
- Memory Efficient Training Framework for Large Video Generation Model☆25Apr 22, 2024Updated last year
- Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models☆917Mar 17, 2025Updated 11 months ago
- CasTex: Cascaded Text-to-Texture Synthesis via Explicit Texture Maps and Physically-Based Shading☆34Jan 21, 2026Updated last month
- Code for our ICCV 2025 paper "Adaptive Caching for Faster Video Generation with Diffusion Transformers"☆167Nov 5, 2024Updated last year
- The official implementation of ”RepVideo: Rethinking Cross-Layer Representation for Video Generation“☆124Jan 25, 2025Updated last year
- Differentiable forward warping from base image to match image using base disparity.☆13Jun 10, 2020Updated 5 years ago
- OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing☆38Jan 9, 2026Updated last month
- [ICLR 2025] FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality☆259Dec 27, 2024Updated last year
- Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation☆566Sep 16, 2024Updated last year
- ☆109Nov 27, 2024Updated last year
- Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text …☆14Nov 20, 2025Updated 3 months ago
- Syphus: Automatic Instruction-Response Generation Pipeline☆14Dec 14, 2023Updated 2 years ago
- [ACL2025 Oral & Award] Evaluate Image/Video Generation like Humans - Fast, Explainable, Flexible☆121Aug 10, 2025Updated 6 months ago
- Let's finetune video generation models!☆543Sep 15, 2025Updated 5 months ago
- Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆17Sep 3, 2024Updated last year
- Effort to open-source 10.5 trillion parameter Gemini model.☆17Dec 6, 2023Updated 2 years ago
- Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation☆17Apr 3, 2024Updated last year
- ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models☆91Sep 12, 2025Updated 5 months ago
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆18Apr 12, 2024Updated last year
- Un-*** 50 billions multimodality dataset☆23Sep 14, 2022Updated 3 years ago
- FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of …☆32Dec 21, 2024Updated last year
- Score identity Distillation with Long and Short Guidance for One-Step Text-to-Image Generation☆95Dec 4, 2025Updated 2 months ago
- [CVPR2024 Highlight] VBench - We Evaluate Video Generation☆1,496Feb 23, 2026Updated last week
- Official Pytorch implementation of "DBS: Dynamic Batch Size for Distributed Deep Neural Network Training"☆23Sep 30, 2021Updated 4 years ago
- MoviiGen 1.1: Towards Cinematic-Quality Video Generative Models☆183Jul 21, 2025Updated 7 months ago
- Text and image to video generation: Kandinsky 4.0 (2024)☆150Dec 17, 2024Updated last year
- ☆28Feb 11, 2025Updated last year
- Awesome autoregressive vision foundation models☆26Dec 24, 2024Updated last year
- Distributed IO-aware Attention algorithm☆24Sep 24, 2025Updated 5 months ago
- A simple Python tool to measure the performance of ONNX models.☆27Sep 15, 2024Updated last year
- Repository for 23'MM accepted paper "Curriculum-Listener: Consistency- and Complementarity-Aware Audio-Enhanced Temporal Sentence Groundi…☆52Dec 30, 2023Updated 2 years ago
- xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism☆2,549Updated this week
- ☆31Sep 1, 2025Updated 6 months ago
- Scalable and memory-optimized training of diffusion models☆1,338Jun 4, 2025Updated 8 months ago
- [NeurIPS 2024] Official Implementation of Attention Interpolation of Text-to-Image Diffusion☆107Nov 20, 2024Updated last year
- ICML2025, I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models☆193Sep 7, 2025Updated 5 months ago
- The official PyTorch Implementation of Charm: The Missing Piece in ViT fine-tuning for Image Aesthetic Assessment☆41Dec 4, 2025Updated 2 months ago
- ☆415Mar 10, 2025Updated 11 months ago
- (WIP) Parallel inference for black-forest-labs' FLUX model.☆19Nov 18, 2024Updated last year