xuyang-liu16 / Awesome-Generation-Acceleration
📚 Collection of awesome generation acceleration resources.
☆39Updated this week
Related projects ⓘ
Alternatives and complementary repositories for Awesome-Generation-Acceleration
- Accelerating Diffusion Transformers with Token-wise Feature Caching☆19Updated this week
- This is a repo to track the latest autoregressive visual generation papers.☆43Updated last month
- 🔥ImageFolder: Autoregressive Image Generation with Folded Tokens☆53Updated 3 weeks ago
- [NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching☆71Updated 3 months ago
- The collection of awesome papers on alignment of diffusion models.☆45Updated last week
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆83Updated 3 months ago
- The paper collections for the autoregressive models in vision.☆101Updated this week
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations☆118Updated 4 months ago
- ☆110Updated 4 months ago
- Official GitHub repository for the Text-Guided Video Editing (TGVE) competition of LOVEU Workshop @ CVPR'23.☆72Updated last year
- CAR: Controllable AutoRegressive Modeling for Visual Generation☆48Updated last month
- SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training☆161Updated 3 weeks ago
- Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference" proposed by Pekin…☆52Updated 3 weeks ago
- The official implementation of "Adapter is All You Need for Tuning Visual Tasks".☆72Updated 2 months ago
- [NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers"☆111Updated last month
- Implements VAR+CLIP for image generation☆78Updated 3 months ago
- a collection of awesome autoregressive visual generation models☆39Updated last week
- [ECCV 2024] Official PyTorch implementation of DreamLIP: Language-Image Pre-training with Long Captions☆106Updated last week
- Unified Multi-modal IAA Baseline and Benchmark☆70Updated last month
- ☆97Updated last month
- [NeurIPS 2023] Structural Pruning for Diffusion Models☆163Updated 4 months ago
- T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation☆46Updated 2 months ago
- Official Implementations "Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models" (ICLR2024)☆41Updated last month
- 🔥Official PyTorch implementation for "LM4LV: A Frozen Large Language Model for Low-level Vision Tasks".☆41Updated 5 months ago
- ☆32Updated last month
- This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…☆21Updated 3 weeks ago
- 🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook☆40Updated 4 months ago
- Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generation☆25Updated 2 months ago
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression"☆76Updated last month
- Adaptive Caching for Faster Video Generation with Diffusion Transformers☆79Updated last week