Shenyi-Z / ToCa
Accelerating Diffusion Transformers with Token-wise Feature Caching
☆19 · Updated this week
Related projects
Alternatives and complementary repositories for ToCa
- 📚 Collection of awesome generation acceleration resources. ☆39 · Updated this week
- [NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching ☆71 · Updated 3 months ago
- ☆97 · Updated last month
- This is a repo to track the latest autoregressive visual generation papers. ☆43 · Updated last month
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis ☆83 · Updated 3 months ago
- 🔥ImageFolder: Autoregressive Image Generation with Folded Tokens ☆53 · Updated 3 weeks ago
- The collection of awesome papers on alignment of diffusion models. ☆45 · Updated last week
- FORA introduces a simple yet effective caching mechanism in the Diffusion Transformer architecture for faster inference sampling. ☆28 · Updated 4 months ago
- Adaptive Caching for Faster Video Generation with Diffusion Transformers ☆79 · Updated last week
- Unified Multi-modal IAA Baseline and Benchmark ☆70 · Updated last month
- [CVPR 2024 Highlight] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Mo… ☆55 · Updated 3 months ago
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations ☆118 · Updated 4 months ago
- a collection of awesome autoregressive visual generation models ☆39 · Updated last week
- Official GitHub repository for the Text-Guided Video Editing (TGVE) competition of LOVEU Workshop @ CVPR'23. ☆72 · Updated last year
- CAR: Controllable AutoRegressive Modeling for Visual Generation ☆48 · Updated last month
- ICCV2023-Diffusion-Papers ☆110 · Updated last year
- PyTorch code for Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers ☆33 · Updated 2 months ago
- [ECCV 2024] Official PyTorch implementation of DreamLIP: Language-Image Pre-training with Long Captions ☆106 · Updated last week
- SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training ☆161 · Updated 3 weeks ago
- Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference" proposed by Pekin… ☆52 · Updated 3 weeks ago
- The paper collections for the autoregressive models in vision. ☆101 · Updated this week
- [NeurIPS 2023] Structural Pruning for Diffusion Models ☆163 · Updated 4 months ago
- ☆32 · Updated last month
- LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models ☆98 · Updated 5 months ago
- ☆41 · Updated 7 months ago
- 🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook ☆40 · Updated 4 months ago
- 🔥Official PyTorch implementation for "LM4LV: A Frozen Large Language Model for Low-level Vision Tasks". ☆41 · Updated 5 months ago
- ☆16 · Updated last year
- DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception ☆116 · Updated last month
- Official repo for 【FaceScore: Benchmarking and Enhancing Face Quality in Human Generation】 ☆58 · Updated 3 weeks ago