EvelynZhang-epiclab / SiTo
[AAAI-2025] The offical code for SiTo (Similarity-based Token Pruning for Stable Diffusion Models)
☆23Updated 3 months ago
Alternatives and similar repositories for SiTo:
Users that are interested in SiTo are comparing it to the libraries listed below
- 📚 Collection of token reduction for model compression resources.☆51Updated last week
- [CVPR2025] FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual Compression☆36Updated last month
- [CVPR 2025 Highlight] TinyFusion: Diffusion Transformers Learned Shallow☆105Updated 2 weeks ago
- [ICASSP 2024] VGDiffZero: Text-to-image Diffusion Models Can Be Zero-shot Visual Grounders☆17Updated 2 months ago
- Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference".☆92Updated last month
- [CVPR 2025] The official implementation of "CacheQuant: Comprehensively Accelerated Diffusion Models"☆20Updated 2 weeks ago
- ☆26Updated 10 months ago
- Official code for paper: [CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Faster.☆70Updated 4 months ago
- Official repository of InLine attention (NeurIPS 2024)☆45Updated 4 months ago
- From Reusing to Forecasting: Accelerating Diffusion Models with TaylorSeers☆71Updated 3 weeks ago
- (ToCa-v2) A New version of ToCa,with faster speed and better acceleration!☆31Updated last month
- Accelerating Diffusion Transformers with Token-wise Feature Caching☆130Updated last month
- [CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient☆98Updated 3 weeks ago
- CAR: Controllable AutoRegressive Modeling for Visual Generation☆114Updated 4 months ago
- 📚 Collection of awesome generation acceleration resources.☆202Updated this week
- ☆69Updated 3 weeks ago
- [CVPR 2025 (Oral)] Open implementation of "RandAR"☆107Updated last month
- This is the official implementation for ControlVAR.☆102Updated 4 months ago
- A collection of diffusion models based on FLUX/DiT for image/video generation, editing, reconstruction, inpainting .etc.☆40Updated this week
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations☆140Updated 2 months ago
- ☆73Updated 5 months ago
- Generate one 2K image on single 3090 GPU!☆24Updated 2 weeks ago
- 🔥Official PyTorch implementation for "LM4LV: A Frozen Large Language Model for Low-level Vision Tasks".☆50Updated 10 months ago
- Official implementation of Unified Reward Model for Multimodal Understanding and Generation.☆240Updated last week
- Implements VAR+CLIP for text-to-image (T2I) generation☆135Updated 2 months ago
- List of diffusion related active submissions on OpenReview for ICLR 2025.☆22Updated 5 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.☆173Updated last week
- This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generat…☆183Updated last month
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".☆312Updated last month
- [NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching☆101Updated 9 months ago