EvelynZhang-epiclab / SiTo
[AAAI-2025] The offical code for SiTo (Similarity-based Token Pruning for Stable Diffusion Models)
☆22Updated 2 months ago
Alternatives and similar repositories for SiTo:
Users that are interested in SiTo are comparing it to the libraries listed below
- (ToCa-v2) A New version of ToCa,with faster speed and better acceleration!☆30Updated 2 weeks ago
- From Reusing to Forecasting: Accelerating Diffusion Models with TaylorSeers☆50Updated last week
- ☆25Updated 9 months ago
- 📚 Collection of token reduction for model compression resources.☆47Updated this week
- 📚 Collection of awesome generation acceleration resources.☆182Updated this week
- [ICASSP 2024] VGDiffZero: Text-to-image Diffusion Models Can Be Zero-shot Visual Grounders☆15Updated last month
- Official repository of InLine attention (NeurIPS 2024)☆44Updated 3 months ago
- [CVPR 2025] The official implementation of "CacheQuant: Comprehensively Accelerated Diffusion Models"☆19Updated 3 weeks ago
- Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference".☆83Updated 3 weeks ago
- [CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient☆92Updated 3 weeks ago
- Accelerating Diffusion Transformers with Token-wise Feature Caching☆115Updated 2 weeks ago
- 🔥Official PyTorch implementation for "LM4LV: A Frozen Large Language Model for Low-level Vision Tasks".☆48Updated 9 months ago
- [CVPR2025] FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual Compression☆33Updated last month
- ☆59Updated this week
- Official code for paper: [CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Faster.☆59Updated 3 months ago
- The official implementation of "Adapter is All You Need for Tuning Visual Tasks".☆100Updated 3 weeks ago
- PyTorch code for our paper "ARB-LLM: Alternating Refined Binarizations for Large Language Models"☆24Updated last week
- ☆70Updated 4 months ago
- [AAAI 2025] Linear-complexity Visual Sequence Learning with Gated Linear Attention☆109Updated 9 months ago
- Implements VAR+CLIP for text-to-image (T2I) generation☆131Updated 2 months ago
- (CVPR2024) Official implementation of paper: "Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model"☆145Updated 3 months ago
- A collection of diffusion models based on FLUX/DiT for image/video generation, editing, reconstruction, inpainting .etc.☆35Updated this week
- [CVPR 2025] Official code of "From Zero to Detail: Deconstructing Ultra-High-Definition Image Restoration from Progressive Spectral Persp…☆27Updated this week
- Official implementation of Unified Reward Model for Multimodal Understanding and Generation.☆225Updated this week
- Official repository of the paper "High-Quality Mask Tuning Matters for Open-Vocabulary Segmentation"☆21Updated this week
- Official repository of Polarity-aware Linear Attention for Vision Transformers (ICLR 2025)☆56Updated last month
- This is the official pytorch implementation for the paper: Towards Accurate Post-training Quantization for Diffusion Models.(CVPR24 Poste…☆34Updated 9 months ago
- This is the official implementation for ControlVAR.☆101Updated 3 months ago
- Code for "Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More"☆26Updated this week
- ✈️ Accelerating Vision Diffusion Transformers with Skip Branches.☆62Updated this week