[ICCV 23]An approach to enhance the efficiency of Vision Transformer (ViT) by concurrently employing token pruning and token merging techniques, while incorporating a differentiable compression rate.
☆104Jul 14, 2023Updated 2 years ago
Alternatives and similar repositories for DiffRate
Users that are interested in DiffRate are comparing it to the libraries listed below
Sorting:
- Official implementation of "SViT: Revisiting Token Pruning for Object Detection and Instance Segmentation"☆37Dec 5, 2023Updated 2 years ago
- ☆48Aug 7, 2023Updated 2 years ago
- Python code for ICLR 2022 spotlight paper EViT: Expediting Vision Transformers via Token Reorganizations☆199Sep 3, 2023Updated 2 years ago
- Official implement of Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer☆74Jul 13, 2022Updated 3 years ago
- [ICCV 23]This is a Pytorch implementation of our paper "SMMix: Self-Motivated Image Mixing for Vision Transformers"☆16Jul 14, 2023Updated 2 years ago
- Code for Learned Thresholds Token Merging and Pruning for Vision Transformers (LTMP). A technique to reduce the size of Vision Transforme…☆17Nov 24, 2024Updated last year
- A method to increase the speed and lower the memory footprint of existing vision transformers.☆1,170Jun 17, 2024Updated last year
- Official PyTorch implementation of A-ViT: Adaptive Tokens for Efficient Vision Transformer (CVPR 2022)☆166Jul 14, 2022Updated 3 years ago
- ☆28Nov 29, 2022Updated 3 years ago
- Github repository for the paper Which Transformer to Favor: A Comparative Analysis of Efficiency in Vision Transformers.☆33Updated this week
- ☆15Feb 28, 2023Updated 3 years ago
- LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models☆164Sep 27, 2025Updated 5 months ago
- ☆53Aug 28, 2024Updated last year
- ☆17Jul 10, 2022Updated 3 years ago
- [ICML 2024] CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers☆34Dec 30, 2024Updated last year
- Pytorch implementation of our paper accepted by IEEE TNNLS, 2022 — Carrying out CNN Channel Pruning in a White Box☆18Feb 15, 2022Updated 4 years ago
- Pytorch implementation of our paper accepted by TPAMI 2023 — Lottery Jackpots Exist in Pre-trained Models☆35Jun 19, 2023Updated 2 years ago
- Adaptive Token Sampling for Efficient Vision Transformers (ECCV 2022 Oral Presentation)☆104May 3, 2024Updated last year
- [ICLR 2022] "Unified Vision Transformer Compression" by Shixing Yu*, Tianlong Chen*, Jiayi Shen, Huan Yuan, Jianchao Tan, Sen Yang, Ji Li…☆55Dec 1, 2023Updated 2 years ago
- Official PyTorch implementation of Which Tokens to Use? Investigating Token Reduction in Vision Transformers presented at ICCV 2023 NIVT …☆35Aug 10, 2023Updated 2 years ago
- [CVPR-22] This is the official implementation of the paper "Adavit: Adaptive vision transformers for efficient image recognition".☆56Aug 18, 2022Updated 3 years ago
- Official implementation of CVPR 2024 paper "Multi-criteria Token Fusion with One-step-ahead Attention for Efficient Vision Transformers".☆40Jul 30, 2025Updated 7 months ago
- ☆10Apr 14, 2020Updated 5 years ago
- The official implementation of "2024NeurIPS Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation"☆53Dec 30, 2024Updated last year
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity"☆74Nov 15, 2022Updated 3 years ago
- [CVPR2023] Practical Network Acceleration with Tiny Sets☆14Jul 28, 2023Updated 2 years ago
- The official PyTorch implementation of the NeurIPS2022 (spotlight) paper, Outlier Suppression: Pushing the Limit of Low-bit Transformer L…☆49Oct 5, 2022Updated 3 years ago
- [NeurIPS 2020] "FracTrain: Fractionally Squeezing Bit Savings Both Temporally and Spatially for Efficient DNN Training" by Yonggan Fu, Ha…☆10Feb 13, 2022Updated 4 years ago
- This repository contains the code for the paper "TaylorShift: Shifting the Complexity of Self-Attention from Squared to Linear (and Back)…☆13Updated this week
- Pytorch implementation of our paper accepted by ECCV 2022 -- ARM: Any-Time Super-Resolution Method (https://arxiv.org/abs/2203.10812)☆82Sep 28, 2022Updated 3 years ago
- This repository contains papers for a comprehensive survey on accelerated generation techniques in Large Language Models (LLMs).☆11May 24, 2024Updated last year
- [ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models☆35Jun 12, 2024Updated last year
- [ICLR2025] Accelerating Diffusion Transformers with Token-wise Feature Caching☆210Mar 14, 2025Updated 11 months ago
- Adapting LLaMA Decoder to Vision Transformer☆30May 20, 2024Updated last year
- PiX: Dynamic Channel Sampling for ConvNets (CVPR 2024)☆14Jun 14, 2024Updated last year
- Champion Solutions repository for Perception Test challenges in ICCV2023 workshop.☆14Oct 18, 2023Updated 2 years ago
- Implementation of "PaLM2-VAdapter:" from the multi-modal model paper: "PaLM2-VAdapter: Progressively Aligned Language Model Makes a Stron…☆17Nov 11, 2024Updated last year
- Video + CLIP Baseline for Ego4D Long Term Action Anticipation Challenge (CVPR 2022)☆15Jul 4, 2022Updated 3 years ago
- Multi-level Consistency Learning for Semi-supervised Domain Adaptation, IJCAI 2022☆14Aug 31, 2022Updated 3 years ago