uzh-rpg / svitLinks
Official implementation of "SViT: Revisiting Token Pruning for Object Detection and Instance Segmentation"
☆32Updated last year
Alternatives and similar repositories for svit
Users that are interested in svit are comparing it to the libraries listed below
Sorting:
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆79Updated 3 months ago
- [ICCV 2023] Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment☆43Updated last year
- [ECCV 2024 Workshop Best Paper Award] Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion☆34Updated 9 months ago
- [ICLR 2025] Official PyTorch implementation of "DECO: Query-Based End-to-End Object Detection with ConvNets"☆53Updated 5 months ago
- Official PyTorch implementation of Which Tokens to Use? Investigating Token Reduction in Vision Transformers presented at ICCV 2023 NIVT …☆34Updated last year
- Video Feature Enhancement with PyTorch☆31Updated 7 months ago
- [CVPR 2023] Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention During Vision Transformer Inference☆30Updated last year
- Segment Anything with Deictic Prompting☆27Updated 2 months ago
- (ICCV 2023) Vision Transformer Adapters for Generalizable Multitask Learning☆19Updated last year
- [AAAI 2023 Oral] CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets☆36Updated 10 months ago
- [ICCV 2023] Deep Equilibrium Object Detection☆25Updated 3 weeks ago
- [CVPR'23] SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer☆72Updated last year
- [ICCV23] Official Implementation of CMDA: Cross-Modality Domain Adaptation for Nighttime Semantic Segmentation☆32Updated 7 months ago
- ☆34Updated last year
- ☆30Updated last year
- Official Code of CVPR'23 Paper "VLPD: Context-Aware Pedestrian Detection via Vision-Language Semantic Self-Supervision"☆22Updated last year
- [ICCV'23] Cascade-DETR: Delving into High-Quality Universal Object Detection☆98Updated last year
- [ICCV 23]An approach to enhance the efficiency of Vision Transformer (ViT) by concurrently employing token pruning and token merging tech…☆99Updated 2 years ago
- [ECCV'24 Oral] Anytime Continual Learning for Open Vocabulary Classification☆20Updated 9 months ago
- ☆77Updated last year
- Official implementation of the WACV 2024 paper CLIP-DIY☆33Updated last year
- [ECCV 2024] Beyond MOT: Semantic Multi-Object Tracking☆30Updated 10 months ago
- Official Pytorch implementation of Dynamic-Token-Pruning (ICCV2023)☆21Updated last year
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆37Updated last year
- [ICCV2023] DETRDistill: A Universal Knowledge Distillation Framework for DETR-families☆57Updated last year
- ☆35Updated last year
- Official implementation of CVPR 2024 paper "Multi-criteria Token Fusion with One-step-ahead Attention for Efficient Vision Transformers".☆39Updated last year
- [NeurIPS 2024] official code release for our paper "Revisiting the Integration of Convolution and Attention for Vision Backbone".☆40Updated 5 months ago
- Exploring Relational Context for Multi-Task Dense Prediction [ICCV 2021]☆51Updated last year
- (ICCV 2023) MasQCLIP for Open-Vocabulary Universal Image Segmentation☆38Updated last year