JoakimHaurum / ATC
Official PyTorch implementation of Agglomerative Token Clustering presented at ECCV 2024
☆12Updated 3 months ago
Alternatives and similar repositories for ATC:
Users that are interested in ATC are comparing it to the libraries listed below
- Official PyTorch implementation of Which Tokens to Use? Investigating Token Reduction in Vision Transformers presented at ICCV 2023 NIVT …☆32Updated last year
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023)☆38Updated last year
- ☆21Updated last year
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆70Updated last year
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions☆59Updated 8 months ago
- Learning 1D Causal Visual Representation with De-focus Attention Networks☆32Updated 7 months ago
- The official implementation of "2024NeurIPS Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation"☆38Updated last week
- ☆32Updated last year
- [ICCV 2023 oral] This is the official repository for our paper: ''Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning''.☆65Updated last year
- ☆22Updated last year
- Official implementation of CVPR 2024 paper "Multi-criteria Token Fusion with One-step-ahead Attention for Efficient Vision Transformers".☆29Updated 8 months ago
- Official pytorch implementation of NeurIPS 2022 paper, TokenMixup☆48Updated 2 years ago
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆73Updated 4 months ago
- [ICCV'2023 Oral] Implicit Temporal Modeling with Learnable Alignment for Video Recognition☆32Updated last year
- PyTorch reimplementation of "A simple, efficient and scalable contrastive masked autoencoder for learning visual representations".☆38Updated 2 years ago
- Code for Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking☆21Updated 3 months ago
- [CVPR-22] This is the official implementation of the paper "Adavit: Adaptive vision transformers for efficient image recognition".☆50Updated 2 years ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆71Updated 4 months ago
- ☆19Updated 8 months ago
- [CVPR 2023] Bridging Precision and Confidence: A Train-Time Loss for Calibrating Object Detection☆31Updated last year
- [AAAI2024] Code Release of CLIM: Contrastive Language-Image Mosaic for Region Representation☆26Updated 11 months ago
- Official implementation of NeurIPS 2024 "Visual Fourier Prompt Tuning"☆21Updated last week
- This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision☆68Updated 6 months ago
- [CVPR 2023] RILS: Masked Visual Reconstruction in Language Semantic Space (https://arxiv.org/abs/2301.06958)☆44Updated last year
- Official Implementation of "Semantics-Consistent Feature Search for Self-Supervised Visual Representation Learning" in AAAI2024.☆13Updated 10 months ago
- [CVPR 2024] VkD : Improving Knowledge Distillation using Orthogonal Projections☆50Updated 2 months ago
- [CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners☆41Updated last year
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆46Updated last year
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆65Updated 2 months ago