sunilhoho / EVEREST
Official Pytorch implementation of EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens [ICML2024].
☆21 Updated 5 months ago
Related projects
Alternatives and complementary repositories for EVEREST
- [AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral) ☆38 Updated 7 months ago
- [CVPR 2023] Learning Attention as Disentangler for Compositional Zero-shot Learning ☆40 Updated last year
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023) ☆37 Updated 11 months ago
- Official Pytorch implementation of "Improved Probabilistic Image-Text Representations" (ICLR 2024) ☆51 Updated 5 months ago
- Compress conventional Vision-Language Pre-training data ☆49 Updated last year
- [ICCV 2023] Prompt-aligned Gradient for Prompt Tuning ☆151 Updated last year
- Official implementation of CVPR 2024 paper "vid-TLDR: Training Free Token merging for Light-weight Video Transformer". ☆37 Updated 6 months ago
- Official repository for "CLIP model is an Efficient Continual Learner". ☆81 Updated last year
- [ICLR 2023] The official code for our ICLR 2023 (top25%) paper: "Neural Collapse Inspired Feature-Classifier Alignment for Few-Shot Class… ☆81 Updated last year
- [ECCV-2022] Grounding Visual Representations with Texts for Domain Generalization ☆31 Updated last year
- Official repository for "Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting" [CVPR 2023] ☆108 Updated last year
- Official implementation of CVPR 2024 paper "Prompt Learning via Meta-Regularization". ☆25 Updated 3 months ago
- Preventing Zero-Shot Transfer Degradation in Continual Learning of Vision-Language Models ☆81 Updated 8 months ago
- [ICLR 2024] Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models. ☆56 Updated 3 months ago
- Source code for NeurIPS'23 paper "Dream the Impossible: Outlier Imagination with Diffusion Models" ☆61 Updated 3 weeks ago
- [ICLR2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models ☆146 Updated 11 months ago
- Official implementation for CVPR'23 paper "BlackVIP: Black-Box Visual Prompting for Robust Transfer Learning" ☆105 Updated last year
- [CVPR 2024] Do you remember? Dense Video Captioning with Cross-Modal Memory Retrieval ☆45 Updated 5 months ago
- Code for "Multitask Vision-Language Prompt Tuning" https://arxiv.org/abs/2211.11720 ☆54 Updated 5 months ago
- [NeurIPS 2023] Meta-Adapter ☆40 Updated last year
- Code for "Class-Incremental Learning for Action Recognition in Videos", ICCV 2021 ☆19 Updated 2 years ago
- Official Implementation of "Read-only Prompt Optimization for Vision-Language Few-shot Learning", ICCV 2023 ☆52 Updated last year
- Repository for the paper: Teaching Structured Vision & Language Concepts to Vision & Language Models ☆45 Updated last year
- Official Implementation of LADS (Latent Augmentation using Domain descriptionS) ☆49 Updated last year
- [arXiv] Cross-Modal Adapter for Text-Video Retrieval ☆55 Updated 2 years ago
- Official code of "Generating Instance-level Prompts for Rehearsal-free Continual Learning (ICCV 2023)" ☆42 Updated last year