sunilhoho / EVEREST
Official PyTorch implementation of EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens [ICML 2024].
☆21Updated 3 months ago
Related projects:
- [CVPR 2023] Learning Attention as Disentangler for Compositional Zero-shot Learning☆36Updated last year
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023)☆36Updated 9 months ago
- Official implementation for CVPR'23 paper "BlackVIP: Black-Box Visual Prompting for Robust Transfer Learning"☆102Updated last year
- Test-time Prompt Tuning (TPT) for zero-shot generalization in vision-language models (NeurIPS 2022)☆136Updated last year
- [ICLR2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models☆137Updated 9 months ago
- Official repository for "Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting" [CVPR 2023]☆104Updated last year
- Official code of "Generating Instance-level Prompts for Rehearsal-free Continual Learning (ICCV 2023)"☆40Updated 10 months ago
- Official Implementation of paper: PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category Discovery…☆39Updated last year
- [AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral)☆36Updated 5 months ago
- Compress conventional Vision-Language Pre-training data☆49Updated 11 months ago
- [ICLR 2023] The official code for our ICLR 2023 (top25%) paper: "Neural Collapse Inspired Feature-Classifier Alignment for Few-Shot Class…☆80Updated last year
- [ICCV 2023] Prompt-aligned Gradient for Prompt Tuning☆146Updated last year
- Repository for the paper: Teaching Structured Vision & Language Concepts to Vision & Language Models☆44Updated 11 months ago
- Official Implementation of LADS (Latent Augmentation using Domain descriptionS)☆49Updated last year
- Code for "Class-Incremental Learning for Action Recognition in Videos", ICCV 2021☆18Updated last year
- [ICLR 2024] Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models.☆51Updated last month
- [NeurIPS 2023 (Spotlight)] Uncovering the Hidden Dynamics of Video Self-supervised Learning under Distribution Shifts☆13Updated 7 months ago
- Code for "BECoTTA: Input-dependent Online Blending of Experts for Continual Test-time Adaptation [ICML2024]".☆32Updated 3 months ago
- Official repository of "Back to Source: Diffusion-Driven Test-Time Adaptation"☆69Updated 9 months ago
- Official implementation of CVPR 2024 paper "Prompt Learning via Meta-Regularization".☆22Updated 3 weeks ago
- [ECCV-2022] Grounding Visual Representations with Texts for Domain Generalization☆32Updated last year
- Task Residual for Tuning Vision-Language Models (CVPR 2023)☆65Updated last year
- [arXiv] Cross-Modal Adapter for Text-Video Retrieval☆53Updated last year
- [ECCV 2024] Official PyTorch implementation of TC-CLIP "Leveraging Temporal Contextualization for Video Action Recognition"☆15Updated last month
- Temporal Alignment Representations with Contrastive Learning☆22Updated last year