sunilhoho / EVERESTLinks
Official Pytorch implementation of EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens [ICML2024].
☆27Updated last year
Alternatives and similar repositories for EVEREST
Users that are interested in EVEREST are comparing it to the libraries listed below
Sorting:
- Official implementation of CVPR 2024 paper "vid-TLDR: Training Free Token merging for Light-weight Video Transformer".☆49Updated last year
- ☆29Updated last year
- [AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral)☆39Updated last year
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning☆41Updated last year
- Official implementation of CVPR 2024 paper "Prompt Learning via Meta-Regularization".☆27Updated 3 months ago
- [CVPR 2023] Learning Attention as Disentangler for Compositional Zero-shot Learning☆39Updated last year
- ☆61Updated 2 years ago
- ☆17Updated 5 months ago
- (NeurIPS 2024 Spotlight) TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment☆31Updated 9 months ago
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023)☆40Updated last year
- [ICCV 2023] Official implementation of paper "SOAR: Scene-debiasing Open-set Action Recognition".☆11Updated last year
- Compress conventional Vision-Language Pre-training data☆51Updated last year
- Official repository for "Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting" [CVPR 2023]☆119Updated last year
- ☆26Updated last year
- Official implementation of "Test-Time Zero-Shot Temporal Action Localization", CVPR 2024☆61Updated 9 months ago
- Code for Static and Dynamic Concepts for Self-supervised Video Representation Learning.☆10Updated 2 years ago
- Code for "Class-Incremental Learning for Action Recognition in Videos", ICCV 2021☆20Updated 2 years ago
- [NeurIPS 2023 (Spotlight)] Uncovering the Hidden Dynamics of Video Self-supervised Learning under Distribution Shifts☆14Updated last year
- [ECCV 2024] EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval☆38Updated 2 months ago
- Official implementation of Dancing with Still Images: Video Distillation via Static-Dynamic Disentanglement.☆31Updated 10 months ago
- ☆39Updated last year
- [ICLR 2023] Temporal Alignment Representations with Contrastive Learning☆26Updated 2 years ago
- Code and data for the paper "Emergent Visual-Semantic Hierarchies in Image-Text Representations" (ECCV 2024)☆28Updated 10 months ago
- [ICCV2023] - CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation☆36Updated 8 months ago
- ☆56Updated 5 months ago
- ☆24Updated last year
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?☆33Updated 2 years ago
- ☆80Updated 2 years ago
- ☆96Updated last year
- Official Implementation of "Read-only Prompt Optimization for Vision-Language Few-shot Learning", ICCV 2023☆53Updated last year