YorkUCVIL / VTCD
☆13Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for VTCD
- Official implementation of "Test-Time Zero-Shot Temporal Action Localization", CVPR 2024☆42Updated 2 months ago
- ☆12Updated 8 months ago
- The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"☆61Updated 7 months ago
- Official implementation of TCL (CVPR 2023)☆108Updated last year
- A curated list of awesome self-supervised learning methods in videos☆112Updated this week
- [ECCV 2024] Official PyTorch implementation of TC-CLIP "Leveraging Temporal Contextualization for Video Action Recognition"☆28Updated last month
- Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023]☆88Updated 6 months ago
- Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs☆24Updated 4 months ago
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning☆39Updated last year
- Official implementation of "Harnessing Large Language Models for Training-free Video Anomaly Detection", CVPR 2024☆56Updated 3 months ago
- Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23)☆37Updated 10 months ago
- Official repository of paper titled "How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs".☆40Updated 2 months ago
- Text-Image Alignment for Diffusion-based Perception (TADP) - CVPR 2024☆23Updated 2 months ago
- Code for the paper Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models @ CVPR 2024☆57Updated 4 months ago
- [CVPR'23] AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders☆73Updated 9 months ago
- [ECCV 2024] PyTorch implementation of CropMAE, introduced in "Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders"☆44Updated 4 months ago
- Official repo for our ICML 23 paper: "Multi-Modal Classifiers for Open-Vocabulary Object Detection"☆80Updated last year
- ☆35Updated 7 months ago
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction☆167Updated 9 months ago
- [CVPR 2022] OCSampler: Compressing Videos to One Clip with Single-step Sampling☆17Updated 2 years ago
- A detection/segmentation dataset with labels characterized by intricate and flexible expressions. "Described Object Detection: Liberating…☆107Updated 7 months ago
- Official implementation of "HowToCaption: Prompting LLMs to Transform Video Annotations at Scale." ECCV 2024☆45Updated last month
- ☆26Updated last year
- Library implementation of "No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations"☆22Updated last week
- FreeDA: Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation (CVPR 2024)☆28Updated 2 months ago
- [ICLR 2024] Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models.☆55Updated 3 months ago
- [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…☆110Updated last year
- [NeurIPS 2023] Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization☆95Updated 9 months ago
- Composed Video Retrieval☆45Updated 6 months ago
- [CVPRW 2024] Official repository of paper titled "Learning to Prompt with Text Only Supervision for Vision-Language Models".☆90Updated 2 months ago