Test-Time Training on Video Streams
☆68Jul 24, 2023Updated 2 years ago
Alternatives and similar repositories for video-ttt-release
Users that are interested in video-ttt-release are comparing it to the libraries listed below
Sorting:
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".☆16Jun 20, 2023Updated 2 years ago
- We're Not Using Videos Effectively (TMLR 2024)☆17Feb 4, 2024Updated 2 years ago
- Official implementation and data release of the paper "Visual Prompting via Image Inpainting".☆317Aug 7, 2023Updated 2 years ago
- [CVPRW'22] Unsupervised Salient Object Detection With Spectral Cluster Voting☆65Apr 20, 2023Updated 2 years ago
- ☆11Nov 5, 2024Updated last year
- [NeurIPS 2022] The official implementation of "Learning to Discover and Detect Objects".☆111Jun 13, 2023Updated 2 years ago
- Implementations of some few-shot action recognition methods.☆43Jun 7, 2021Updated 4 years ago
- A large scale dataset for Video Captioning in Italian☆13May 16, 2023Updated 2 years ago
- An in-context conditioning version of MUSE with pre-trained checkpoints.☆116Jun 4, 2023Updated 2 years ago
- [ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models.☆27Oct 27, 2023Updated 2 years ago
- ☆193Oct 22, 2022Updated 3 years ago
- MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge (ICCV 2023)☆30Sep 5, 2023Updated 2 years ago
- DA-AIM: Exploiting Instance-based Mixed Sampling via Auxiliary Source Domain Supervision for Domain-adaptive Action Detection☆12Oct 6, 2022Updated 3 years ago
- ☆10Jul 14, 2023Updated 2 years ago
- Thermal Indoor Motion Dataset☆14Apr 27, 2023Updated 2 years ago
- Code for Static and Dynamic Concepts for Self-supervised Video Representation Learning.☆11Jul 28, 2022Updated 3 years ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆35Feb 26, 2026Updated last week
- [NeurIPS 25] Hyperbolic Contrastive Regularisation for Geometrically Aware Sign Language Translation☆20Nov 26, 2025Updated 3 months ago
- ☆242Jun 4, 2025Updated 9 months ago
- This is the official implementation of Elaborative Rehearsal for Zero-shot Action Recognition (ICCV2021)☆36Apr 9, 2022Updated 3 years ago
- Code and data related to "Efficient, Compositional, Order-Sensitive n-gram Embeddings" (EACL 2017)☆15Apr 6, 2017Updated 8 years ago
- [IPDPS 2024] Adaptive neighbor sampling for temporal GNN☆16Feb 17, 2025Updated last year
- [WACV 2024] Instruct Me More! Random Prompting for Visual In-Context Learning☆18May 7, 2025Updated 10 months ago
- Video + CLIP Baseline for Ego4D Long Term Action Anticipation Challenge (CVPR 2022)☆15Jul 4, 2022Updated 3 years ago
- Task-adaptive Spatial-Temporal Video Sampler for Few-shot Action Recognition☆14Dec 22, 2022Updated 3 years ago
- [CVPR 2023] This is the official PyTorch implementation for "Dynamic Focus-aware Positional Queries for Semantic Segmentation".☆61Mar 4, 2023Updated 3 years ago
- Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023]☆101Apr 30, 2024Updated last year
- A simple PyTorch implementation of Learning Instance Activation Maps for Weakly Supervised Instance Segmentation, in CVPR 2019☆11Jun 18, 2020Updated 5 years ago
- PyTorch implementation of ICML 2023 paper "SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation"☆101Jun 28, 2023Updated 2 years ago
- Repo of HawkLlama.☆16Jan 2, 2025Updated last year
- ☆14Sep 25, 2016Updated 9 years ago
- Code release for "Learning Video Representations from Large Language Models"☆536Oct 1, 2023Updated 2 years ago
- ☆20Dec 8, 2024Updated last year
- [ICCV2023 Oral] Implicit Temporal Modeling with Learnable Alignment for Video Recognition☆41Nov 29, 2023Updated 2 years ago
- Development kit for the CLVISION @ CVPR 2023 Challenge☆17May 27, 2023Updated 2 years ago
- [ICCV2023] Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer☆37Oct 18, 2023Updated 2 years ago
- [ECCV 2024] ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video☆20Jul 29, 2024Updated last year
- Some thoughts about writing scientific papers☆21Nov 8, 2024Updated last year
- Contrastive Learning of Image Representations with Cross-Video Cycle-Consistency☆17Dec 2, 2021Updated 4 years ago