Video Test-Time Adaptation for Action Recognition (CVPR 2023)
☆52Oct 13, 2024Updated last year
Alternatives and similar repositories for ViTTA
Users that are interested in ViTTA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository for ECCV 2022 paper "Source-free Video Domain Adaptation by Learning Temporal Consistency for Action Recognition"☆24Mar 9, 2023Updated 3 years ago
- A comprehensive collection of awesome research and other items about video domain adaptation☆114Jan 18, 2025Updated last year
- ☆21May 29, 2023Updated 2 years ago
- [ECCV 2024] Official code release for "Multimodal Cross-Domain Few-Shot Learning for Egocentric Action Recognition"☆42Mar 24, 2025Updated last year
- Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs (ECCV 2024)☆19Jul 15, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Accepted at ICCV '23☆15Oct 4, 2023Updated 2 years ago
- The official implementation of "Low-power, Continuous Remote Behavioral Localization with Event Cameras" (CVPR 2024)☆12Sep 25, 2024Updated last year
- AFNet(NeurIPS 2022)☆20Nov 24, 2022Updated 3 years ago
- Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" Neurips 23 Spotlight☆37May 23, 2023Updated 2 years ago
- Code for our IJCV 2023 paper "CLIP-guided Prototype Modulating for Few-shot Action Recognition".☆77Mar 7, 2024Updated 2 years ago
- Task-adaptive Spatial-Temporal Video Sampler for Few-shot Action Recognition☆14Dec 22, 2022Updated 3 years ago
- Learning Representational Invariances for Data-Efficient Action Recognition☆33Oct 26, 2021Updated 4 years ago
- Official implementation of the ECCV 2022 paper "CMD: Self-supervised 3D Action Representation Learning with Cross-modal Mutual Distillati…☆37Oct 5, 2022Updated 3 years ago
- ☆194Oct 22, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- The first work for cross-domain open-vocabulary action recognition with a benchmark☆21May 27, 2024Updated last year
- DA-AIM: Exploiting Instance-based Mixed Sampling via Auxiliary Source Domain Supervision for Domain-adaptive Action Detection☆12Oct 6, 2022Updated 3 years ago
- TAM: Temporal Adaptive Module for Video Recognition☆208Aug 18, 2022Updated 3 years ago
- Official code of the MSF model for GZSSAR (ICIG 2023)☆14Jan 3, 2026Updated 3 months ago
- [ICLR 2025] Towards Robust Multimodal Open-set Test-time Adaptation via Adaptive Entropy-aware Optimization☆23May 23, 2025Updated 10 months ago
- [ICCV 2025] Official repository of TDSM☆29Oct 17, 2025Updated 5 months ago
- Official implementation of Lightweight Human Pose Estimation Using Loss Weighted by Target Heatmap that was honorably mentioned as Best P…☆11Dec 17, 2023Updated 2 years ago
- [CVPR'22] DirecFormer: A Directed Attention in Transformer Approach to Robust Action Recognition☆27Sep 28, 2022Updated 3 years ago
- Distribution Aware Tuning☆16Aug 29, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [ICLR 2024] FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition☆98Jan 14, 2025Updated last year
- Collection of awesome test-time (domain/batch/instance) adaptation methods☆1,256Nov 14, 2025Updated 5 months ago
- 5th CLVISION workshop at CVPR: repo for the challenge☆19May 13, 2024Updated last year
- Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization☆26Apr 14, 2025Updated last year
- [CVPR 2023] Robust Test-Time Adaptation in Dynamic Scenarios. https://arxiv.org/abs/2303.13899☆71Jul 11, 2023Updated 2 years ago
- ☆18Nov 19, 2024Updated last year
- Code for "TSGCNeXt: Dynamic-Static Multi-Graph Convolution for Efficient Skeleton-Based Action Recognition with Long-term Learning Potent…☆38Sep 12, 2025Updated 7 months ago
- [IEEE T-IP 2021] Semantics-aware Adaptive Knowledge Distillation for Cross-modal Action Recognition☆29Jan 6, 2025Updated last year
- The official project website of "Ske2Grid: Skeleton-to-Grid Representation Learning for Action Recognition" (The paper of Ske2Grid is pub…☆19Sep 6, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Official implement of VAD-LLaMA☆19Sep 10, 2024Updated last year
- Multi-grained Spatio-Temporal Features Perceived Network for Event-based Lip-Reading (CVPR 2022)☆13Jun 18, 2022Updated 3 years ago
- Codebase for "Revisiting spatio-temporal layouts for compositional action recognition" (Oral at BMVC 2021).☆27Apr 3, 2022Updated 4 years ago
- Code and data release for the paper "Learning Object State Changes in Videos: An Open-World Perspective" (CVPR 2024)☆35Sep 9, 2024Updated last year
- Temporal Relational Modeling with Self-Supervision for Action Segmentation☆20Feb 7, 2021Updated 5 years ago
- PyTorch demo code for "Spatial-Temporal Pyramid Based Convolutional Neural Network for Action Recognition"☆15Oct 17, 2018Updated 7 years ago
- Continual Learning Toolbox for Computer Vision Tasks☆21Oct 3, 2023Updated 2 years ago