The first work for cross-domain open-vocabulary action recognition with a benchmark
☆21May 27, 2024Updated 2 years ago
Alternatives and similar repositories for XOV-Action
Users that are interested in XOV-Action are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2023] Diversifying Spatial-Temporal Perception for Video Domain Generalization☆17Oct 26, 2023Updated 2 years ago
- (TCSVT 2024) Official PyTorch implementation of paper "Continual Action Assessment via Task-Consistent Score-Discriminative Feature Distr…☆12Mar 19, 2025Updated last year
- This is the official repo for [CVPR 2025] paper, Mitigating the Human-Robot Domain Discrepancy in Visual Pre-training for Robotic Manipul…☆31Mar 31, 2025Updated last year
- (ECCV 2024) Official repository of paper "EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding"☆32Apr 8, 2025Updated last year
- [CVPR 2023] Enlarge Instance-specific and Class-specific Information for Open-set Action Recognition☆30Apr 19, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆12Dec 8, 2016Updated 9 years ago
- [NeurIPS 2025] Panoptic Captioning: An Equivalence Bridge for Image and Text☆37Jan 31, 2026Updated 3 months ago
- This is the official Pytorch code for our paper "Artemis: Structured Visual Reasoning for Perception Policy Learning".☆14Dec 4, 2025Updated 5 months ago
- Accepted at ICCV '23☆16Oct 4, 2023Updated 2 years ago
- Welcome to CV-PCL Viewer! This software has simple image and video processing functions, as well as the ability to visualize point cloud …☆16Jul 20, 2024Updated last year
- codes for Uncovering Hidden Challenges in Query-Based Video Moment Retrieval☆20Sep 7, 2020Updated 5 years ago
- [ICCV2025] Hierarchical Visual Prompt Learning for Continual Video Instance Segmentation☆14Feb 18, 2026Updated 3 months ago
- [CVPR-2023] Semantic-Promoted Debiasing and Background Disambiguation for Zero-Shot Instance Segmentation☆18Jul 2, 2023Updated 2 years ago
- ☆21May 29, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Project page for the 'CLAWS: Clustering Assisted Weakly Supervised Learning with Normalcy Suppression for Anomalous Event Detection', ECC…☆12May 29, 2021Updated 4 years ago
- VLM benchmarks for robot manipulation tasks☆22Apr 30, 2025Updated last year
- Video Test-Time Adaptation for Action Recognition (CVPR 2023)☆53Oct 13, 2024Updated last year
- (ICCV 2023) Official implementation of Rectified Straight Through Estimator (ReSTE).☆34Sep 20, 2024Updated last year
- [ICCV 2025] Official repository of TDSM☆32Oct 17, 2025Updated 7 months ago
- Preoperative to Intraoperative Laparoscopy Fusion☆15Oct 5, 2025Updated 7 months ago
- Official Implementation of "VAU-R1: Advancing Video Anomaly Understanding via Reinforcement Fine-Tuning".☆68Nov 20, 2025Updated 6 months ago
- MICCAI 2022: Free Lunch for Surgical Video Understanding by Distilling Self-Supervisions☆13Sep 17, 2022Updated 3 years ago
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation☆36Feb 28, 2026Updated 2 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Predicting Ovarian Cancer Treatment Response in Histopathology using Hierarchical Vision Transformers and Multiple Instance Learning☆12Nov 29, 2023Updated 2 years ago
- The Continual Learning in Multimodality Benchmark☆67Jun 24, 2023Updated 2 years ago
- Source Code for Captionomaly: A Deep Learning Toolbox for Anomaly Captioning in Surveillance Videos☆13Jun 26, 2023Updated 2 years ago
- ☆53Jun 19, 2024Updated last year
- This is the official code repo for GLOVER and GLOVER++.☆55Aug 6, 2025Updated 9 months ago
- An implementation of soical lstm for pedestrian movement forecasting.☆22Oct 23, 2021Updated 4 years ago
- [IEEE T-IP 2021] Semantics-aware Adaptive Knowledge Distillation for Cross-modal Action Recognition☆29Jan 6, 2025Updated last year
- ☆12Jan 21, 2019Updated 7 years ago
- [ICLR'23] AIM: Adapting Image Models for Efficient Video Action Recognition☆300Sep 17, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code for 3D object detection for autonomous driving☆14Jan 17, 2020Updated 6 years ago
- [ICML 2026] SafeMVDrive: Multi-view Safety-Critical Driving Video Synthesis in the Real World Domain☆26May 2, 2026Updated 3 weeks ago
- Framework for simulating deficiencies and other aspects of the human visual system☆13Jun 2, 2024Updated last year
- This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…☆13May 25, 2023Updated 3 years ago
- ☆14Nov 26, 2025Updated 6 months ago
- [ICLR 2026] Empowering Small VLMs to Think with Dynamic Memorization and Exploration☆18Mar 18, 2026Updated 2 months ago
- Metappearance: Meta-Learning for Visual Appearance Reproduction☆21Sep 19, 2022Updated 3 years ago