The first work for cross-domain open-vocabulary action recognition with a benchmark
☆20May 27, 2024Updated last year
Alternatives and similar repositories for XOV-Action
Users that are interested in XOV-Action are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2023] Diversifying Spatial-Temporal Perception for Video Domain Generalization☆17Oct 26, 2023Updated 2 years ago
- ☆27Jun 11, 2022Updated 3 years ago
- [CVPR 2023] Enlarge Instance-specific and Class-specific Information for Open-set Action Recognition☆30Apr 19, 2023Updated 2 years ago
- Official repository for ACM Multimedia'23 paper "MATK: The Meme Analytical Tool Kit"☆13May 29, 2024Updated last year
- ☆14Nov 26, 2025Updated 3 months ago
- Welcome to CV-PCL Viewer! This software has simple image and video processing functions, as well as the ability to visualize point cloud …☆16Jul 20, 2024Updated last year
- (ECCV 2024) Official repository of paper "EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding"☆32Apr 8, 2025Updated 10 months ago
- Official Implementation of "VAU-R1: Advancing Video Anomaly Understanding via Reinforcement Fine-Tuning".☆63Nov 20, 2025Updated 3 months ago
- Source Code for Captionomaly: A Deep Learning Toolbox for Anomaly Captioning in Surveillance Videos☆13Jun 26, 2023Updated 2 years ago
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆19Jul 10, 2025Updated 7 months ago
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation☆27Feb 28, 2026Updated last week
- [ICCV2025] Hierarchical Visual Prompt Learning for Continual Video Instance Segmentation☆14Feb 18, 2026Updated 2 weeks ago
- ☆11Dec 6, 2024Updated last year
- ☆10Jul 26, 2025Updated 7 months ago
- Official implementation for “SafeMVDrive: Multi-view Safety-Critical Driving Video Synthesis in the Real World Domain”☆21Dec 11, 2025Updated 2 months ago
- [NeurIPS 2025] Panoptic Captioning: An Equivalence Bridge for Image and Text☆33Jan 31, 2026Updated last month
- Word-level language identification for Bangla-English code-mixed social media data, using a BiLSTM with subword embeddings.☆10Aug 13, 2023Updated 2 years ago
- Weakly Supervised Referring Video Object Segmentation with Object-Centric Pseudo-Guidance☆10Aug 17, 2024Updated last year
- Multiscale Score Matching Analysis☆12Jan 19, 2023Updated 3 years ago
- Empowering Small VLMs to Think with Dynamic Memorization and Exploration☆15Nov 18, 2025Updated 3 months ago
- ☆14Jul 6, 2022Updated 3 years ago
- Initial commit☆12Aug 14, 2023Updated 2 years ago
- ☆12Jan 21, 2019Updated 7 years ago
- Delving into the Continuous Domain Adaptation (ACM MM22)☆12Jul 10, 2022Updated 3 years ago
- Basic Vacuum Cleaner World Problem in Artificial Intelligence that describes how any action is perfomed after sensing the environment.☆11Feb 11, 2019Updated 7 years ago
- [CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval☆21Jun 23, 2025Updated 8 months ago
- Trial version for prs platform (python project). Please note that the complete experience requires downloading the Unity resource.☆10Jun 26, 2024Updated last year
- Official repository of "TDSD: Text-Driven Scene-Decoupled Weakly Supervised Video Anomaly Detection"☆11May 25, 2025Updated 9 months ago
- ☆15Dec 2, 2025Updated 3 months ago
- ☆48Sep 22, 2023Updated 2 years ago
- This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…☆13May 25, 2023Updated 2 years ago
- Predicting Ovarian Cancer Treatment Response in Histopathology using Hierarchical Vision Transformers and Multiple Instance Learning☆12Nov 29, 2023Updated 2 years ago
- Pre-processing and execution scripts for experiments on automatic translation of sign language focusing on the gloss-to-text. Research do…☆11May 21, 2022Updated 3 years ago
- This is a repository contains the implementation of our NeurIPS'24 paper "Temporal Sentence Grounding with Relevance Feedback in Videos"☆13Aug 22, 2025Updated 6 months ago
- Reinforcement Learning based trading bot (Open AI Gym, Stable Baselines)☆10Mar 24, 2023Updated 2 years ago
- Personal DeepLearning project☆13Jun 2, 2021Updated 4 years ago
- LLaVA-Next for STVG☆18Dec 5, 2025Updated 3 months ago
- team Doggeee's solution to Ego4D LTA challenge@CVPRW23'☆13Nov 4, 2023Updated 2 years ago
- [WACV 2024] TSP-Transformer: Task-Specific Prompts Boosted Transformer for Holistic Scene Understanding☆13May 30, 2024Updated last year