The first work for cross-domain open-vocabulary action recognition with a benchmark
☆20May 27, 2024Updated last year
Alternatives and similar repositories for XOV-Action
Users that are interested in XOV-Action are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is the official repo for [CVPR 2025] paper, Mitigating the Human-Robot Domain Discrepancy in Visual Pre-training for Robotic Manipul…☆29Mar 31, 2025Updated 11 months ago
- (ECCV 2024) Official repository of paper "EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding"☆32Apr 8, 2025Updated 11 months ago
- [CVPR 2023] Enlarge Instance-specific and Class-specific Information for Open-set Action Recognition☆30Apr 19, 2023Updated 2 years ago
- Exploring High-quality Target Domain Information for Unsupervised Domain Adaptive Semantic Segmentation☆20Nov 22, 2022Updated 3 years ago
- ☆27Jun 11, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Accepted at ICCV '23☆15Oct 4, 2023Updated 2 years ago
- codes for Uncovering Hidden Challenges in Query-Based Video Moment Retrieval☆20Sep 7, 2020Updated 5 years ago
- [ICCV2025] Hierarchical Visual Prompt Learning for Continual Video Instance Segmentation☆14Feb 18, 2026Updated last month
- [CVPR-2023] Semantic-Promoted Debiasing and Background Disambiguation for Zero-Shot Instance Segmentation☆18Jul 2, 2023Updated 2 years ago
- Rethinking Whole-Body CT Image Interpretation: An Abnormality-Centric Approach☆19Nov 17, 2025Updated 4 months ago
- Multiscale Score Matching Analysis☆11Jan 19, 2023Updated 3 years ago
- Video Test-Time Adaptation for Action Recognition (CVPR 2023)☆52Oct 13, 2024Updated last year
- (ICCV 2023) Official implementation of Rectified Straight Through Estimator (ReSTE).☆31Sep 20, 2024Updated last year
- Repository for ECCV 2022 paper "Source-free Video Domain Adaptation by Learning Temporal Consistency for Action Recognition"☆24Mar 9, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆14Oct 11, 2021Updated 4 years ago
- Official Implementation of "VAU-R1: Advancing Video Anomaly Understanding via Reinforcement Fine-Tuning".☆63Nov 20, 2025Updated 4 months ago
- Preoperative to Intraoperative Laparoscopy Fusion☆15Oct 5, 2025Updated 5 months ago
- MICCAI 2022: Free Lunch for Surgical Video Understanding by Distilling Self-Supervisions☆12Sep 17, 2022Updated 3 years ago
- Predicting Ovarian Cancer Treatment Response in Histopathology using Hierarchical Vision Transformers and Multiple Instance Learning☆12Nov 29, 2023Updated 2 years ago
- The Continual Learning in Multimodality Benchmark☆68Jun 24, 2023Updated 2 years ago
- Source Code for Captionomaly: A Deep Learning Toolbox for Anomaly Captioning in Surveillance Videos☆13Jun 26, 2023Updated 2 years ago
- ☆51Jun 19, 2024Updated last year
- Word-level language identification for Bangla-English code-mixed social media data, using a BiLSTM with subword embeddings.☆10Aug 13, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Official repository for ACM Multimedia'23 paper "MATK: The Meme Analytical Tool Kit"☆13May 29, 2024Updated last year
- [NeurIPS 2024] PediatricsGPT: Large Language Models as Chinese Medical Assistants for Pediatric Applications☆20Nov 4, 2024Updated last year
- An implementation of soical lstm for pedestrian movement forecasting.☆21Oct 23, 2021Updated 4 years ago
- A curated list of papers & resources linked to concept learning☆12Aug 9, 2023Updated 2 years ago
- [IEEE T-IP 2021] Semantics-aware Adaptive Knowledge Distillation for Cross-modal Action Recognition☆29Jan 6, 2025Updated last year
- [ICLR'23] AIM: Adapting Image Models for Efficient Video Action Recognition☆301Sep 17, 2023Updated 2 years ago
- ☆12Jan 21, 2019Updated 7 years ago
- Code for 3D object detection for autonomous driving☆14Jan 17, 2020Updated 6 years ago
- team Doggeee's solution to Ego4D LTA challenge@CVPRW23'☆13Nov 4, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Official implementation for “SafeMVDrive: Multi-view Safety-Critical Driving Video Synthesis in the Real World Domain”☆22Dec 11, 2025Updated 3 months ago
- [ICLR 2026] Empowering Small VLMs to Think with Dynamic Memorization and Exploration☆16Mar 18, 2026Updated last week
- This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…☆13May 25, 2023Updated 2 years ago
- ☆14Nov 26, 2025Updated 4 months ago
- This is the official repo for [CoRL 2024] Contrastive Imitation Learning for Language-guided Multi-Task Robotic Manipulation☆32Oct 30, 2024Updated last year
- Code for our IJCV 2023 paper "CLIP-guided Prototype Modulating for Few-shot Action Recognition".☆78Mar 7, 2024Updated 2 years ago
- Temporal Pyramid Routing For Video Instance Segmentation-T-PAMI-2022☆25Jul 6, 2023Updated 2 years ago