heldJan / X-VARS
X-VARS is a multi-modal large language model designed for understanding football videos from the point of view of a referee. X-VARS can perform a multitude of tasks, including video description, question answering, action recognition, and conducting meaningful conversations based on video content.
☆13Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for X-VARS
- ☆33Updated this week
- Code for Spotting Temporally Precise, Fine-Grained Events in Video☆51Updated last year
- Repository containing all necessary codes to get started on the SoccerNet Dense Video Captioning challenge.☆28Updated 7 months ago
- Repository containing all necessary codes to get started on the SoccerNet Re-Identification challenge. This repository also contains benc…☆52Updated last year
- The official implementation of our paper "Sports Video Analysis on Large-scale Data" (https://arxiv.org/abs/2208.04897)☆60Updated last year
- Official implementation of "Test-Time Zero-Shot Temporal Action Localization", CVPR 2024☆43Updated 2 months ago
- ☆44Updated 10 months ago
- ☆11Updated 8 months ago
- [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…☆110Updated last year
- [ECCV 2024] Official PyTorch implementation of TC-CLIP "Leveraging Temporal Contextualization for Video Action Recognition"☆30Updated last month
- Repository containing all necessary codes to get started on the SoccerNet Action Spotting challenge. This repository also contains severa…☆63Updated 9 months ago
- Official webpage for TeD-SPAD: Temporal Distinctiveness for Self-supervised Privacy-preservation for video Anomaly Detection, accepted at…☆15Updated 3 months ago
- [CVPR 2024] SportsHHI: A Dataset for Human-Human Interaction Detection in Sports Videos☆11Updated 6 months ago
- [ECCV 2024] PyTorch implementation of CropMAE, introduced in "Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders"☆44Updated 4 months ago
- The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"☆61Updated 7 months ago
- Official implementation of paper - PnLCalib: Sports Field Registration via Points and Lines Optimization☆14Updated last week
- VidPress Sports☆27Updated last year
- The official code for Relational Context Learning for Human-Object Interaction Detection, CVPR2023.☆48Updated last year
- Progress-Aware Online Action Segmentation for Egocentric Procedural Task Videos☆10Updated 2 months ago
- ☆22Updated 3 weeks ago
- ECCV2022 Towards Hard-Positive Query Mining for DETR-based Human-Object Interaction Detection☆27Updated last year
- BEAR: a new BEnchmark on video Action Recognition☆42Updated 7 months ago
- ☆30Updated last year
- Repository containing all necessary codes to get started on the SoccerNet Tracking challenge. This repository also contains benchmark met…☆71Updated last year
- CVPR 2023 Accepted Paper HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models☆57Updated 8 months ago
- Official Implementation of SnAG (CVPR 2024)☆37Updated 3 weeks ago
- Action Scene Graphs for Long-Form Understanding of Egocentric Videos (CVPR 2024)☆30Updated last month
- [ECCV 2024 Oral] ActionVOS: Actions as Prompts for Video Object Segmentation☆26Updated last month
- ☆12Updated last year
- MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge (ICCV 2023)☆27Updated last year