arturxe2 / ASTRA
PyTorch Implementation of "ASTRA: An Action Spotting TRAnsformer for Soccer Videos", ACM MMSports 2023. | 3rd place solution for SoccerNet Action Spotting Challenge 2023.
☆39Updated 11 months ago
Alternatives and similar repositories for ASTRA
Users that are interested in ASTRA are comparing it to the libraries listed below
Sorting:
- VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models☆32Updated last month
- Official PyTorch implementation of "No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding"☆32Updated 11 months ago
- ☆68Updated 10 months ago
- Dataset and Code for CVSports at CVPR 2024 paper "AutoSoccerPose: Automated 3D posture Analysis of Soccer Shot Movements"☆40Updated 10 months ago
- Repository containing all necessary codes to get started on the SoccerNet Dense Video Captioning challenge.☆30Updated last year
- PyTorch code for "ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning"☆20Updated 6 months ago
- OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024☆60Updated 2 months ago
- ☆67Updated last month
- Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding"☆21Updated last month
- [ECCV'24 Workshops Oral] DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling☆31Updated 6 months ago
- [Pattern Recognition 2024] Semantic-Aware Frame-Event Fusion based Pattern Recognition via Large Vision-Language Models, Dong Li, Jiandon…☆17Updated 3 months ago
- Multi-vision Sensor Perception and Reasoning (MS-PR) benchmark, assessing VLMs on their capacity for sensor-specific reasoning.☆15Updated 2 months ago
- AAPL: Adding Attributes to Prompt Learning for Vision-Language Models (CVPRw 2024)☆34Updated last year
- CAVIS: Context-Aware Video Instance Segmentation☆85Updated last month
- Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model☆60Updated 4 months ago
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆42Updated 9 months ago
- Tennis Detection and Visualization System An advanced computer vision system for tennis match analysis that tracks players and ball move…☆14Updated 3 weeks ago
- [ICCV2023] MixSort: The Customized Tracker in SportsMOT☆80Updated last year
- Official Code for "MITracker: Multi-View Integration for Visual Object Tracking"☆67Updated this week
- Official Pytorch Implementation of Self-emerging Token Labeling☆33Updated last year
- Official implementation of Add-SD: Rational Generation without Manual Reference.☆27Updated 8 months ago
- Tracking through Containers and Occluders in the Wild (CVPR 2023) - Official Implementation☆41Updated 11 months ago
- [NeurIPS 2023] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection☆56Updated last year
- 3D Traffic Light & Sign Dataset☆18Updated last month
- This repo contains the code for our TMLR paper: A Simple Video Segmenter by Tracking Objects Along Axial Trajectories☆27Updated last month
- Spatio-Temporal MLP-Graph Network for 3D Human Pose Estimation☆23Updated last year
- TensorFlow code for our ECCV'24 Workshop paper "LightAvatar: Efficient Head Avatar as Dynamic NeLF"☆28Updated 6 months ago
- [CVPR25] Official repository for the paper: "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation"☆175Updated last month
- VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning☆121Updated last week
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆63Updated 9 months ago