jyrao / SoccerAgent
[ACM Multimedia 2025] "Multi-Agent System for Comprehensive Soccer Understanding"
☆65Updated 2 months ago
Alternatives and similar repositories for SoccerAgent
Users who are interested in SoccerAgent are comparing it to the repositories listed below.
- [EMNLP 2024 Oral] MatchTime: Towards Automatic Soccer Game Commentary Generation☆90Updated last year
- [CVPR2025 Highlight] Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models☆232Updated 2 months ago
- [CVPR 2025] "Towards Universal Soccer Video Understanding"☆207Updated 4 months ago
- This is the official code of VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding (ECCV 2024)☆289Updated last year
- Codes for Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models☆277Updated 5 months ago
- [ICCV 2025] Official Repository of VideoLLaMB: Long Video Understanding with Recurrent Memory Bridges☆81Updated 11 months ago
- [ECCV 2024🔥] Official implementation of the paper "ST-LLM: Large Language Models Are Effective Temporal Learners"☆150Updated last year
- ☆134Updated 9 months ago
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement☆129Updated 6 months ago
- Official code for Paper "Mantis: Multi-Image Instruction Tuning" [TMLR 2024 Best Paper]☆237Updated 3 weeks ago
- ✨✨[NeurIPS 2025] This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehensi…☆387Updated 2 weeks ago
- [ICCV 2025 Highlight] The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"☆190Updated 10 months ago
- Pixel-Level Reasoning Model trained with RL [NeurIPS25]☆267Updated 2 months ago
- 💡 VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning☆299Updated 3 months ago
- A Survey on Benchmarks of Multimodal Large Language Models☆146Updated 6 months ago
- [NeurIPS'25] ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding☆47Updated 4 months ago
- Code for CVPR25 paper "VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos"☆155Updated 7 months ago
- [ACL 2025 🔥] Rethinking Step-by-step Visual Reasoning in LLMs☆310Updated 8 months ago
- TStar is a unified temporal search framework for long-form video question answering☆84Updated 4 months ago
- Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding☆292Updated 5 months ago
- Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos☆63Updated 4 months ago
- TinyLLaVA-Video-R1: Towards Smaller LMMs for Video Reasoning☆113Updated last month
- (ICCV2025) Official repository of paper "ViSpeak: Visual Instruction Feedback in Streaming Videos"☆44Updated 6 months ago
- Video Chain of Thought, Codes for ICML 2024 paper: "Video-of-Thought: Step-by-Step Video Reasoning from Perception to Cognition"☆176Updated 11 months ago
- 🔥🔥MLVU: Multi-task Long Video Understanding Benchmark☆238Updated 5 months ago
- Official implementation of Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning☆28Updated last year
- Long Context Transfer from Language to Vision☆398Updated 10 months ago
- Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced …☆90Updated last year
- [ACL2025 Oral & Award] Evaluate Image/Video Generation like Humans - Fast, Explainable, Flexible☆114Updated 5 months ago
- Official implementation of paper AdaReTaKe: Adaptive Redundancy Reduction to Perceive Longer for Video-language Understanding☆88Updated 9 months ago