[ICLR 2025] SPORTU: A Comprehensive Sports Understanding Benchmark for Multimodal Large Language Models
☆17 · Sep 17, 2025 · Updated 5 months ago
Alternatives and similar repositories for SPORTU
Users who are interested in SPORTU are comparing it to the libraries listed below.
- ☆23 · Dec 22, 2024 · Updated last year
- A Lightweight Visual Reasoning Benchmark for Evaluating Large Multimodal Models through Complex Diagrams in Coding Tasks ☆14 · Feb 25, 2025 · Updated last year
- official repo for AGNOSTOS, a cross-task manipulation benchmark, and X-ICM method, a cross-task in-context manipulation (VLA) method ☆61 · Nov 26, 2025 · Updated 3 months ago
- Upload a video and provide a prompt to generate a narration. ☆11 · Mar 5, 2025 · Updated last year
- ☆13 · Nov 5, 2024 · Updated last year
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation ☆27 · Updated this week
- MV-RAG combines retrieval with multi-view generation to create accurate 3D-consistent visuals. By retrieving reference images and text, i… ☆24 · Nov 29, 2025 · Updated 3 months ago
- ☆13 · Apr 30, 2025 · Updated 10 months ago
- [EMNLP2023]: MIRACLE: Towards Personalized Dialogue Generation with Latent-Space Multiple Personal Attribute Control ☆12 · Nov 11, 2023 · Updated 2 years ago
- [SIGIR 2025] Benchmarking Recommendation, Classification, and Tracing Based on Hugging Face Knowledge Graph ☆16 · Jun 6, 2025 · Updated 9 months ago
- the datasets of our paper ☆11 · Feb 26, 2024 · Updated 2 years ago
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning ☆43 · Dec 9, 2024 · Updated last year
- This project aims at adjusting the VideoPose3D project from Dario Pavllo, in order to track the trajectories of multiple people and predi… ☆10 · Mar 28, 2021 · Updated 4 years ago
- ☆14 · Mar 5, 2024 · Updated 2 years ago
- Repository for Skill Set Optimization ☆14 · Jul 26, 2024 · Updated last year
- LLM as World Models using Bayesian inference ☆16 · May 27, 2025 · Updated 9 months ago
- ☆14 · Jun 2, 2025 · Updated 9 months ago
- Multi-Aspect Controllable Text Generation with Disentangled Counterfactual Augmentation, ACL 2024 (main) ☆13 · Sep 23, 2024 · Updated last year
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos" ☆20 · Dec 14, 2025 · Updated 2 months ago
- Preprint | Previously at GenBio ICML 2025 ☆18 · Aug 20, 2025 · Updated 6 months ago
- Graph-based neural tactic prediction models for Coq. ☆15 · Sep 17, 2025 · Updated 5 months ago
- ☆13 · Jun 18, 2024 · Updated last year
- Contains the model patches and the eval logs from the passing swe-bench-lite run. ☆10 · Jun 28, 2024 · Updated last year
- Join multiple photos with pure javascript library ☆12 · Jul 11, 2015 · Updated 10 years ago
- [ICLR'25] Do Egocentric Video-Language Models Truly Understand Hand-Object Interactions? ☆12 · Apr 11, 2025 · Updated 10 months ago
- Universal Reasoning Model ☆125 · Jan 15, 2026 · Updated last month
- ☆59 · Nov 18, 2024 · Updated last year
- (ACM MM24) This is the official repository of GIST: Improving Parameter Efficient Fine Tuning via Knowledge Interaction. ☆11 · Jan 28, 2024 · Updated 2 years ago
- Code for the paper "Symmetric Machine Theory of Mind", presented at ICML 2022. ☆12 · Jul 18, 2022 · Updated 3 years ago
- Spacedrive native dependencies ☆13 · Apr 8, 2025 · Updated 10 months ago
- ☆13 · May 28, 2025 · Updated 9 months ago
- ☆21 · Jun 4, 2025 · Updated 9 months ago
- Aligning Agentic World Models via Knowledgeable Experience Learning ☆31 · Jan 25, 2026 · Updated last month
- Annotations for the Mistake Detection benchmark of Assembly101 ☆10 · Aug 3, 2023 · Updated 2 years ago
- ☆10 · Sep 6, 2024 · Updated last year
- GraphRag vs Embeddings ☆16 · Jul 14, 2024 · Updated last year
- A Decade of Action Quality Assessment: Largest Systematic Survey of Trends, Challenges, and Future Directions ☆15 · Jan 22, 2026 · Updated last month
- Codebase for "Linking Surface Facts to Large-Scale Knowledge Graphs" (EMNLP 2023) ☆13 · May 8, 2024 · Updated last year
- F-16 is a powerful video large language model (LLM) that perceives high-frame-rate videos, which is developed by the Department of Electr… ☆34 · Jul 3, 2025 · Updated 8 months ago