Official Repo for MageBench: Bridging Large Multimodal Models to Agents
☆22Jan 8, 2025Updated last year
Alternatives and similar repositories for MageBench
Users that are interested in MageBench are comparing it to the libraries listed below
Sorting:
- Mitigating Open-Vocabulary Caption Hallucinations (EMNLP 2024)☆18Oct 18, 2024Updated last year
- TrackGPT: Track What You Need in Videos via Text Prompts☆25May 16, 2023Updated 2 years ago
- This repo is reproduction resources for linear alignment paper, still working☆18May 19, 2024Updated last year
- 🎮Manipulates mobile phones just like how you would. Official code for "MobA: Multifaceted Memory-Enhanced Adaptive Planning for Efficien…☆27Oct 10, 2025Updated 5 months ago
- ☆13Jan 19, 2026Updated 2 months ago
- Training DIAMOND to play MarioKart64 in a Neural Network.☆30Sep 9, 2025Updated 6 months ago
- Source code for the Paper "Mind the Gap: Benchmarking Spatial Reasoning in Vision-Language Models"☆19Feb 1, 2026Updated last month
- Debian packaging for NNCP [archived], moved to https://salsa.debian.org/go-team/packages/nncp☆14Feb 18, 2023Updated 3 years ago
- Agentic Keyframe Search for Video Question Answering☆16Apr 7, 2025Updated 11 months ago
- An in-context learning research testbed☆19Mar 16, 2025Updated last year
- ☆10Apr 7, 2025Updated 11 months ago
- An automated data pipeline scaling RL to pretraining levels☆74Oct 11, 2025Updated 5 months ago
- [IJCV 2024]☆21Nov 11, 2024Updated last year
- ☆16Feb 12, 2026Updated last month
- [CVPR'25] Official code of paper "Mimic In-Context Learning for Multimodal Tasks"☆24Mar 10, 2026Updated last week
- Rationale-enhanced language models are better continual relation learners (EMNLP 2023 Main Conference)☆12Oct 11, 2023Updated 2 years ago
- Official PyTorch implementation of: "Cannot See the Forest for the Trees: Aggregating Multiple Viewpoints to Better Classify Objects in V…☆14Aug 29, 2022Updated 3 years ago
- THOUGHTSCULPT, a general reasoning and search method for complex tasks☆13Dec 13, 2024Updated last year
- [AAAI 2024] Point-DETR3D: Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-Supervised 3D Object Detection☆10Jan 24, 2025Updated last year
- ☆10Jul 30, 2024Updated last year
- ☆13May 26, 2017Updated 8 years ago
- [ECCV 2024] Official Implementation of CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings☆11Feb 24, 2025Updated last year
- Official implementation of the paper "LTrack: Generalizing Multiple Object Tracking to Unseen Domains by Introducing Natural Language Rep…☆12Jul 26, 2023Updated 2 years ago
- Synthesize bio-plausible neural networks for cognitive tasks, mimicking brain architecture☆11Apr 14, 2021Updated 4 years ago
- The public reproducible analysis code used for the gaze project☆11Feb 21, 2026Updated last month
- CVMHT : Complementary-View Multiple Human Tracking (AAAI 2020).☆10Dec 9, 2021Updated 4 years ago
- Code associated with the paper: "Few-Shot Self-Rationalization with Natural Language Prompts"☆13Apr 27, 2022Updated 3 years ago
- The official implementation of the TIP 2025 paper UncTrack: Reliable Visual Object Tracking with Uncertainty-Aware Prototype Memory Netwo…☆14Jun 16, 2025Updated 9 months ago
- The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25)☆14Jun 26, 2025Updated 8 months ago
- Codebase for Mechanistic Mode Connectivity☆13Jul 14, 2023Updated 2 years ago
- ECCV 2024 DTC Dataset Tooling☆22Jan 12, 2026Updated 2 months ago
- [ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang☆16May 4, 2023Updated 2 years ago
- Repo for Anonymous purpose, pls don't distribute☆10Oct 2, 2024Updated last year
- [IROS'25] COCMT☆12Aug 14, 2025Updated 7 months ago
- ☆15Jul 9, 2025Updated 8 months ago
- Tight Mutual Information Estimation With Contrastive Fenchel-Legendre Optimization☆11Nov 29, 2022Updated 3 years ago
- [ACL 2025 Findings] "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA"☆25Feb 21, 2025Updated last year
- ☆15May 13, 2022Updated 3 years ago
- Automatically replace full publication names in a bibtex database file into official abbreviated names, or reverse. (Support IEEE/ACM/Sci…☆14Jul 30, 2024Updated last year