Official Repository: A Comprehensive Benchmark for Logical Reasoning in MLLMs
☆45Jun 17, 2025Updated 8 months ago
Alternatives and similar repositories for MME-Reasoning
Users that are interested in MME-Reasoning are comparing it to the libraries listed below
Sorting:
- (ICCV-2025 Official Code)) Improving Generalist Model with Domain-Specific Experts☆87Oct 29, 2025Updated 4 months ago
- ☆13Apr 23, 2025Updated 10 months ago
- ☆24Jun 18, 2025Updated 8 months ago
- Vision-Language based Visual Object Tracking☆27Oct 10, 2025Updated 4 months ago
- [T-PAMI 2024] & [CVPR 2023] Vote2Cap-DETR; A set-to-set perspective towards 3D Dense Captioning; State-of-the-Art 3D Dense Captioning met…☆104Aug 17, 2024Updated last year
- [ICLR 2026] RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling☆32Feb 1, 2026Updated 3 weeks ago
- MLEvolve is an open-source autonomous system for end-to-end machine learning algorithm design and optimization powered by progressive sea…☆112Feb 16, 2026Updated last week
- [Blog 1] Recording a bug of grpo_trainer in some R1 projects☆22Feb 23, 2025Updated last year
- Official Repository of OmniCaptioner☆169Apr 23, 2025Updated 10 months ago
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]☆34Sep 1, 2025Updated 5 months ago
- [ICML 2025] Official Repo for Stability-guided Adaptive Diffusion Acceleration. 🚀🌙Accelerating off-the-shelf diffusion model with a uni…☆39Jul 24, 2025Updated 7 months ago
- This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"☆31Dec 23, 2024Updated last year
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 8 months ago
- The official implementation of "Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers" (arXiv …☆50Jun 6, 2025Updated 8 months ago
- Official Repository for paper "HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding"☆57Jan 23, 2026Updated last month
- PhyX: Does Your Model Have the "Wits" for Physical Reasoning?☆51Feb 15, 2026Updated last week
- image retrieval using metric learning☆10Nov 22, 2022Updated 3 years ago
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.☆86Nov 10, 2024Updated last year
- official implementation of β-DARTS: Beta-Decay Regularization for Differentiable Architecture Search (CVPR22 oral).☆86Mar 29, 2022Updated 3 years ago
- several examples of the learning of the java☆11Nov 22, 2023Updated 2 years ago
- ☆25Aug 19, 2025Updated 6 months ago
- [ICCV 2025] Official code for paper: Beyond Text-Visual Attention: Exploiting Visual Cues for Effective Token Pruning in VLMs☆68Jul 1, 2025Updated 7 months ago
- Embodied-Planner-R1: Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning☆25Jan 5, 2026Updated last month
- ☆10Feb 22, 2022Updated 4 years ago
- A Google Chrome Extension that replaces the official New Tab page with a beautiful to-do list.☆12Mar 7, 2018Updated 7 years ago
- Code for ICML21 paper "Learning Self-Modulating Attention in Continuous Time Space with Applications to Sequential Recommendation"☆12Feb 8, 2023Updated 3 years ago
- Disable YubiKey output on MacOS without a modifier key pressed☆10Aug 10, 2022Updated 3 years ago
- Dynamic Context Selection for Efficient Long-Context LLMs☆56May 20, 2025Updated 9 months ago
- ECG analysis to classify anterior myocardial infarction cases.☆10May 17, 2017Updated 8 years ago
- ☆10Nov 17, 2022Updated 3 years ago
- [ICLR 2025] Dobi-SVD : Differentiable SVD for LLM Compression and Some New Perspectives"☆50Oct 19, 2025Updated 4 months ago
- Evaluating AlexNet features at various depths☆40Oct 13, 2020Updated 5 years ago
- A collection of VLMs papers, blogs, and projects, with a focus on VLMs in Autonomous Driving and related reasoning techniques.☆11Nov 16, 2024Updated last year
- ☆11Feb 28, 2024Updated 2 years ago
- ☆14Sep 6, 2024Updated last year
- This is the source code for Efficient Sequential Recommendation for Long Term User Interest Via Personalization.☆22Nov 18, 2025Updated 3 months ago
- Adaptive Topology Reconstruction for Robust Graph Representation Learning [Efficient ML Model]☆10Feb 11, 2025Updated last year
- ☆21Sep 25, 2025Updated 5 months ago
- This repository contains code used for our Multi Sentence Inference NAACL'22 paper.☆12Mar 6, 2023Updated 2 years ago