M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning
☆46 · Jul 17, 2025 · Updated 7 months ago
Alternatives and similar repositories for M2-Reasoning
Users that are interested in M2-Reasoning are comparing it to the libraries listed below
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w… ☆12 · Jun 28, 2025 · Updated 8 months ago
- ☆33 · Jul 15, 2025 · Updated 7 months ago
- MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research ☆22 · Sep 23, 2025 · Updated 5 months ago
- [ACL 2025 Main] (🏆 Outstanding Paper Award) Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Proba… ☆15 · Aug 15, 2025 · Updated 6 months ago
- [ICCV'25] FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model ☆84 · Jul 24, 2025 · Updated 7 months ago
- [ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models … ☆85 · Jan 21, 2026 · Updated last month
- [ICLR 2026] InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models ☆47 · Feb 12, 2026 · Updated 2 weeks ago
- [MTI-LLM@NeurIPS 2025] Official implementation of "PyVision: Agentic Vision with Dynamic Tooling." ☆150 · Jul 22, 2025 · Updated 7 months ago
- [ICCV'25] ScenePainter: Semantically Consistent Perpetual 3D Scene Generation with Concept Relation Alignment ☆36 · Oct 5, 2025 · Updated 4 months ago
- VHTest ☆15 · Oct 31, 2024 · Updated last year
- MegaRAG: Multimodal Graph-based RAG ☆36 · Sep 16, 2025 · Updated 5 months ago
- [CVPR2025] Code Release of F-LMM: Grounding Frozen Large Multimodal Models ☆108 · May 29, 2025 · Updated 9 months ago
- ☆18 · Apr 18, 2025 · Updated 10 months ago
- ☆25 · Jun 18, 2025 · Updated 8 months ago
- [ACL'25 Oral] Code for the paper "UrbanVideo-Bench: Benchmarking Vision-Language Models on Embodied Intelligence with Video Data in Urban… ☆26 · Jul 15, 2025 · Updated 7 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions ☆29 · Oct 9, 2025 · Updated 4 months ago
- A Comprehensive Dataset for Advanced Image Generation and Editing ☆31 · Oct 2, 2025 · Updated 5 months ago
- ☆18 · Oct 28, 2025 · Updated 4 months ago
- The code for "MoPE: Mixture of Prefix Experts for Zero-Shot Dialogue State Tracking" ☆19 · Jan 25, 2025 · Updated last year
- This repository contains code and datasets for our paper on the effects of document multiplicity while the context size is fixed in Retri… ☆18 · Mar 13, 2025 · Updated 11 months ago
- ☆23 · Jul 2, 2025 · Updated 8 months ago
- Chinese-native image generation while compatible with SD eco-system, 1st-gen, AAAI2025 ☆13 · Jun 25, 2024 · Updated last year
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning ☆35 · Aug 28, 2025 · Updated 6 months ago
- Code and data for paper "Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation". ☆23 · Oct 22, 2025 · Updated 4 months ago
- [ICLR'26] Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology ☆75 · Jan 26, 2026 · Updated last month
- ☆39 · Jul 23, 2025 · Updated 7 months ago
- ☆46 · Jun 11, 2025 · Updated 8 months ago
- A Framework for Decoupling and Assessing the Capabilities of VLMs ☆43 · Jun 28, 2024 · Updated last year
- ☆36 · Oct 9, 2025 · Updated 4 months ago
- ☆23 · Apr 19, 2024 · Updated last year
- Test-time Scaling for VAR models ☆31 · Sep 19, 2025 · Updated 5 months ago
- ☆29 · Jul 7, 2025 · Updated 7 months ago
- Multimodal RewardBench ☆62 · Feb 21, 2025 · Updated last year
- ☆111 · Jan 8, 2025 · Updated last year
- Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"? ☆38 · Jun 23, 2025 · Updated 8 months ago
- [NeurIPS 2025] Reasoning MLLM, Share-GRPO, advantage vanishing, sparse reward ☆36 · Sep 19, 2025 · Updated 5 months ago
- Orienting Latent Actions for Video World Modeling ☆77 · Feb 11, 2026 · Updated 2 weeks ago
- DELT: Data Efficacy for Language Model Training ☆43 · Feb 12, 2026 · Updated 2 weeks ago
- Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments ☆48 · Jan 8, 2026 · Updated last month