Candice-yu / GeoLauxLinks
A Benchmark for Evaluating MLLMs' Geometry Performance on Long-Step Problems Requiring Auxiliary Lines
☆30Updated 2 months ago
Alternatives and similar repositories for GeoLaux
Users that are interested in GeoLaux are comparing it to the libraries listed below
Sorting:
- We introduce 'Thinking with Video', a new paradigm leveraging video generation for multimodal reasoning. Our VideoThinkBench shows that S…☆199Updated last week
- Uni-CoT: Towards Unified Chain-of-Thought Reasoning Across Text and Vision☆170Updated 2 weeks ago
- The official repository for the paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"☆111Updated last week
- ☆55Updated 3 months ago
- A Collection of Papers on Diffusion Language Models☆145Updated 2 months ago
- Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing☆79Updated 4 months ago
- Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens (arXiv 2025)☆194Updated 3 months ago
- Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give…☆185Updated last month
- Co-Reinforcement Learning for Unified Multimodal Understanding and Generation☆30Updated 4 months ago
- GPT as a Monte Carlo Language Tree: A Probabilistic Perspective☆45Updated 10 months ago
- [NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient☆62Updated 2 months ago
- Code for the paper "AsFT: Anchoring Safety During LLM Fune-Tuning Within Narrow Safety Basin".☆33Updated 4 months ago
- Official repository of 'ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing’☆57Updated 5 months ago
- Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"?☆36Updated 5 months ago
- Doodling our way to AGI ✏️ 🖼️ 🧠☆113Updated 6 months ago
- ✈️ [ICCV 2025] Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraints☆76Updated 4 months ago
- Official implement of MIA-DPO☆67Updated 10 months ago
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI☆205Updated last month
- Official codebase for the paper Latent Visual Reasoning☆42Updated last month
- ☆78Updated 5 months ago
- ☆283Updated last month
- ☆134Updated last week
- A tiny paper rating web☆38Updated 8 months ago
- (ICLR 2025 Spotlight) Official code repository for Interleaved Scene Graph.☆31Updated 3 months ago
- ☆31Updated 3 months ago
- TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models☆60Updated last week
- [ICLR2025] MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models☆89Updated last year
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆36Updated 10 months ago
- V1: Toward Multimodal Reasoning by Designing Auxiliary Task☆36Updated 7 months ago
- Assessing Context-Aware Creative Intelligence in MLLMs☆23Updated 4 months ago