ycpNotFound / GeoGenLinks
A pipeline for the automatic construction of geometry problems along with step-by-step solutions.
☆14Updated last month
Alternatives and similar repositories for GeoGen
Users that are interested in GeoGen are comparing it to the libraries listed below
Sorting:
- The implement of geometric solver PGPSNet☆29Updated 7 months ago
- The first end-to-end deep learning model for explicit plane geometry diagram parsing.☆50Updated 9 months ago
- ☆72Updated 4 months ago
- [AAAI 2025]Math-PUMA: Progressive Upward Multimodal Alignment to Enhance Mathematical Reasoning☆38Updated 5 months ago
- A Survey of Multimodal Retrieval-Augmented Generation☆19Updated 5 months ago
- Data and Code for ACL 2021 Paper "Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning"☆163Updated 5 months ago
- [ICLR'25] Geometric Problem Solving Through Unified Formalized Vision-Language Pre-training☆46Updated 8 months ago
- An Arena-style Automated Evaluation Benchmark for Detailed Captioning☆55Updated 3 months ago
- An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.☆145Updated 5 months ago
- Official code implementation of Slow Perception:Let's Perceive Geometric Figures Step-by-step☆130Updated last month
- ☆12Updated 2 months ago
- This is the Repository for Geometry Problem Solving Method Evaluation☆26Updated 11 months ago
- Latest Advances on Reasoning of Multimodal Large Language Models (Multimodal R1 \ Visual R1) ) 🍓☆34Updated 5 months ago
- MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models☆31Updated 8 months ago
- ☆38Updated 11 months ago
- Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations☆99Updated last year
- This repository is the codebase of TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy☆46Updated 11 months ago
- [ACL 2024] ChartAssistant is a chart-based vision-language model for universal chart comprehension and reasoning.☆126Updated last year
- Official Implementation of ACL 2021 paper “GeoQA: A Geometric Question Answering Benchmark Towards Multimodal Numerical Reasoning”.☆68Updated 3 years ago
- Official code implementation of Perception R1: Pioneering Perception Policy with Reinforcement Learning☆261Updated 2 months ago
- 从零到一实现了一个多模态大模型,并命名为Reyes(睿视),R:睿,eyes:眼。Reyes的参数量为8B,视觉编码器使用的是InternViT-300M-448px-V2_5,语言模型侧使用的是Qwen2.5-7B-Instruct,Reyes也通过一个两层MLP投影层连…☆26Updated 7 months ago
- Extrapolating RLVR to General Domains without Verifiers☆163Updated last month
- GPG: A Simple and Strong Reinforcement Learning Baseline for Model Reasoning☆162Updated 4 months ago
- ☆36Updated last year
- Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models☆63Updated 10 months ago
- Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced …☆83Updated 10 months ago
- [CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback☆295Updated last year
- ☆47Updated 7 months ago
- [arXiv 25] Aesthetics is Cheap, Show me the Text: An Empirical Evaluation of State-of-the-Art Generative Models for OCR☆227Updated 3 weeks ago
- Offical Repository of "AtomThink: Multimodal Slow Thinking with Atomic Step Reasoning"☆55Updated last month