ycpNotFound / GeoGenLinks
A pipeline for the automatic construction of geometry problems along with step-by-step solutions.
☆14Updated 2 months ago
Alternatives and similar repositories for GeoGen
Users that are interested in GeoGen are comparing it to the libraries listed below
Sorting:
- The implement of geometric solver PGPSNet☆29Updated 9 months ago
- The first end-to-end deep learning model for explicit plane geometry diagram parsing.☆50Updated 10 months ago
- ☆37Updated last year
- Data and Code for ACL 2021 Paper "Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning"☆167Updated 7 months ago
- [ACL 2024] ChartAssistant is a chart-based vision-language model for universal chart comprehension and reasoning.☆130Updated last year
- A Survey of Multimodal Retrieval-Augmented Generation☆19Updated last week
- 【ArXiv】PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling☆126Updated 5 months ago
- [ICLR'25] Geometric Problem Solving Through Unified Formalized Vision-Language Pre-training☆46Updated 9 months ago
- This repository is the codebase of TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy☆47Updated last year
- ☆72Updated 5 months ago
- This is the Repository for Geometry Problem Solving Method Evaluation☆26Updated last year
- Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models☆63Updated last year
- 本项目用于大模型数学解题能力方面的数据集合成,模型训练及评测,相关文章记录。☆97Updated last year
- Official Implementation of ACL 2021 paper “GeoQA: A Geometric Question Answering Benchmark Towards Multimodal Numerical Reasoning”.☆68Updated 3 years ago
- ☆13Updated 3 months ago
- 从零到一实现了一个多模态大模型,并命名为Reyes(睿视),R:睿,eyes:眼。Reyes的参数量为8B,视觉编码器使用的是InternViT-300M-448px-V2_5,语言模型侧使用的是Qwen2.5-7B-Instruct,Reyes也通过一个两层MLP投影层连…☆26Updated 8 months ago
- Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations☆102Updated last month
- ☆31Updated 3 months ago
- Official repository for "TrustGeoGen: Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving"☆20Updated 2 months ago
- MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models☆31Updated 9 months ago
- [ACL'2024 Findings] GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation☆69Updated last year
- [arXiv 25] Aesthetics is Cheap, Show me the Text: An Empirical Evaluation of State-of-the-Art Generative Models for OCR☆236Updated 2 months ago
- Latest Advances on Reasoning of Multimodal Large Language Models (Multimodal R1 \ Visual R1) ) 🍓☆35Updated 7 months ago
- An Arena-style Automated Evaluation Benchmark for Detailed Captioning☆56Updated 5 months ago
- Offical Repository of "AtomThink: Multimodal Slow Thinking with Atomic Step Reasoning"☆57Updated 3 months ago
- Oracle Bone Script data collected by VLRLab of HUST☆58Updated last year
- AI-assisted Deciphering Oracle Bone Script☆59Updated 2 months ago
- GPG: A Simple and Strong Reinforcement Learning Baseline for Model Reasoning☆167Updated 3 weeks ago
- [AAAI 2025]Math-PUMA: Progressive Upward Multimodal Alignment to Enhance Mathematical Reasoning☆41Updated 6 months ago
- ☆47Updated 8 months ago