chen-judge / UniGeo
[EMNLP 22] UniGeo: Unifying Geometry Logical Reasoning via Reformulating Mathematical Expression
☆28Updated 2 years ago
Alternatives and similar repositories for UniGeo
Users that are interested in UniGeo are comparing it to the libraries listed below
Sorting:
- Official Implementation of ACL 2021 paper “GeoQA: A Geometric Question Answering Benchmark Towards Multimodal Numerical Reasoning”.☆62Updated 3 years ago
- ☆15Updated last year
- Data and Code for ACL 2021 Paper "Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning"☆148Updated last month
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆24Updated last year
- ☆51Updated 4 months ago
- Conic10K: A large-scale dataset for closed-vocabulary math problem understanding. Accepted to EMNLP2023 Findings.☆25Updated last year
- ☆96Updated last year
- Source code for the paper "Prefix Language Models are Unified Modal Learners"☆43Updated 2 years ago
- The code and data for the paper JiuZhang3.0☆44Updated 11 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆120Updated 8 months ago
- [AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy☆61Updated 5 months ago
- This is the repo for our paper "Mr-Ben: A Comprehensive Meta-Reasoning Benchmark for Large Language Models"☆49Updated 6 months ago
- ☆17Updated last year
- [ICLR'25] Geometric Problem Solving Through Unified Formalized Vision-Language Pre-training☆32Updated 3 months ago
- ☆29Updated 4 months ago
- Llemma formal2formal (tactic prediction) theorem proving experiments☆20Updated last year
- Extending context length of visual language models☆11Updated 4 months ago
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o…☆22Updated last month
- Visual and Embodied Concepts evaluation benchmark☆21Updated last year
- The official repo for "TheoremQA: A Theorem-driven Question Answering dataset" (EMNLP 2023)☆31Updated last year
- 🤖ConvRe🤯: An Investigation of LLMs’ Inefficacy in Understanding Converse Relations (EMNLP 2023)☆23Updated last year
- Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073☆28Updated 10 months ago
- [ICML'24] TroVE: Inducing Verifiable and Efficient Toolboxes for Solving Programmatic Tasks☆26Updated 7 months ago
- Collections of RLxLM experiments using minimal codes☆12Updated 2 months ago
- official repo for the paper "Learning From Mistakes Makes LLM Better Reasoner"☆59Updated last year
- Data and code for the ICLR 2023 paper "Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning".☆152Updated last year
- ☆14Updated 10 months ago
- ☆25Updated 8 months ago
- ☆14Updated 5 months ago
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆81Updated 9 months ago