chen-judge / UniGeo
[EMNLP 22] UniGeo: Unifying Geometry Logical Reasoning via Reformulating Mathematical Expression
☆27Updated 2 years ago
Alternatives and similar repositories for UniGeo:
Users that are interested in UniGeo are comparing it to the libraries listed below
- Official Implementation of ACL 2021 paper “GeoQA: A Geometric Question Answering Benchmark Towards Multimodal Numerical Reasoning”.☆53Updated 3 years ago
- Data and Code for ACL 2021 Paper "Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning"☆134Updated 2 months ago
- ☆14Updated 9 months ago
- ☆47Updated last month
- ☆16Updated last year
- The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".☆63Updated last year
- Implementation of the methods described in our paper "Explicit Planning Helps Language Models in Logical Reasoning"☆22Updated last year
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o…☆21Updated 2 months ago
- Source code for the paper "Prefix Language Models are Unified Modal Learners"☆43Updated last year
- VaLM: Visually-augmented Language Modeling. ICLR 2023.☆56Updated last year
- 🤖ConvRe🤯: An Investigation of LLMs’ Inefficacy in Understanding Converse Relations (EMNLP 2023)☆23Updated last year
- ☆49Updated last year
- The code and data for the paper JiuZhang3.0☆40Updated 8 months ago
- ☆23Updated 5 months ago
- official repo for the paper "Learning From Mistakes Makes LLM Better Reasoner"☆59Updated last year
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…☆58Updated 2 years ago
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆22Updated 11 months ago
- Data and code for NeurIPS 2021 Paper "IconQA: A New Benchmark for Abstract Diagram Understanding and Visual Language Reasoning".☆51Updated last year
- ☆93Updated last year
- Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022☆30Updated last year
- ☆82Updated 3 weeks ago
- [AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy☆48Updated 2 months ago
- Code & data for ICLR 2024 spotlight paper: 🍯MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data☆39Updated 8 months ago
- The implement of geometric solver PGPSNet☆22Updated 3 weeks ago
- ☆60Updated 2 years ago
- Our code will be public soon .☆26Updated last year
- Visual and Embodied Concepts evaluation benchmark☆21Updated last year
- Research code for "KAT: A Knowledge Augmented Transformer for Vision-and-Language"☆63Updated 2 years ago
- A unified benchmark for math reasoning☆87Updated 2 years ago
- Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073☆26Updated 7 months ago