allenai/Lila

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/allenai/Lila)

allenai / Lila

A unified benchmark for math reasoning

☆90

Alternatives and similar repositories for Lila

Users that are interested in Lila are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jasonrute / thoughts-on-ai-for-theorem-proving
View on GitHub
☆27Nov 1, 2021Updated 4 years ago
NeuraSearch / NeurIPS-2022-Submission-3358
View on GitHub
This is the code for the Submission 3358 at NeurIPS 2022.
☆22Dec 21, 2022Updated 3 years ago
wellecks / naturalprover
View on GitHub
NaturalProver: Grounded Mathematical Proof Generation with Language Models
☆40Mar 24, 2023Updated 3 years ago
allenai / numglue
View on GitHub
NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks
☆20May 10, 2022Updated 4 years ago
zhangir-azerbayev / repl
View on GitHub
A simple REPL for Lean 4, returning information about errors and sorries.
☆12Jun 19, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ChengpengLi1003 / DotaMath
View on GitHub
☆30Dec 27, 2024Updated last year
chen-judge / UniGeo
View on GitHub
[EMNLP 22] UniGeo: Unifying Geometry Logical Reasoning via Reformulating Mathematical Expression
☆34Dec 7, 2022Updated 3 years ago
psunlpgroup / MultiHiertt
View on GitHub
Data and code for ACL 2022 paper "MultiHiertt: Numerical Reasoning over Multi Hierarchical Tabular and Textual Data"
☆54Oct 22, 2024Updated last year
lupantech / PromptPG
View on GitHub
Data and code for the ICLR 2023 paper "Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning".
☆165Dec 27, 2023Updated 2 years ago
jesse-michael-han / lean-step-public
View on GitHub
Proof artifact co-training for Lean
☆43Dec 29, 2022Updated 3 years ago
RUCAIBox / JiuZhang3.0
View on GitHub
The code and data for the paper JiuZhang3.0
☆49May 26, 2024Updated 2 years ago
ai4reason / ATP_Proofs
View on GitHub
Interesting ATP Proofs
☆13Sep 3, 2021Updated 4 years ago
lupantech / dl4math
View on GitHub
Resources of deep learning for mathematical reasoning (DL4MATH).
☆374Dec 22, 2023Updated 2 years ago
lupantech / ineqmath
View on GitHub
Solving Inequality Proofs with Large Language Models.
☆61Dec 15, 2025Updated 7 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
jasonrute / lean_proof_recording
View on GitHub
Proof recording for Lean 3
☆27Sep 30, 2021Updated 4 years ago
protagolabs / odyssey-math
View on GitHub
☆84Jan 25, 2025Updated last year
cmu-l3 / ntp-toolkit
View on GitHub
Neural theorem proving toolkit: data extraction tools for Lean 4
☆36Jul 13, 2026Updated last week
OpenBMB / OlympiadBench
View on GitHub
[ACL 2024]Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scie…
☆195Jun 8, 2025Updated last year
allenai / DrawEduMath
View on GitHub
Can VLMs understand students' hand-drawn math work?
☆19Jan 20, 2026Updated 6 months ago
HKUNLP / subgoal-theorem-prover
View on GitHub
Code for the paper "Decomposing the Enigma: Subgoal-based Demonstration Learning for Formal Theorem Proving"
☆20May 25, 2023Updated 3 years ago
PremiLab-Math / MathCheck
View on GitHub
[ICLR 2025] Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist
☆34Oct 23, 2024Updated last year
Lagooon / LeanSTaR
View on GitHub
☆44Sep 19, 2024Updated last year
chaochun / nlu-asdiv-dataset
View on GitHub
☆52Jul 4, 2023Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
TIGER-AI-Lab / CritiqueFineTuning
View on GitHub
Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025]
☆182Jul 8, 2025Updated last year
gpoesia / peano
View on GitHub
An environment for learning formal mathematical reasoning from scratch
☆72Aug 18, 2024Updated last year
cmu-l3 / llmlean
View on GitHub
LLMs + Lean, on your laptop or in the cloud
☆213Oct 10, 2025Updated 9 months ago
wenhuchen / TheoremQA
View on GitHub
The dataset and code for paper: TheoremQA: A Theorem-driven Question Answering dataset
☆161Apr 23, 2024Updated 2 years ago
yilunzhao / Awsome-Table-Reasoning
View on GitHub
A comprehensive paper list of Reasoning over Tables.
☆30Nov 6, 2022Updated 3 years ago
zwx980624 / mwp-cl
View on GitHub
☆26Apr 8, 2022Updated 4 years ago
albertqjiang / Portal-to-ISAbelle
View on GitHub
https://albertqjiang.github.io/Portal-to-ISAbelle/
☆58Sep 6, 2023Updated 2 years ago
nusnlp / paraphrasing-squad
View on GitHub
Datasets for the paper "Improving the Robustness of Question Answering Systems to Question Paraphrasing" (ACL 2019)
☆27Aug 7, 2019Updated 6 years ago
dqxiu / PLMs-with-Knowledge
View on GitHub
☆16Apr 11, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
jesse-michael-han / lean-tpe-public
View on GitHub
The Lean Theorem Proving Environment
☆15May 7, 2023Updated 3 years ago
BartoszPiotrowski / lean-premise-selection
View on GitHub
☆22Jan 14, 2026Updated 6 months ago
sail-sg / symbolic-instruction-tuning
View on GitHub
The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".
☆65Apr 18, 2023Updated 3 years ago
mandyyyyii / scibench
View on GitHub
☆132Jul 8, 2024Updated 2 years ago
microsoft / ToRA
View on GitHub
ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting wit…
☆1,122Feb 22, 2024Updated 2 years ago
cyzhh / MMOS
View on GitHub
Mix of Minimal Optimal Sets (MMOS) of dataset has two advantages for two aspects, higher performance and lower construction costs on math…
☆73Jul 27, 2024Updated last year
FreedomIntelligence / OVM
View on GitHub
☆74Apr 2, 2024Updated 2 years ago