wenhuchen/TheoremQA

The dataset and code for paper: TheoremQA: A Theorem-driven Question Answering dataset

☆161

Alternatives and similar repositories for TheoremQA

Users who are interested in TheoremQA are comparing it to the repositories listed below.

TIGER-AI-Lab / TheoremQA
The official repo for "TheoremQA: A Theorem-driven Question Answering dataset" (EMNLP 2023)
☆40 · May 15, 2024 · Updated 2 years ago
mandyyyyii / scibench
☆132 · Jul 8, 2024 · Updated 2 years ago
TIGER-AI-Lab / Program-of-Thoughts
Data and Code for Program of Thoughts [TMLR 2023]
☆317 · May 15, 2024 · Updated 2 years ago
oriyor / turning_tables
Implementation of the paper: "Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning…
☆22 · Nov 2, 2021 · Updated 4 years ago
qiancheng0 / CREATOR
This is the repository for paper "CREATOR: Tool Creation for Disentangling Abstract and Concrete Reasoning of Large Language Models"
☆31 · Oct 8, 2023 · Updated 2 years ago
OFA-Sys / gsm8k-ScRel
Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models
☆269 · Sep 12, 2024 · Updated last year
lupantech / dl4math
Resources of deep learning for mathematical reasoning (DL4MATH).
☆374 · Dec 22, 2023 · Updated 2 years ago
ChengpengLi1003 / DotaMath
☆30 · Dec 27, 2024 · Updated last year
feyzaakyurek / rl4f
Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.
☆63 · Nov 27, 2024 · Updated last year
wenhuchen / Time-Sensitive-QA
Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions"
☆77 · Mar 3, 2022 · Updated 4 years ago
salesforce / factualNLG
Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond"
☆60 · Jun 2, 2026 · Updated last month
lupantech / PromptPG
Data and code for the ICLR 2023 paper "Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning".
☆165 · Dec 27, 2023 · Updated 2 years ago
whyNLP / Conic10K
Conic10K: A large-scale dataset for closed-vocabulary math problem understanding. Accepted to EMNLP2023 Findings.
☆33 · Dec 6, 2023 · Updated 2 years ago
jeffhj / LM-reasoning
This repository contains a collection of papers and resources on Reasoning in Large Language Models.
☆572 · Nov 13, 2023 · Updated 2 years ago
TIGER-AI-Lab / MAmmoTH
Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" [ICLR 2024]
☆383 · Aug 25, 2024 · Updated last year
FranxYao / GPT-Bargaining
Code for arXiv 2023: Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback
☆207 · May 24, 2023 · Updated 3 years ago
iiis-ai / cumulative-reasoning
[TMLR] Cumulative Reasoning With Large Language Models (https://arxiv.org/abs/2308.04371)
☆308 · Aug 2, 2025 · Updated 11 months ago
allenai / Lila
A unified benchmark for math reasoning
☆90 · Jan 25, 2023 · Updated 3 years ago
iiis-ai / IterativeQuestionComposing
[AAAI 2025] Augmenting Math Word Problems via Iterative Question Composing (https://arxiv.org/abs/2401.09003)
☆23 · Oct 2, 2025 · Updated 9 months ago
zorazrw / trove
[ICML'24] TroVE: Inducing Verifiable and Efficient Toolboxes for Solving Programmatic Tasks
☆33 · Sep 20, 2024 · Updated last year
lancopku / MUKI
[Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models
☆19 · Mar 16, 2023 · Updated 3 years ago
ctlllll / reward_collapse
☆26 · May 30, 2023 · Updated 3 years ago
taoyds / grappa
☆31 · Sep 4, 2021 · Updated 4 years ago
qtli / GSM-Plus
GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.
☆66 · Jul 8, 2024 · Updated 2 years ago
lupantech / chameleon-llm
Codes for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".
☆1,140 · Dec 23, 2023 · Updated 2 years ago
FranxYao / chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
☆2,776 · Aug 4, 2024 · Updated last year
cyzhh / MMOS
Mix of Minimal Optimal Sets (MMOS) of dataset has two advantages for two aspects, higher performance and lower construction costs on math…
☆73 · Jul 27, 2024 · Updated 2 years ago
GanjinZero / RAMM
Codes and Pre-trained models for RAMM: Retrieval-augmented Biomedical Visual Question Answering with Multi-modal Pre-training [ACM MM 202…
☆29 · Nov 2, 2023 · Updated 2 years ago
GAIR-NLP / MathPile
[NeurIPS D&B 2024] Generative AI for Math: MathPile
☆418 · Apr 4, 2025 · Updated last year
blender-nlp / MuMuQA
☆23 · Apr 12, 2022 · Updated 4 years ago
Timothyxxx / Chain-of-ThoughtsPapers
A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".
☆2,105 · Oct 5, 2023 · Updated 2 years ago
wellecks / llmstep
llmstep: [L]LM proofstep suggestions in Lean 4.
☆154 · Nov 11, 2023 · Updated 2 years ago
teacherpeterpan / Unsupervised-Multi-hop-QA
Codes for NAACL 2021 Paper "Unsupervised Multi-hop Question Answering by Question Generation"
☆92 · Nov 16, 2022 · Updated 3 years ago
esteng / regal_program_learning
☆27 · Sep 11, 2024 · Updated last year
xingdi-eric-yuan / imrc_graph_public
Implementation for the EMNLP 2021 paper "Interactive Machine Comprehension with Dynamic Knowledge Graphs".
☆22 · Aug 31, 2021 · Updated 4 years ago
hendrycks / math
The MATH Dataset (NeurIPS 2021)
☆1,377 · Sep 6, 2025 · Updated 10 months ago
zhaoxlpku / SubgoalXL
☆26 · Aug 23, 2024 · Updated last year
wenhuchen / ML-Interview
Preparing for ML Interviews.
☆54 · Jan 12, 2026 · Updated 6 months ago
YuxiXie / SelfEval-Guided-Decoding
☆103 · Dec 7, 2023 · Updated 2 years ago