ekinakyurek / gpt3-arithmeticLinks

Scratchpad/Chain-of-Thought Prompts

☆12

Alternatives and similar repositories for gpt3-arithmetic

Users that are interested in gpt3-arithmetic are comparing it to the libraries listed below

Sorting:

allenai / Lila
A unified benchmark for math reasoning
☆88Updated 2 years ago
HKUNLP / subgoal-theorem-prover
Code for the paper "Decomposing the Enigma: Subgoal-based Demonstration Learning for Formal Theorem Proving"
☆19Updated 2 years ago
GanjinZero / math401-llm
Source codes and datasets for How well do Large Language Models perform in Arithmetic tasks?
☆56Updated 2 years ago
zorazrw / odex
[EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation
☆48Updated last year
esteng / regal_program_learning
☆24Updated 10 months ago
debjitpaul / refiner
About The corresponding code from our paper " REFINER: Reasoning Feedback on Intermediate Representations" (EACL 2024). Do not hesitate t…
☆70Updated last year
niansong1996 / lever
Code for paper "LEVER: Learning to Verifiy Language-to-Code Generation with Execution" (ICML'23)
☆89Updated 2 years ago
YuxiXie / SelfEval-Guided-Decoding
☆100Updated last year
nkandpa2 / long_tail_knowledge
Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge"
☆77Updated 2 years ago
iiis-ai / IterativeQuestionComposing
Official implementation of AAAI 2025 paper "Augmenting Math Word Problems via Iterative Question Composing"(https://arxiv.org/abs/2401.09…
☆20Updated 7 months ago
shunzh / Code-AI-Tree-Search
☆119Updated last year
asaparov / prontoqa
Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task.
☆147Updated 9 months ago
protagolabs / odyssey-math
☆84Updated 6 months ago
sail-sg / symbolic-instruction-tuning
The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".
☆65Updated 2 years ago
salesforce / factualNLG
Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond"
☆59Updated 6 months ago
rmshin / llm-mcts
☆41Updated last year
nayeon7lee / FactualityPrompt
☆87Updated 2 years ago
Zayne-sprague / MuSR
☆49Updated 11 months ago
archiki / ReCEval
Supporting code for ReCEval paper
☆29Updated 10 months ago
orhonovich / instruction-induction
☆66Updated 3 years ago
FranxYao / Complexity-Based-Prompting
Complexity Based Prompting for Multi-Step Reasoning
☆17Updated 2 years ago
wellecks / naturalprover
NaturalProver: Grounded Mathematical Proof Generation with Language Models
☆38Updated 2 years ago
reasoning-machines / CoCoGen
Language Models of Code are Few-Shot Commonsense Learners (EMNLP 2022)
☆86Updated 2 years ago
princeton-nlp / Collie
[ICLR 2024] COLLIE: Systematic Construction of Constrained Text Generation Tasks
☆52Updated 2 years ago
princeton-nlp / NLProofS
EMNLP 2022: Generating Natural Language Proofs with Verifier-Guided Search https://arxiv.org/abs/2205.12443
☆86Updated 10 months ago
csitfun / LogiCoT
the instructions and demonstrations for building a formal logical reasoning capable GLM
☆53Updated 11 months ago
feyzaakyurek / rl4f
Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.
☆64Updated 8 months ago
zhangir-azerbayev / proof-pile
Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code.
☆21Updated 2 years ago
wzhouad / context-faithful-llm
Code and data for paper "Context-faithful Prompting for Large Language Models".
☆41Updated 2 years ago
allenai / DecomP
Repository for Decomposed Prompting
☆93Updated last year