ekinakyurek / gpt3-arithmetic
Scratchpad/Chain-of-Thought Prompts
☆12Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for gpt3-arithmetic
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆44Updated 10 months ago
- Supporting code for ReCEval paper☆26Updated 2 months ago
- code for "Natural Language to Code Translation with Execution"☆39Updated 2 years ago
- Official implementation of DPFM @ ICLR 2024 paper "Augmenting Math Word Problems via Iterative Question Composing"(https://arxiv.org/abs/…☆15Updated 8 months ago
- Code for the paper "Decomposing the Enigma: Subgoal-based Demonstration Learning for Formal Theorem Proving"☆17Updated last year
- PyTorch code for the RetoMaton paper: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022)☆71Updated 2 years ago
- ☆40Updated 2 years ago
- ☆31Updated 7 months ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆29Updated last week
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆56Updated last year
- ☆18Updated 5 months ago
- Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code.☆18Updated last year
- ☆89Updated 11 months ago
- ☆21Updated 2 weeks ago
- NaturalProver: Grounded Mathematical Proof Generation with Language Models☆34Updated last year
- ☆34Updated 3 months ago
- Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond"☆59Updated 7 months ago
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆47Updated 4 months ago
- Code for generating the JuICe dataset.☆37Updated 3 years ago
- Source code and data for The Magic of IF: Investigating Causal Reasoning Abilities in Large Language Models of Code (Findings of ACL 2023…☆29Updated last year
- A unified benchmark for math reasoning☆87Updated last year
- Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge"☆73Updated last year
- Evaluate the Quality of Critique☆35Updated 5 months ago
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data☆30Updated 3 months ago
- Tasks for describing differences between text distributions.☆16Updated 3 months ago
- the instructions and demonstrations for building a formal logical reasoning capable GLM☆52Updated 2 months ago
- Repository for Skill Set Optimization☆12Updated 3 months ago
- Influence Experiments☆35Updated last year
- The LM Contamination Index is a manually created database of contamination evidences for LMs.☆76Updated 7 months ago