ekinakyurek / gpt3-arithmetic
Scratchpad/Chain-of-Thought Prompts
☆12Updated 2 years ago
Alternatives and similar repositories for gpt3-arithmetic:
Users that are interested in gpt3-arithmetic are comparing it to the libraries listed below
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆45Updated last year
- Code for the paper "Decomposing the Enigma: Subgoal-based Demonstration Learning for Formal Theorem Proving"☆17Updated last year
- ☆36Updated 5 months ago
- A unified benchmark for math reasoning☆87Updated last year
- NaturalProver: Grounded Mathematical Proof Generation with Language Models☆36Updated last year
- ☆22Updated 4 months ago
- PyTorch code for the RetoMaton paper: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022)☆71Updated 2 years ago
- Evaluate the Quality of Critique☆35Updated 7 months ago
- Official implementation of AAAI 2025 paper "Augmenting Math Word Problems via Iterative Question Composing"(https://arxiv.org/abs/2401.09…☆18Updated last month
- Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code.☆18Updated 2 years ago
- Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond"☆59Updated 9 months ago
- The LM Contamination Index is a manually created database of contamination evidences for LMs.☆75Updated 9 months ago
- A framework for few-shot evaluation of autoregressive language models.☆24Updated last year
- ☆27Updated 10 months ago
- ☆22Updated 2 months ago
- CodeUltraFeedback: aligning large language models to coding preferences☆66Updated 6 months ago
- ☆45Updated last year
- Repository for Skill Set Optimization☆12Updated 5 months ago
- [ICLR 2024] COLLIE: Systematic Construction of Constrained Text Generation Tasks☆52Updated last year
- Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge"☆74Updated last year
- About The corresponding code from our paper " REFINER: Reasoning Feedback on Intermediate Representations" (EACL 2024). Do not hesitate t…☆69Updated 10 months ago
- The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".☆63Updated last year
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆28Updated this week
- Supporting code for ReCEval paper☆27Updated 4 months ago
- ☆18Updated 7 months ago
- Llemma formal2formal (tactic prediction) theorem proving experiments☆19Updated last year
- ☆75Updated last year
- the instructions and demonstrations for building a formal logical reasoning capable GLM☆53Updated 4 months ago
- Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076.☆23Updated 11 months ago
- Reasoning by Communicating with Agents☆23Updated 3 months ago