RamonKaspar / MathPrompterLinks
MathPrompter Implementation: This repository hosts an implementation based on the 'MathPrompter: Mathematical Reasoning Using Large Language Models' paper by Microsoft Research. The code replicates the methods discussed in the paper.
☆14Updated 7 months ago
Alternatives and similar repositories for MathPrompter
Users that are interested in MathPrompter are comparing it to the libraries listed below
Sorting:
- ☆156Updated last year
- Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"☆135Updated 2 years ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…☆69Updated last year
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆56Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆79Updated last year
- This is the official repository for Inheritune.☆115Updated 9 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆116Updated last month
- a curated list of the role of small models in the LLM era☆109Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆78Updated last year
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆242Updated last year
- ☆98Updated 7 months ago
- A framework for few-shot evaluation of language models.☆35Updated 8 months ago
- [ICLR 2024] Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation☆180Updated last year
- ☆120Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆51Updated last year
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆99Updated 2 years ago
- Banishing LLM Hallucinations Requires Rethinking Generalization☆275Updated last year
- ☆129Updated last year
- Official repo of Respond-and-Respond: data, code, and evaluation☆104Updated last year
- ☆78Updated last year
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀☆102Updated 3 months ago
- ☆146Updated last year
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.☆155Updated 2 years ago
- SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433☆110Updated 11 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆122Updated 9 months ago
- Code for NeurIPS LLM Efficiency Challenge☆59Updated last year
- Self-Reflection in LLM Agents: Effects on Problem-Solving Performance☆91Updated 11 months ago
- Set of scripts to finetune LLMs☆38Updated last year
- Small and Efficient Mathematical Reasoning LLMs☆72Updated last year
- Unofficial implementation of https://arxiv.org/pdf/2407.14679☆50Updated last year