algopapi / EvoPrompting_Reinforcement_learningLinks
☆24Updated 2 years ago
Alternatives and similar repositories for EvoPrompting_Reinforcement_learning
Users that are interested in EvoPrompting_Reinforcement_learning are comparing it to the libraries listed below
Sorting:
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆37Updated last year
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆99Updated 2 years ago
- Small and Efficient Mathematical Reasoning LLMs☆72Updated last year
- Codes for the EMNLP 2023 Findings paper "Self-Polish: Enhance Reasoning in Large Language Models via Problem Refining" by Zhiheng Xi, Sen…☆31Updated 2 years ago
- The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"☆19Updated last year
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval☆51Updated last year
- ☆67Updated 8 months ago
- ☆34Updated last year
- Advanced Reasoning Benchmark Dataset for LLMs☆47Updated 2 years ago
- ☆31Updated last year
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆29Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification☆110Updated 11 months ago
- ☆86Updated last year
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated last year
- A set of utilities for running few-shot prompting experiments on large-language models☆126Updated 2 years ago
- [NeurIPS 2023] PyTorch code for Can Language Models Teach? Teacher Explanations Improve Student Performance via Theory of Mind☆66Updated last year
- Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation☆48Updated last year
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience"☆62Updated 7 months ago
- Understanding the correlation between different LLM benchmarks☆29Updated last year
- Open Implementations of LLM Analyses☆107Updated last year
- Interactive coding assistant for data scientists and machine learning developers, empowered by large language models.☆98Updated last year
- ToolBench, an evaluation suite for LLM tool manipulation capabilities.☆164Updated last year
- augmented LLM with self reflection☆135Updated 2 years ago
- Official repo of Respond-and-Respond: data, code, and evaluation☆104Updated last year
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆117Updated 2 years ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆66Updated 2 years ago
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents☆131Updated last year
- ☆48Updated last year
- ☆78Updated 2 years ago
- Plug in and play implementation of " Textbooks Are All You Need", ready for training, inference, and dataset generation☆74Updated 2 years ago