algopapi / EvoPrompting_Reinforcement_learningLinks
☆24Updated 2 years ago
Alternatives and similar repositories for EvoPrompting_Reinforcement_learning
Users that are interested in EvoPrompting_Reinforcement_learning are comparing it to the libraries listed below
Sorting:
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆100Updated 2 years ago
- ☆144Updated last year
- ☆34Updated last year
- ☆78Updated 2 years ago
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆29Updated last year
- Langchain implementation of HuggingGPT☆134Updated 2 years ago
- ☆86Updated 2 years ago
- Official repo of Respond-and-Respond: data, code, and evaluation☆104Updated last year
- ☆137Updated 2 years ago
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆117Updated 2 years ago
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents☆133Updated last year
- Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools☆144Updated 2 years ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆38Updated last year
- Evaluating LLMs with CommonGen-Lite☆93Updated last year
- ☆86Updated last year
- ☆48Updated last year
- The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"☆19Updated last year
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI☆95Updated 2 years ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated last year
- Advanced Reasoning Benchmark Dataset for LLMs☆47Updated 2 years ago
- Based on the tree of thoughts paper☆48Updated 2 years ago
- Open Implementations of LLM Analyses☆108Updated last year
- ☆122Updated last year
- ☆173Updated 2 years ago
- [NAACL 2024] Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models☆86Updated last year
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆66Updated 2 years ago
- Mixing Language Models with Self-Verification and Meta-Verification☆111Updated last year
- Reward Model framework for LLM RLHF☆61Updated 2 years ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆104Updated 7 months ago
- ToolBench, an evaluation suite for LLM tool manipulation capabilities.☆168Updated last year