algopapi / EvoPrompting_Reinforcement_learningLinks
☆24Updated 2 years ago
Alternatives and similar repositories for EvoPrompting_Reinforcement_learning
Users that are interested in EvoPrompting_Reinforcement_learning are comparing it to the libraries listed below
Sorting:
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆29Updated last year
- ☆48Updated last year
- ☆86Updated 2 years ago
- Mixing Language Models with Self-Verification and Meta-Verification☆112Updated last year
- A repository for transformer critique learning and generation☆89Updated 2 years ago
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)☆193Updated 2 weeks ago
- [NeurIPS 2023] PyTorch code for Can Language Models Teach? Teacher Explanations Improve Student Performance via Theory of Mind☆66Updated 2 years ago
- ☆139Updated 2 years ago
- The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"☆19Updated last year
- ☆78Updated 2 years ago
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents☆132Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆34Updated last year
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"☆43Updated last year
- Small and Efficient Mathematical Reasoning LLMs☆73Updated 2 years ago
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆101Updated 2 years ago
- Based on the tree of thoughts paper☆48Updated 2 years ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆40Updated last year
- Self-Controlled Memory System for LLMs☆49Updated last year
- "Enhancing LLM Factual Accuracy with RAG to Counter Hallucinations: A Case Study on Domain-Specific Queries in Private Knowledge-Bases" b…☆45Updated last year
- Evaluating LLMs with CommonGen-Lite☆93Updated last year
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆119Updated 2 years ago
- ☆87Updated 2 years ago
- Finetune any model on HF in less than 30 seconds☆56Updated last week
- ☆84Updated 2 years ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆40Updated 2 years ago
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"☆100Updated 2 years ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆66Updated 2 years ago
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval☆51Updated last year
- Advanced Reasoning Benchmark Dataset for LLMs☆47Updated 2 years ago
- This is work done by the Oxen.ai Community, trying to reproduce the Self-Rewarding Language Model paper from MetaAI.☆132Updated last year