mengdi-li / awesome-RLAIF
A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)
☆194Updated 5 months ago
Alternatives and similar repositories for awesome-RLAIF
Users who are interested in awesome-RLAIF are comparing it to the repositories listed below.
- AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback☆125Updated 9 months ago
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆191Updated last year
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆159Updated last year
- Reasoning with Language Model is Planning with World Model☆185Updated 2 years ago
- A brief and partial summary of RLHF algorithms.☆142Updated 10 months ago
- Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)☆199Updated 2 years ago
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆201Updated 9 months ago
- A paper collection tracking the continuing line of work that started from World Models.☆196Updated last year
- ☆195Updated last year
- [NeurIPS 2024] Agent Planning with World Knowledge Model☆161Updated last year
- ☆220Updated 9 months ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆147Updated last year
- ☆117Updated 11 months ago
- Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" pr…☆115Updated last year
- Implementation of "Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agen…☆289Updated 2 years ago
- AI Alignment: A Comprehensive Survey☆137Updated 2 years ago
- Reference implementation for Token-level Direct Preference Optimization (TDPO)☆151Updated 11 months ago
- code for paper Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning☆43Updated last year
- augmented LLM with self reflection☆135Updated 2 years ago
- Code for NeurIPS 2024 paper "Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs"☆46Updated 10 months ago
- An index of algorithms for reinforcement learning from human feedback (RLHF)☆92Updated last year
- An extensible benchmark for evaluating large language models on planning☆440Updated 3 months ago
- Code and example data for the paper: Rule Based Rewards for Language Model Safety☆203Updated last year
- Must-read Papers on Large Language Model (LLM) Planning.☆436Updated last year
- Critique-out-Loud Reward Models☆71Updated last year
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.☆407Updated 6 months ago
- ☆109Updated last year
- This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language …☆162Updated 8 months ago
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆114Updated this week
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit…☆150Updated last year