abdulhaim / LMRL-Gym
☆78 Updated 7 months ago
Alternatives and similar repositories for LMRL-Gym:
Users who are interested in LMRL-Gym are comparing it to the libraries listed below.
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL" ☆127 Updated 10 months ago
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment" ☆120 Updated 3 months ago
- ☆95 Updated 7 months ago
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers" ☆97 Updated 10 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision ☆115 Updated 5 months ago
- This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity ☆40 Updated last year
- ☆90 Updated 3 weeks ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024] ☆125 Updated 2 months ago
- SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. … ☆129 Updated 10 months ago
- ☆45 Updated this week
- Repository for the paper Stream of Search: Learning to Search in Language ☆137 Updated last week
- AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback ☆101 Updated last year
- ☆141 Updated 9 months ago
- ☆27 Updated 3 months ago
- ☆171 Updated last year
- ☆106 Updated 3 weeks ago
- Natural Language Reinforcement Learning ☆71 Updated last month
- Benchmarking Agentic LLM and VLM Reasoning On Games ☆114 Updated last week
- Can Language Models Solve Olympiad Programming? ☆110 Updated last month
- Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"