dunnolab / awesome-in-context-rl
Awesome In-Context RL: A curated list of In-Context Reinforcement Learning - - —
☆150Updated 3 weeks ago
Alternatives and similar repositories for awesome-in-context-rl:
Users that are interested in awesome-in-context-rl are comparing it to the libraries listed below
- XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning - - —☆66Updated last month
- ☆85Updated 9 months ago
- JAX-accelerated Meta-Reinforcement Learning Environments Inspired by XLand and MiniGrid 🏎️☆275Updated 4 months ago
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"☆148Updated 4 months ago
- Official Implementation for "In-Context Reinforcement Learning for Variable Action Spaces"☆83Updated last year
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆80Updated last month
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆71Updated last year
- Deep reinforcement learning without experience replay, target networks, or batch updates.☆230Updated 2 weeks ago
- Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆60Updated 10 months ago
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆156Updated last year
- a simple and scalable agent for training adaptive policies with sequence-based RL☆117Updated last month
- Natural Language Reinforcement Learning☆84Updated 3 months ago
- Challenging Memory-based Deep Reinforcement Learning Agents☆96Updated 5 months ago
- ☆137Updated 4 months ago
- ☆103Updated 2 months ago
- Official Implementation for "In-Context Reinforcement Learning from Noise Distillation"☆29Updated 6 months ago
- Benchmarking Agentic LLM and VLM Reasoning On Games☆126Updated last week
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆39Updated last year
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆161Updated last year
- A library for constrained RLHF.☆13Updated last year
- Vintix: Action Model via In-Context Reinforcement Learning - - —☆33Updated last month
- ☆218Updated 4 months ago
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)☆42Updated 8 months ago
- ☆141Updated 11 months ago
- A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.☆362Updated 11 months ago
- Code for Contrastive Preference Learning (CPL)☆164Updated 4 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆109Updated 7 months ago
- A brief and partial summary of RLHF algorithms.☆127Updated 3 weeks ago
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆223Updated 4 months ago
- ☆74Updated last year