dunnolab / awesome-in-context-rl
Awesome In-Context RL: A curated list of In-Context Reinforcement Learning - - —
☆159Updated 2 weeks ago
Alternatives and similar repositories for awesome-in-context-rl:
Users that are interested in awesome-in-context-rl are comparing it to the libraries listed below
- XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning - - —☆66Updated 2 months ago
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"☆151Updated 5 months ago
- ☆90Updated 9 months ago
- Vintix: Action Model via In-Context Reinforcement Learning - - —☆34Updated last month
- ☆55Updated last month
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆89Updated 2 weeks ago
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆162Updated last week
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆71Updated last year
- Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆63Updated 10 months ago
- Deep reinforcement learning without experience replay, target networks, or batch updates.☆235Updated last month
- ☆107Updated 3 months ago
- Natural Language Reinforcement Learning☆87Updated 4 months ago
- Official Implementation for "In-Context Reinforcement Learning for Variable Action Spaces"☆87Updated last year
- ☆142Updated 11 months ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆163Updated last year
- a simple and scalable agent for training adaptive policies with sequence-based RL☆119Updated this week
- Official Implementation for "In-Context Reinforcement Learning from Noise Distillation"☆29Updated 7 months ago
- ☆224Updated 5 months ago
- ☆137Updated 5 months ago
- JAX-accelerated Meta-Reinforcement Learning Environments Inspired by XLand and MiniGrid 🏎️☆282Updated 5 months ago
- Benchmarking Agentic LLM and VLM Reasoning On Games☆131Updated this week
- ☆76Updated last year
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆227Updated 5 months ago
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)☆42Updated 9 months ago
- Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC☆55Updated last year
- Code for Contrastive Preference Learning (CPL)☆166Updated 5 months ago
- Simplest and Cleanest DreamerV3 implementation out there☆60Updated last month
- Dateset Reset Policy Optimization☆30Updated last year
- Implementation of the new SOTA for model based RL, from the paper "Improving Transformer World Models for Data-Efficient RL", in Pytorch☆112Updated this week
- (ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training☆265Updated 11 months ago