PKU-RL / AdaRefiner
AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)
☆18Updated last year
Alternatives and similar repositories for AdaRefiner
Users interested in AdaRefiner are comparing it to the libraries listed below
- ☆89Updated 2 years ago
- Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO).☆94Updated 2 years ago
- An elegant PyTorch offline reinforcement learning library for researchers.☆382Updated 6 months ago
- [NeurIPS 2024] Official Implementation of Meta-DT☆51Updated last year
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR 2023)☆166Updated 2 years ago
- Author's PyTorch implementation of TD7 for online and offline RL☆161Updated 2 years ago
- Datasets with baselines for Offline MARL.☆198Updated 2 months ago
- Online Decision Transformer☆273Updated last year
- Official code repository for Prompt-DT.☆120Updated 3 years ago
- Re-implementations of SOTA RL algorithms.☆136Updated 2 years ago
- A collection of offline reinforcement learning algorithms.☆208Updated last year
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆230Updated last year
- ☆116Updated 2 years ago
- Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for MuJoCo control tasks in Open…☆286Updated 3 years ago
- ☆63Updated last year
- Model-based Offline Policy Optimization, re-implemented entirely in PyTorch☆39Updated 2 years ago
- ILSwiss is an easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and Reinforcement Learning (RL) framework (te…☆175Updated 2 years ago
- A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch.☆26Updated 5 months ago
- ☆307Updated 3 years ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆171Updated last year
- Overcooked human-AI experiment platform☆39Updated 2 years ago
- 🚀 A fast safe reinforcement learning library in PyTorch☆234Updated last year
- NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark☆534Updated last month
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆53Updated last year
- NeurIPS 2023: Safe Policy Optimization: A benchmark repository for safe reinforcement learning algorithms☆391Updated last year
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO☆183Updated 3 years ago
- Source code of the ICML 2024 paper "Self-Composing Policies for Scalable Continual Reinforcement Learning" (selected for oral presentation)☆24Updated last year
- ☆362Updated 2 years ago
- Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)☆60Updated last year
- Offline Multi-Agent Reinforcement Learning Implementations: Solving Overcooked Game with Data-Driven Method☆46Updated last year