RU-Automated-Reasoning-Group / pi-PRLLinks
ICLR'22 Programmatic Reinforcement Learning
☆16Updated 2 years ago
Alternatives and similar repositories for pi-PRL
Users that are interested in pi-PRL are comparing it to the libraries listed below
Sorting:
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆56Updated last year
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆67Updated 2 years ago
- Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆69Updated last year
- ☆31Updated 2 years ago
- Official code repository for Prompt-DT.☆113Updated 2 years ago
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆52Updated last year
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning☆20Updated 2 years ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆59Updated 9 months ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆115Updated 3 years ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆87Updated 2 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆63Updated last year
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆163Updated last year
- ☆48Updated last year
- Exploring techniques to generate diverse conventions in multi-agent settings☆15Updated last year
- Continual reinforcement learning baselines: experiment specifications, implementation of existing methods, and common metrics. Easily ext…☆119Updated 2 years ago
- Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL, published in ICLR 2023☆32Updated 7 months ago
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)☆42Updated 11 months ago
- ☆46Updated 2 years ago
- Synthetic Experience Replay☆94Updated last year
- ☆89Updated 2 years ago
- ☆24Updated 2 years ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆103Updated last year
- Code for "Masked Autoencoding for Scalable and Generalizable Decision Making". NeurIPS 2022☆44Updated last year
- Github repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options☆45Updated 3 years ago
- ☆32Updated last year
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization☆35Updated 2 years ago
- Implementation of ICML 2023 paper: Future-conditioned Unsupervised Pretraining for Decision Transformer☆28Updated last year
- ☆17Updated last year
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆57Updated 2 years ago
- Code for "On the Utility of Learning about Humans for Human-AI Coordination"☆108Updated 2 years ago