yardenas / la-mbdaLinks
LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization
β37Updated 2 years ago
Alternatives and similar repositories for la-mbda
Users that are interested in la-mbda are comparing it to the libraries listed below
Sorting:
- π₯ Datasets and env wrappers for offline safe reinforcement learningβ105Updated last year
- Benchmarking RL generalization in an interpretable way.β163Updated 3 months ago
- Inverse Reinforcement Learning via State Marginal Matching, CoRL 2020β45Updated 2 years ago
- Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan β¦β71Updated 2 years ago
- β57Updated 2 years ago
- β31Updated 2 years ago
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline settingβ35Updated 4 years ago
- Simple maze environments using mujoco-pyβ56Updated last year
- Author's PyTorch implementation of TD7 for online and offline RLβ149Updated 2 years ago
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.β68Updated 2 years ago
- Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022β30Updated 2 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"β101Updated 3 years ago
- Code for the paper: "Causal Influence Detection for Improving Efficiency in Reinforcement Learning", by Seitzer, M., SchΓΆlkopf, B., Martiβ¦β46Updated 3 years ago
- Code for the NeurIPS 2021 paper "Safe Reinforcement Learning by Imagining the Near Future"β46Updated 3 years ago
- β54Updated last year
- Implementations of SAILR, PDO, and CSCβ31Updated last year
- Pytorch Implementation for First Order Constrained Optimization in Policy Space (FOCOPS).β29Updated 3 years ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordinationβ26Updated 2 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)β66Updated last year
- Code for MOPO: Model-based Offline Policy Optimizationβ189Updated 3 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPOβ179Updated 3 years ago
- IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARLβ43Updated this week
- ExORL: Exploratory Data for Offline Reinforcement Learningβ116Updated 3 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]β40Updated 3 years ago
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learningβ28Updated 3 years ago
- Model-Based Offline Reinforcement Learningβ50Updated 4 years ago
- Official repository for paper "Conservative Offline Distributional Reinforcement Learning" (NeurIPS 2021)β21Updated 4 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimationβ25Updated 2 years ago
- β201Updated 2 years ago
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Methodβ66Updated 2 years ago