stratisMarkou / sample-efficient-bayesian-rl
Source for the sample efficient tabular RL submission to the 2019 NIPS workshop on Biological and Artificial RL
☆22Updated 2 years ago
Alternatives and similar repositories for sample-efficient-bayesian-rl:
Users that are interested in sample-efficient-bayesian-rl are comparing it to the libraries listed below
- ☆30Updated 5 years ago
- ☆54Updated last year
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆43Updated 8 months ago
- Codes for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL…☆52Updated 4 years ago
- ☆30Updated last year
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆101Updated 2 years ago
- OPE Tools based on Empirical Study of Off Policy Policy Estimation paper.☆61Updated 2 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆85Updated 3 years ago
- Safe Policy Improvement with Baseline Bootstrapping☆26Updated 4 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆44Updated 4 years ago
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7☆20Updated 6 years ago
- on-policy optimization baselines for deep reinforcement learning☆29Updated 4 years ago
- Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆59Updated 9 months ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆31Updated 3 years ago
- Learning Laplacian Representations in Reinforcement Learning☆17Updated 4 years ago
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization☆34Updated 2 years ago
- ☆42Updated last year
- ☆28Updated 9 months ago
- ☆26Updated last year
- Conservative Q learning in Jax☆52Updated 2 years ago
- Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]☆47Updated 2 years ago
- Mirror Descent Policy Optimization☆38Updated 4 years ago
- ☆97Updated last year
- Implementation of the Prioritized Option-Critic on the Four-Rooms Environment☆16Updated 7 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆60Updated last year
- IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARL☆37Updated 5 months ago
- ☆42Updated 3 years ago
- ☆29Updated 2 years ago
- ☆47Updated 4 years ago
- An implementation of Constrained Policy Optimization (Achiam 2017) in PyTorch☆24Updated 4 years ago