erfanMhi / base_reinforcement_learning
This is the code-base that I personally use as the starting point for any reinforcement learning codebase with the purpose of fast experimentation and analysis.
☆12Updated 2 years ago
Alternatives and similar repositories for base_reinforcement_learning:
Users that are interested in base_reinforcement_learning are comparing it to the libraries listed below
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆21Updated 4 years ago
- Understanding RL vision Distill article☆23Updated 2 years ago
- A collection of code investigating the use of information theory for abstractions in RL☆16Updated 6 years ago
- A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post☆46Updated 6 years ago
- ☆43Updated 4 years ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆30Updated 4 years ago
- Variational Reinforcement Learning☆16Updated 8 months ago
- Generalised UDRL☆37Updated 2 years ago
- State Space Models for Reinforcement Learning in Tensorflow☆19Updated 6 years ago
- ☆31Updated 6 years ago
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions☆29Updated 4 years ago
- Implementation of SPW and DPW for Monte Carlo Tree Search in Continuous action/state space☆17Updated last year
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆18Updated 7 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆44Updated last year
- Reinforcement learning algorithms in RLlib☆57Updated 10 months ago
- Code for "Continuous-Time Meta-Learning with Forward Mode Differentiation" (ICLR 2022)☆27Updated 3 years ago
- Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies☆15Updated 6 years ago
- Web version of “Neuroevolution of Self-Interpretable Agents” (https://arxiv.org/abs/2003.08165)☆21Updated 3 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆49Updated 2 years ago
- Contextual Bandits Action Elimination DQN☆21Updated 6 years ago
- Public Release of Plan2vec Implementation in pyTorch☆56Updated 2 years ago
- ☆17Updated 3 years ago
- Reward Learning by Simulating the Past☆44Updated 5 years ago
- Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)☆44Updated 2 years ago
- Auxiliary variable Markov chain Monte Carlo methods☆10Updated 7 years ago
- ☆28Updated 2 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Updated 3 years ago
- Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"☆21Updated 7 years ago
- Code for VIREL: A Variational Inference Framework for Reinforcement Learning☆14Updated 5 years ago
- The official implementation of Memory-efficient DQN algorithm.☆10Updated last year