RobertTLange / deep-rl-tutorial
A Tutorial on Deep Reinforcement Learning in PyTorch
☆31Updated last year
Alternatives and similar repositories for deep-rl-tutorial:
Users that are interested in deep-rl-tutorial are comparing it to the libraries listed below
- A curated list of papers presented in the 📖"Flexible Learning Reading Group" @ TU Berlin. Join us! 🤗☆27Updated 4 years ago
- Official codebase for Adaptive Online Planning for Continual Lifelong Learning.☆16Updated 4 years ago
- Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)☆44Updated 2 years ago
- Map-Elites based on Evolution Strategies☆31Updated 2 years ago
- ☆43Updated 3 years ago
- Implicit Normalizing Flows + Reinforcement Learning☆60Updated 5 years ago
- A collection of code investigating the use of information theory for abstractions in RL☆16Updated 6 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆48Updated 2 years ago
- Progress, Notes, Summaries and a lot of Questions on Machine Learning☆55Updated 5 years ago
- Reinforcement learning algorithms in RLlib☆56Updated 8 months ago
- Understanding RL vision Distill article☆23Updated last year
- Baselines for gymnax 🤖☆61Updated last year
- Logarithmic Reinforcement Learning☆26Updated last year
- Variational Reinforcement Learning☆16Updated 6 months ago
- 🤖 Reinforcement Learning paper summaries, notebooks, and articles.☆26Updated 4 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Updated 3 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆20Updated 4 years ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.☆17Updated 6 years ago
- Toy environment set for multi-agent reinforcement learning and more☆38Updated 2 months ago
- Code for Deep Reinforcement and InfoMax Learning (Neurips 2020)☆10Updated 3 years ago
- Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper.☆37Updated 4 years ago
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions☆29Updated 4 years ago
- Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583☆17Updated 5 years ago
- **Sferes2 module** A unifying modular framework for Quality-Diversity algorithms☆22Updated 4 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆26Updated 2 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm☆44Updated 6 years ago
- A Towers of Hanoi environment in OpenAI Gym Style☆13Updated 5 years ago
- Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs☆15Updated 8 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 6 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆85Updated 3 years ago