RobertTLange / deep-rl-tutorial
A Tutorial on Deep Reinforcement Learning in PyTorch
☆29Updated last year
Related projects: ⓘ
- Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)☆44Updated last year
- Official codebase for Adaptive Online Planning for Continual Lifelong Learning.☆16Updated 4 years ago
- Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper.☆35Updated 3 years ago
- A curated list of papers presented in the 📖"Flexible Learning Reading Group" @ TU Berlin. Join us! 🤗☆27Updated 3 years ago
- Map-Elites based on Evolution Strategies☆31Updated 2 years ago
- Reinforcement learning algorithms in RLlib☆55Updated 4 months ago
- Progress, Notes, Summaries and a lot of Questions on Machine Learning☆55Updated 4 years ago
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions☆26Updated 4 years ago
- Implicit Normalizing Flows + Reinforcement Learning☆60Updated 5 years ago
- Variational Reinforcement Learning☆16Updated last month
- Understanding RL vision Distill article☆23Updated last year
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 5 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆20Updated 3 years ago
- Vectorization techniques for fast population-based training.☆52Updated 2 years ago
- Model-based reinforcement learning in TensorFlow☆53Updated 3 years ago
- Minimal A2C/A3C example of an LSTM-based meta-learner.☆13Updated 3 years ago
- ☆14Updated 5 years ago
- A Towers of Hanoi environment in OpenAI Gym Style☆12Updated 5 years ago
- Baselines for gymnax 🤖☆57Updated last year
- Source code of "Grid-to-Graph: Flexible Spatial Relational Inductive Biases for Reinforcement Learning" (AAMAS 2021).☆27Updated 3 years ago
- Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583☆17Updated 4 years ago
- Reinforcement Learning via Latent State Decoding☆30Updated last year
- ☆10Updated 4 years ago
- Clockwork VAEs in JAX/Flax☆31Updated 3 years ago
- Some small scale experiments for my blog posts 📝☆78Updated 2 years ago
- Code for "Continuous-Time Meta-Learning with Forward Mode Differentiation" (ICLR 2022)☆27Updated 2 years ago
- Code for Deep Reinforcement and InfoMax Learning (Neurips 2020)☆10Updated 3 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆21Updated 5 years ago
- The Differentiable Cross-Entropy Method☆122Updated 4 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Updated 3 years ago