RobertTLange / deep-rl-tutorial
A Tutorial on Deep Reinforcement Learning in PyTorch
☆31Updated last year
Alternatives and similar repositories for deep-rl-tutorial:
Users that are interested in deep-rl-tutorial are comparing it to the libraries listed below
- A curated list of papers presented in the 📖"Flexible Learning Reading Group" @ TU Berlin. Join us! 🤗☆27Updated 4 years ago
- Progress, Notes, Summaries and a lot of Questions on Machine Learning☆55Updated 5 years ago
- Implicit Normalizing Flows + Reinforcement Learning☆60Updated 5 years ago
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning☆33Updated 5 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Updated 3 years ago
- ☆43Updated 3 years ago
- Baselines for gymnax 🤖☆66Updated last year
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆20Updated 4 years ago
- Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)☆44Updated 2 years ago
- Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583☆18Updated 5 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆27Updated 2 years ago
- Map-Elites based on Evolution Strategies☆31Updated 3 years ago
- Variational Reinforcement Learning☆16Updated 7 months ago
- Code for Deep Reinforcement and InfoMax Learning (Neurips 2020)☆10Updated 4 years ago
- Generalised UDRL☆37Updated 2 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆48Updated 2 years ago
- RL agent to play μRTS with Stable-Baselines3 and PyTorch☆26Updated 3 years ago
- Some small scale experiments for my blog posts 📝☆79Updated 2 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 4 years ago
- Reinforcement learning library in JAX.☆100Updated last year
- Vectorization techniques for fast population-based training.☆55Updated 2 years ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.☆17Updated 6 years ago
- ☆28Updated 2 years ago
- Simple, extensible implementations of some meta-learning algorithms in Jax☆9Updated 4 years ago
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions☆29Updated 4 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆44Updated last year
- ☆14Updated 5 years ago
- Minimal A2C/A3C example of an LSTM-based meta-learner.☆13Updated 4 years ago
- Reinforcement learning algorithms in RLlib☆57Updated 10 months ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago