MasterScrat / rl-insights
🤖 Reinforcement Learning paper summaries, notebooks, and articles.
☆26Updated 4 years ago
Alternatives and similar repositories for rl-insights:
Users that are interested in rl-insights are comparing it to the libraries listed below
- Generalised UDRL☆37Updated 2 years ago
- Variational Reinforcement Learning☆16Updated 6 months ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Updated 3 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- Official implementation of DynE, Dynamics-aware Embeddings for RL☆43Updated 3 years ago
- ☆14Updated 5 years ago
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning☆33Updated 5 years ago
- ☆19Updated 3 years ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.☆36Updated 4 years ago
- RL agent to play μRTS with Stable-Baselines3 and PyTorch☆26Updated 3 years ago
- fork of rl-baseline-zoo☆21Updated 4 years ago
- In Progress : State of the art Distributed Distributional Deep Deterministic Policy Gradient algorithm implementation in pytorch.☆19Updated 6 years ago
- Revisiting Rainbow☆74Updated 3 years ago
- A collection of code investigating the use of information theory for abstractions in RL☆16Updated 6 years ago
- Reward Learning by Simulating the Past☆44Updated 5 years ago
- ICRL 2020☆19Updated 5 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation☆31Updated 5 years ago
- Reinforcement Learning papers on exploration methods.☆20Updated 3 years ago
- ☆28Updated 2 years ago
- ☆35Updated 6 years ago
- Continual Reinforcement Learning in 3D Non-stationary Environments☆37Updated 5 years ago
- Codebase for Efficient yet simple Reinforcement Learning Research Framework☆28Updated 2 years ago
- Ranking Policy Gradient☆23Updated 5 years ago
- Reinforcement Learning Assembly☆92Updated 3 years ago
- Clockwork VAEs in JAX/Flax☆32Updated 3 years ago
- Autoregressive policies for continuous control reinforcement learning☆29Updated 5 years ago
- A2C is a special case of PPO!☆19Updated 2 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆20Updated 4 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆44Updated last year