lansiz / eqpt
The Path to Nash Equilibrium
☆38Updated last year
Related projects ⓘ
Alternatives and complementary repositories for eqpt
- Code for "Learning Inductive Biases with Simple Neural Networks" (Feinman & Lake, 2018).☆21Updated 5 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆90Updated 6 years ago
- Official implementation of DynE, Dynamics-aware Embeddings for RL☆43Updated 3 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation☆30Updated 5 years ago
- Distributed implementation of popular evolutionary methods☆64Updated 6 years ago
- Neuronal Circuit Policies☆40Updated 2 years ago
- Reinforcement Learning Assembly☆92Updated 3 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 5 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆21Updated 6 years ago
- ☆14Updated 5 years ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆30Updated 3 years ago
- Reward Learning by Simulating the Past☆43Updated 5 years ago
- A2C for GVG-AI☆21Updated 6 years ago
- RL agent to play μRTS with Stable-Baselines3 and PyTorch☆25Updated 2 years ago
- Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"☆21Updated 6 years ago
- Public Release of Plan2vec Implementation in pyTorch☆56Updated 2 years ago
- A collection of code investigating the use of information theory for abstractions in RL☆15Updated 6 years ago
- Using Rainbow implementation in Chainer RL for Slime Volleyball Pixel Environment☆23Updated 4 years ago
- ☆35Updated 6 years ago
- Inferring beliefs about dynamics from behavior☆28Updated 6 years ago
- fork of rl-baseline-zoo☆21Updated 4 years ago
- Actor Critic using Kronecker-Factored Trust Region☆19Updated 6 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Updated 3 years ago
- PPO Dash: Improving Generalization in Deep Reinforcement Learning☆16Updated 5 years ago
- ☆42Updated 5 years ago
- Source code for ICLR 2020 paper: "Learning to Guide Random Search"☆39Updated 2 months ago
- Simple tools for statistical analyses in RL experiments☆66Updated 6 years ago
- Minimal and Clean Reinforcement Learning Examples in PyTorch☆41Updated 5 years ago
- A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post☆45Updated 5 years ago
- 🤖 Reinforcement Learning paper summaries, notebooks, and articles.☆26Updated 4 years ago