lansiz / eqpt
The Path to Nash Equilibrium
☆38Updated 2 years ago
Alternatives and similar repositories for eqpt:
Users that are interested in eqpt are comparing it to the libraries listed below
- ☆20Updated 5 years ago
- Reinforcement Learning Assembly☆92Updated 3 years ago
- 🤖 Reinforcement Learning paper summaries, notebooks, and articles.☆26Updated 5 years ago
- Code for human intervention reinforcement learning☆33Updated 7 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆94Updated 6 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 6 years ago
- Reward Learning by Simulating the Past☆44Updated 5 years ago
- Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"☆21Updated 7 years ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆31Updated 4 years ago
- ☆35Updated 6 years ago
- Distributed implementation of popular evolutionary methods☆64Updated 7 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation☆31Updated 6 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Updated 5 years ago
- Logarithmic Reinforcement Learning☆26Updated 2 years ago
- Code used in our paper "Robust Deep Reinforment Learning through Adversarial Loss"☆33Updated last year
- Baselines and memory-based scenarios for the ViZDoom simulator☆34Updated 2 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Updated 2 years ago
- RL agent to play μRTS with Stable-Baselines3 and PyTorch☆26Updated 3 years ago
- ☆17Updated 3 years ago
- Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)☆43Updated 2 years ago
- Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration☆25Updated 5 years ago
- ☆44Updated 6 years ago
- Code for VIREL: A Variational Inference Framework for Reinforcement Learning☆14Updated 5 years ago
- Deep Reinforcement Learning algorithms implemented in PyTorch☆49Updated 6 years ago
- Simple tools for statistical analyses in RL experiments☆66Updated 6 years ago
- Official implementation of DynE, Dynamics-aware Embeddings for RL☆43Updated 3 years ago
- Source code for ICLR 2020 paper: "Learning to Guide Random Search"☆39Updated 7 months ago
- Actor Critic using Kronecker-Factored Trust Region☆19Updated 6 years ago
- mplementation of Advantage Actor Critic (A2C) and Proximal Policy Optimization Algorithm (PPO) use the advantages of Tensorflow 2.x.☆9Updated 4 years ago
- Variational Reinforcement Learning☆16Updated 8 months ago