lansiz / eqptLinks
The Path to Nash Equilibrium
☆38Updated 2 years ago
Alternatives and similar repositories for eqpt
Users that are interested in eqpt are comparing it to the libraries listed below
Sorting:
- Source code for ICLR 2020 paper: "Learning to Guide Random Search"☆40Updated last year
- ☆80Updated 2 years ago
- 🤖 Reinforcement Learning paper summaries, notebooks, and articles.☆26Updated 5 years ago
- Reward Learning by Simulating the Past☆45Updated 6 years ago
- A2C for GVG-AI☆22Updated 6 years ago
- Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"☆21Updated 7 years ago
- Statistical adaptive stochastic optimization methods☆32Updated 5 years ago
- Public Release of Plan2vec Implementation in pyTorch☆57Updated 2 years ago
- Reinforcement Learning Assembly☆92Updated 4 years ago
- ☆35Updated 7 years ago
- Computing the eigenvalues of Neural Tangent Kernel and Conjugate Kernel (aka NNGP kernel) over the boolean cube☆47Updated 6 years ago
- The Differentiable Cross-Entropy Method☆124Updated 5 years ago
- Code for human intervention reinforcement learning☆35Updated 7 years ago
- Distributed implementation of popular evolutionary methods☆64Updated 7 years ago
- Code for VIREL: A Variational Inference Framework for Reinforcement Learning☆14Updated 5 years ago
- ☆53Updated 5 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆51Updated 6 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆96Updated 7 years ago
- A collection of code investigating the use of information theory for abstractions in RL☆16Updated 6 years ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆30Updated 4 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation☆31Updated 6 years ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆77Updated 5 years ago
- Neural Fixed-Point Acceleration for Convex Optimization☆29Updated 3 years ago
- This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning.☆66Updated 6 years ago
- Official implementation of DynE, Dynamics-aware Embeddings for RL☆44Updated 4 years ago
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning☆34Updated 5 years ago
- Code for Unbiased Implicit Variational Inference (UIVI)☆14Updated 6 years ago
- fork of rl-baseline-zoo☆21Updated 5 years ago
- RL agent to play μRTS with Stable-Baselines3 and PyTorch☆26Updated 3 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆79Updated 2 years ago