lansiz / eqpt
The Path to Nash Equilibrium
☆38 · Updated 2 years ago
Alternatives and similar repositories for eqpt
Users who are interested in eqpt are comparing it to the libraries listed below:
- Reinforcement Learning Assembly ☆92 · Updated 3 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation ☆31 · Updated 6 years ago
- ☆80 · Updated last year
- Reward Learning by Simulating the Past ☆44 · Updated 6 years ago
- 🤖 Reinforcement Learning paper summaries, notebooks, and articles. ☆26 · Updated 5 years ago
- Source code for ICLR 2020 paper: "Learning to Guide Random Search" ☆40 · Updated 10 months ago
- Variational Walkback, NIPS'17 ☆28 · Updated 7 years ago
- Quasi-Newton Algorithm for Stochastic Optimization ☆10 · Updated 3 years ago
- Open-source library for reinforcement learning research. ☆54 · Updated 2 years ago
- Flexible Reinforcement Learning Framework with PyTorch ☆22 · Updated 5 years ago
- Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration" ☆21 · Updated 7 years ago
- Statistical adaptive stochastic optimization methods ☆32 · Updated 5 years ago
- Official implementation of DynE, Dynamics-aware Embeddings for RL ☆43 · Updated 4 years ago
- RL agent to play μRTS with Stable-Baselines3 and PyTorch ☆26 · Updated 3 years ago
- ☆21 · Updated 6 years ago
- ☆45 · Updated 5 years ago
- Neuronal Circuit Policies ☆40 · Updated 2 years ago
- Public Release of Plan2vec Implementation in PyTorch ☆56 · Updated 2 years ago
- A2C for GVG-AI ☆21 · Updated 6 years ago
- Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies ☆15 · Updated 6 years ago
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning ☆34 · Updated 5 years ago
- fork of rl-baseline-zoo ☆21 · Updated 5 years ago
- Neural Fixed-Point Acceleration for Convex Optimization ☆29 · Updated 2 years ago
- PyTorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098) ☆95 · Updated 6 years ago
- Computing the eigenvalues of Neural Tangent Kernel and Conjugate Kernel (aka NNGP kernel) over the boolean cube ☆47 · Updated 5 years ago
- A first bare-bones parallelized implementation of Go-Explore as described in the Uber Engineering blog post ☆46 · Updated 6 years ago
- Code used in our paper "Robust Deep Reinforcement Learning through Adversarial Loss" ☆33 · Updated last year
- Code for NeurIPS 2019 paper: "Symmetry-Based Disentangled Representation Learning requires Interaction with Environments" by H. Caselles-… ☆35 · Updated 5 years ago
- An implementation of "The Kanerva Machine" with PyTorch and Pyro ☆12 · Updated 7 years ago
- Understanding RL vision Distill article ☆23 · Updated 2 years ago