lansiz / eqptLinks

The Path to Nash Equilibrium

☆38

Alternatives and similar repositories for eqpt

Users that are interested in eqpt are comparing it to the libraries listed below

Sorting:

facebookresearch / rela
Reinforcement Learning Assembly
☆92Updated 3 years ago
AdeelMufti / RL-RND
Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation
☆31Updated 6 years ago
mfranzs / meta-learning-curiosity-algorithms
☆80Updated last year
HumanCompatibleAI / rlsp
Reward Learning by Simulating the Past
☆44Updated 6 years ago
MasterScrat / rl-insights
🤖 Reinforcement Learning paper summaries, notebooks, and articles.
☆26Updated 5 years ago
isl-org / LMRS
Source code for ICLR 2020 paper: "Learning to Guide Random Search"
☆40Updated 10 months ago
anirudh9119 / walkback_nips17
Variational Walkback, NIPS'17
☆28Updated 7 years ago
vtopt / qnstop
Quasi-Newton Algorithm for Stochastic Optimization
☆10Updated 3 years ago
opium-sh / prl
Open-source library for a reinforcement learning research.
☆54Updated 2 years ago
rotinov / AGNES
Flexible Reinforcement Learning Framework with PyTorch
☆22Updated 5 years ago
flowersteam / Unsupervised_Goal_Space_Learning
Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"
☆21Updated 7 years ago
microsoft / statopt
Statistical adaptive stochastic optimization methods
☆32Updated 5 years ago
willwhitney / dynamics-aware-embeddings
Official implementation of DynE, Dynamics-aware Embeddings for RL
☆43Updated 4 years ago
kachayev / gym-microrts-paper-sb3
RL agent to play μRTS with Stable-Baselines3 and PyTorch
☆26Updated 3 years ago
ctallec / continuous-rl
☆21Updated 6 years ago
kyunghyuncho / backprop-kalman-filter
☆45Updated 5 years ago
mlech26l / ordinary_neural_circuits
Neuronal Circuit Policies
☆40Updated 2 years ago
geyang / plan2vec
Public Release of Plan2vec Implementation in pyTorch
☆56Updated 2 years ago
njustesen / a2c_gvgai
A2C for GVG-AI
☆21Updated 6 years ago
KMarino / hrl-ep3
Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies
☆15Updated 6 years ago
YyzHarry / SV-RL
[ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning
☆34Updated 5 years ago
hardmaru / rlzoo
fork of rl-baseline-zoo
☆21Updated 5 years ago
facebookresearch / neural-scs
Neural Fixed-Point Acceleration for Convex Optimization
☆29Updated 2 years ago
alexis-jacq / LOLA_DiCE
Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)
☆95Updated 6 years ago
thegregyang / NNspectra
Computing the eigenvalues of Neural Tangent Kernel and Conjugate Kernel (aka NNGP kernel) over the boolean cube
☆47Updated 5 years ago
R-McHenry / ParallelizedGoExplore
A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post
☆46Updated 6 years ago
tuomaso / radial_rl
Code used in our paper "Robust Deep Reinforment Learning through Adversarial Loss"
☆33Updated last year
Caselles / NeurIPS19-SBDRL
Code for NeurIPS 2019 paper: "Symmetry-Based Disentangled Representation Learning requires Interaction with Environments" by H. Caselles-…
☆35Updated 5 years ago
Kajiyu / kanerva_machine
The implementation of "The Kanerva Machine" with Pytorch and Pyro
☆12Updated 7 years ago
distillpub / post--understanding-rl-vision
Understanding RL vision Distill article
☆23Updated 2 years ago