JohnLangford / RL_acidLinks

Some hard problems for reinforcement learning.

☆31

Alternatives and similar repositories for RL_acid

Users that are interested in RL_acid are comparing it to the libraries listed below

Sorting:

ofirnachum / models
Models built with TensorFlow
☆25Updated 6 years ago
flowersteam / rl-difference-testing
Simple tools for statistical analyses in RL experiments
☆67Updated 7 years ago
kvfrans / parallel-trpo
A parallel version of Trust Region Policy Optimization
☆65Updated 8 years ago
Breakend / DeepReinforcementLearningThatMatters
Accompanying code for "Deep Reinforcement Learning that Matters"
☆152Updated 7 years ago
Breakend / RLSSContinuousControlTutorial
Tutorial on continuous control at Reinforcement Learning Summer School 2017.
☆34Updated 8 years ago
iosband / TabulaRL
☆65Updated last year
hardmaru / astool
Augmented environments with RL
☆104Updated 6 years ago
junhyukoh / value-prediction-network
NIPS 2017 Value Prediction Network
☆166Updated 7 years ago
floringogianu / categorical-dqn
A working implementation of the Categorical DQN (Distributional RL).
☆96Updated 7 years ago
chrodan / tdlearn
some common TD Learning algorithms
☆66Updated 5 years ago
flowersteam / geppg
☆35Updated 6 years ago
Scitator / Run-Skeleton-Run
Reason8.ai PyTorch solution for NIPS RL 2017 challenge
☆84Updated 5 years ago
wojzaremba / trpo
☆101Updated 8 years ago
DanielTakeshi / rl_algorithms
I am implementing a lot of reinforcement learning and imitation learning algorithms since I'm sick of reading about them but not really u…
☆51Updated 5 years ago
siemens / policy_search_bb-alpha
☆69Updated 7 years ago
ericjang / e2c
TensorFlow impementation of: Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images
☆65Updated 9 years ago
rlbayes / rllabplusplus
☆159Updated 8 years ago
eparisotto / ActorMimic
Train an RL agent to play multiple Atari games at once
☆69Updated 9 years ago
ehknight / natural-gradient-deep-q-learning
☆22Updated 6 years ago
AdamStelmaszczyk / learning2run
Our NIPS 2017: Learning to Run source code
☆55Updated 2 years ago
ethanluoyc / e2c-pytorch
E2C implementation in PyTorch
☆43Updated 8 years ago
eringrant / spirl-readings
A collection of reading material for the Workshop on "Structure & Priors in Reinforcement Learning" (SPiRL) at ICLR 2019.
☆13Updated 4 years ago
nnaisense / 2017-learning-to-run
The Winning Solution for the Learning To Run Challenge 2017
☆60Updated 7 years ago
Feryal / craft-env
☆44Updated 6 years ago
pathak22 / modular-assemblies
[NeurIPS 2019] Code for the paper "Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity"
☆116Updated 5 years ago
alexis-jacq / LOLA_DiCE
Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)
☆96Updated 6 years ago
instadeepai / AlphaNPI
Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.
☆79Updated last year
vicariousinc / schema-games
General Game Playing with Schema Networks
☆41Updated 3 years ago
flowersteam / Unsupervised_Goal_Space_Learning
Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"
☆21Updated 7 years ago
openai / baselines-results
☆117Updated 5 years ago