crowdAI / marlo-single-agent-starter-kitLinks

Round 1 Starter Kit for the MarLo challenge

☆20

Alternatives and similar repositories for marlo-single-agent-starter-kit

Users that are interested in marlo-single-agent-starter-kit are comparing it to the libraries listed below

Sorting:

sunblaze-ucb / rl-generalization
Modifiable OpenAI Gym environments for studying generalization in RL
☆87Updated 6 years ago
johanobandoc / revisiting_rainbow
Revisiting Rainbow
☆75Updated 4 years ago
zuoxingdong / dm2gym
Convert DeepMind Control Suite to OpenAI gym environments.
☆87Updated 5 years ago
RLAgent / state-marginal-matching
Efficient Exploration via State Marginal Matching (2019)
☆69Updated 6 years ago
facebookresearch / impact-driven-exploration
impact-driven-exploration
☆131Updated last year
tdavchev / option-critic
A Tensorflow implementation of the Option-Critic Architecture
☆71Updated 8 years ago
DavidJanz / successor_uncertainties_atari
Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…
☆21Updated 2 years ago
russellmendonca / maesn_suite
☆43Updated 6 years ago
WilsonWangTHU / POPLIN
☆99Updated 2 years ago
Alfo5123 / Robust-Multitask-RL
Machine Learning Course Project Skoltech 2018
☆108Updated 6 years ago
victorcampos7 / edl
Code for "Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills"
☆36Updated 5 years ago
justinjfu / diagnosing_qlearning
Code for Diagnosing Bottlenecks in Deep Q-learning. Contains implementations of tabular environments plus solvers.
☆19Updated 6 years ago
dnddnjs / feudal-montezuma
Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge
☆96Updated 2 years ago
takuseno / d4rl-pybullet
Datasets for data-driven deep reinforcement learning with PyBullet environments
☆150Updated 4 years ago
jonasrothfuss / model_ensemble_meta_learning
Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm
☆44Updated 6 years ago
DartML / PPO-Stein-Control-Variate
Proximal Policy Optimization with Stein Control Variates:
☆33Updated 7 years ago
junjungoal / IMPALA-pytorch
PyTorch IMPALA implementation
☆26Updated 5 years ago
ruizhaogit / mep
Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)
☆24Updated 6 years ago
hiwonjoon / ICML2019-TREX
☆84Updated 4 years ago
nathangrinsztajn / Box-World
Implementation of the Box-World environment from the paper "Relational Deep Reinforcement Learning"
☆46Updated last year
nnaisense / MAX
Code for reproducing experiments in Model-Based Active Exploration, ICML 2019
☆79Updated 5 years ago
alexlee-gk / slac
Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model
☆151Updated 4 years ago
rraileanu / auto-drac
Automatic Data-Regularized Actor-Critic (Auto-DrAC)
☆102Updated 2 years ago
higgsfield / Imagination-Augmented-Agents
Building Agents with Imagination: pytorch step-by-step implementation
☆209Updated 6 years ago
facebookresearch / icp-block-mdp
Invariant Causal Prediction for Block MDPs
☆44Updated 5 years ago
jeanharb / a2oc_delib
A3C style Option-Critic with deliberation cost
☆39Updated 7 years ago
seungjaeryanlee / rldb
Performances of Reinforcement Learning Agents
☆53Updated 5 years ago
hu-po / pySACQ
PyTorch implementation of SAC-Q Reinforcement Learning Algorithm (tested on OpenAI Gym environments)
☆37Updated 4 years ago
yusukeurakami / plan2explore-pytorch
☆43Updated 4 years ago
google-deepmind / dm_hard_eight
☆84Updated 4 years ago