Sheepsody / Batched-Impala-PyTorch

Reinforcement learning - Batched Impala - PyTorch - Mario Kart

☆14

Alternatives and similar repositories for Batched-Impala-PyTorch:

Users that are interested in Batched-Impala-PyTorch are comparing it to the libraries listed below

johnlime / RlkitExtension
Collection of reinforcement learning algorithms
☆15Updated 3 years ago
seungjaeryanlee / rl-exploration
Reinforcement Learning papers on exploration methods.
☆19Updated 3 years ago
facebookresearch / adversarially-motivated-intrinsic-goals
This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".
☆61Updated last year
srsohn / msgi
ICLR 2020 Meta Reinforcement Learning with Autonomous Inference of Subtask Dependencies
☆18Updated 4 years ago
hiwonjoon / ICML2019-TREX
☆83Updated 4 years ago
RobertTLange / spinningup-workspace
Reading notes & PyTorch experiments on OpenAI's "Spinning Up in DRL" tutorial.
☆38Updated 2 years ago
Pervasive-AI-Lab / crlmaze
Continual Reinforcement Learning in 3D Non-stationary Environments
☆37Updated 5 years ago
mila-iqia / spr
Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"
☆161Updated 3 years ago
uber-research / D3G
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32Updated 5 years ago
facebookresearch / icp-block-mdp
Invariant Causal Prediction for Block MDPs
☆44Updated 4 years ago
jeanharb / a2oc_delib
A3C style Option-Critic with deliberation cost
☆39Updated 7 years ago
johanobandoc / revisiting_rainbow
Revisiting Rainbow
☆74Updated 3 years ago
sunblaze-ucb / rl-generalization
Modifiable OpenAI Gym environments for studying generalization in RL
☆87Updated 6 years ago
DartML / PPO-Stein-Control-Variate
Proximal Policy Optimization with Stein Control Variates:
☆33Updated 7 years ago
jannerm / gamma-models
Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"
☆44Updated last year
nnaisense / MAGE
Learning Action-Value Gradients in Model-based Policy Optimization
☆31Updated 3 years ago
veronicachelu / meta-learning
Meta Reinforcement Learning Experiments
☆34Updated 7 years ago
siekmanj / r2l
Recurrent continuous reinforcement learning algorithms implemented in Pytorch.
☆51Updated 3 years ago
alexis-jacq / LOLA_DiCE
Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)
☆95Updated 6 years ago
schroederdewitt / mackrl
Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)
☆33Updated 5 years ago
reinforcement-learning-kr / rl-montezuma
The state-of-art deep rl algorithms for Montezuma's revenge
☆25Updated 6 years ago
philipjball / OffCon3
📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)
☆24Updated 3 years ago
oxwhirl / opiq
Code for Optimistic Exploration even with a Pessimistic Initialisation
☆14Updated 4 years ago
facebookresearch / level-replay
This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …
☆85Updated 3 years ago
DavidJanz / successor_uncertainties_atari
Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…
☆21Updated 2 years ago
joeybose / FloRL
Implicit Normalizing Flows + Reinforcement Learning
☆61Updated 5 years ago
mklissa / PPOC
Proximal Policy Option-Critic
☆23Updated 6 years ago
microsoft / logrl
Logarithmic Reinforcement Learning
☆26Updated 2 years ago
victorcampos7 / edl
Code for "Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills"
☆37Updated 5 years ago
kkhetarpal / ioc
Options of Interest: Temporal Abstraction with Interest Functions AAAI 2020
☆25Updated 4 years ago