Miffyli / rl-action-space-shaping

Experiment code for testing effect of various action space transformations in reinforcement learning

☆30

Alternatives and similar repositories for rl-action-space-shaping

Users who are interested in rl-action-space-shaping compare it to the repositories listed below.

eugenevinitsky / robust_RL_multi_adversary
We investigate the effect of populations on finding good solutions to the robust MDP
☆28 Updated 4 years ago
montrealrobotics / iv_rl
IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation
☆40 Updated 3 weeks ago
oist-cnru / Variational-Recurrent-Models
Code for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL…
☆55 Updated 4 years ago
gyh75520 / Relational_DRL
Implementation of Relational Deep Reinforcement Learning
☆25 Updated 5 years ago
rmst / rlrd
PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)
☆41 Updated 3 years ago
rraileanu / idaac
☆54 Updated last year
BlueFisher / Advanced-Soft-Actor-Critic
Soft Actor-Critic with advanced features
☆50 Updated last week
uncharted-technologies / risk-and-uncertainty
Code that can be used to reproduce the experiments in our paper "Estimating Risk and Uncertainty in Deep Reinforcement Learning"
☆31 Updated 2 years ago
hari-sikchi / LOOP
Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]
☆40 Updated 2 years ago
uoe-agents / derl
The official repository of "Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)
☆27 Updated 3 years ago
alirezakazemipour / PPO-RND
Random network distillation on Montezuma's Revenge and Super Mario Bros.
☆51 Updated 2 months ago
TonghanWang / DOP
Code accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)
☆51 Updated 2 years ago
YYCAAA / V-MPO_Lunarlander
Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238
☆48 Updated 4 years ago
BY571 / IQN-and-Extensions
PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…
☆89 Updated 2 years ago
jparkerholder / DvD_ES
Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…
☆44 Updated 4 years ago
daniellawson9999 / online-decision-transformer
An unofficial implementation of Online Decision Transformer
☆40 Updated 2 years ago
apourchot / CEM-RL
Combining Evolutionary Algorithms and deep RL in various ways
☆103 Updated 4 years ago
ElisevanderPol / mdp-homomorphic-networks
☆29 Updated 4 years ago
junjungoal / IMPALA-pytorch
PyTorch IMPALA implementation
☆27 Updated 5 years ago
RobertTLange / spinningup-workspace
Reading notes & PyTorch experiments on OpenAI's "Spinning Up in DRL" tutorial.
☆39 Updated 2 years ago
schroederdewitt / mackrl
Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)
☆32 Updated 5 years ago
robintyh1 / onpolicybaselines
On-policy optimization baselines for deep reinforcement learning
☆30 Updated 5 years ago
kngwyu / Rainy
Deep RL agents with PyTorch
☆35 Updated 3 years ago
facebookresearch / level-replay
This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …
☆88 Updated 4 years ago
toshikwa / slac.pytorch
PyTorch implementation of Stochastic Latent Actor-Critic (SLAC).
☆93 Updated last year
quantumiracle / MARS
MARS is short for Multi-Agent Research Studio, a library for multi-agent reinforcement learning research.
☆49 Updated last year
johannah / bootstrap_dqn
Implementation of Bootstrap DQN and Randomized Prior Functions on ALE
☆54 Updated 4 months ago
tesslerc / GAC
Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control"
☆22 Updated 5 years ago
hejia-zhang / awesome-model-based-reinforcement-learning
A curated list of awesome Model-based reinforcement learning resources
☆94 Updated 4 years ago
xkianteb / dril
Disagreement-Regularized Imitation Learning
☆30 Updated 4 years ago