davidsandberg / rl_ssmsLinks

State Space Models for Reinforcement Learning in Tensorflow

☆19

Alternatives and similar repositories for rl_ssms

Users that are interested in rl_ssms are comparing it to the libraries listed below

Sorting:

johannah / bootstrap_dqn
Implementation of Bootstrap DQN and Randomized Prior Functions on ALE
☆54Updated 4 months ago
brett-daley / dqn-lambda
NeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns.
☆23Updated last year
tdavchev / option-critic
A Tensorflow implementation of the Option-Critic Architecture
☆71Updated 8 years ago
nikhilbarhate99 / Deterministic-GAIL-PyTorch
PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning
☆67Updated 5 years ago
ahq1993 / inverse_rl
Adversarial Imitation Via Variational Inverse Reinforcement Learning
☆95Updated 5 years ago
hiwonjoon / ICML2019-TREX
☆84Updated 4 years ago
fshamshirdar / pytorch-rdpg
PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)
☆55Updated 2 years ago
kazizzad / BDQN-MxNet-Gluon
Efficient Exploration through Bayesian Deep Q-Networks
☆37Updated 7 years ago
mcgillmrl / prob_mbrl
A library of probabilistic model based RL algorithms in pytorch
☆107Updated 4 years ago
louiskirsch / metagenrl
MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…
☆67Updated 5 years ago
yilunc2020 / Attention-DQN
Deep Recurrent Attention Reinforcement Learning in Atari
☆83Updated 7 years ago
krasheninnikov / max-causal-ent-irl
Maximum Causal Entropy Inverse Reinforcement Learning
☆48Updated 6 years ago
jianing-sun / Interpolated-Policy-Gradient-with-PPO-for-Robotics-Control-
Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gr…
☆37Updated 6 years ago
mengf1 / DHER
DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)
☆66Updated 5 years ago
jonasrothfuss / model_ensemble_meta_learning
Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm
☆44Updated 6 years ago
edouardklein / RL-and-IRL
C implementation of RL and IRL algorithms
☆19Updated 5 years ago
iclavera / learning_to_adapt
Learning to Adapt in Dynamic, Real-World Environment through Meta-Reinforcement Learning
☆215Updated 2 years ago
dnddnjs / feudal-montezuma
Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge
☆96Updated 3 years ago
alversafa / option-critic-arch
Implementation of the Option-Critic Architecture
☆40Updated 6 years ago
ppocma / ppocma
☆72Updated 6 years ago
thanard / me-trpo
☆92Updated last year
maximilianigl / DVRL
Deep Variational Reinforcement Learning
☆136Updated 3 years ago
xlnwel / model-free-algorithms
TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x
☆62Updated 4 years ago
msinto93 / D4PG
Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…
☆126Updated 5 years ago
ewanlee / ICLR2019-RL-Papers
The Reinforcement-Learning-Related Papers of ICLR 2019
☆47Updated 6 years ago
cjm715 / mgym
A collection of multi-agent reinforcement learning OpenAI gym environments
☆45Updated 5 years ago
siekmanj / r2l
Recurrent continuous reinforcement learning algorithms implemented in Pytorch.
☆51Updated 4 years ago
Knoxantropicen / model-based-meta-rl
Self-implemented code for Model-Based Meta-Reinforcement Learning
☆17Updated 6 years ago
hu-po / pySACQ
PyTorch implementation of SAC-Q Reinforcement Learning Algorithm (tested on OpenAI Gym environments)
☆37Updated 4 years ago
HumanCompatibleAI / population-irl
(Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards
☆28Updated 6 years ago