haarnoja / softqlearningLinks

Reinforcement Learning with Deep Energy-Based Policies

☆430

Alternatives and similar repositories for softqlearning

Users that are interested in softqlearning are comparing it to the libraries listed below

Sorting:

ikostrikov / pytorch-ddpg-naf
Implementation of algorithms for continuous control (DDPG and NAF).
☆310Updated 4 years ago
pat-coady / trpo
Trust Region Policy Optimization with TensorFlow and OpenAI Gym
☆360Updated 5 years ago
Kaixhin / ACER
Actor-critic with experience replay
☆254Updated 2 years ago
stevenpjg / ddpg-aigym
Continuous control with deep reinforcement learning - Deep Deterministic Policy Gradient (DDPG) algorithm implemented in OpenAI Gym envir…
☆275Updated 7 years ago
ikostrikov / pytorch-trpo
PyTorch implementation of Trust Region Policy Optimization
☆441Updated 6 years ago
hoangminhle / hierarchical_IL_RL
Code for hierarchical imitation learning and reinforcement learning
☆294Updated 7 years ago
davidhershey / feudal_networks
An implementation of FeUdal Networks for Hierarchical Reinforcement Learning as published : https://arxiv.org/abs/1703.01161
☆183Updated 7 years ago
uber-research / ape-x
This repo replicates the results Horgan et al obtained in "Distributed Prioritized Experience Replay"
☆190Updated 6 years ago
jeanharb / option_critic
Implementation of the Option-Critic Architecture on the Atari (ALE) environment
☆179Updated 7 years ago
dgriff777 / a3c_continuous
A continuous action space version of A3C LSTM in pytorch plus A3G design
☆258Updated 9 months ago
jachiam / cpo
Constrained Policy Optimization
☆322Updated 8 years ago
junhyukoh / self-imitation-learning
ICML 2018 Self-Imitation Learning
☆278Updated 5 years ago
alexis-jacq / Pytorch-DPPO
Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286
☆183Updated 7 years ago
MG2033 / A2C
A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow
☆181Updated 6 years ago
jonasrothfuss / ProMP
Implementation of Proximal Meta-Policy Search (ProMP) as well as related Meta-RL algorithm. Includes a useful experiment framework for Me…
☆238Updated 2 years ago
Damcy / prioritized-experience-replay
implement of prioritized experience replay
☆159Updated 6 years ago
anagabandi / nn_dynamics
☆345Updated 7 years ago
openai / imitation
Code for the paper "Generative Adversarial Imitation Learning"
☆714Updated 6 years ago
awjuliani / Meta-RL
Implementation of Meta-RL A3C algorithm
☆405Updated 8 years ago
go2sea / DQfD
An implement of DQfD（Deep Q-learning from Demonstrations) raised by DeepMind:Learning from Demonstrations for Real World Reinforcement Le…
☆132Updated 7 years ago
andrewliao11 / gail-tf
Tensorflow implementation of generative adversarial imitation learning
☆199Updated 7 years ago
joschu / modular_rl
Implementation of TRPO and related algorithms
☆634Updated 7 years ago
higgsfield / Imagination-Augmented-Agents
Building Agents with Imagination: pytorch step-by-step implementation
☆209Updated 6 years ago
Breakend / gym-extensions
This repo is intended as an extension for OpenAI Gym for auxiliary tasks (multitask learning, transfer learning, inverse reinforcement le…
☆215Updated 6 years ago
openai / gym-soccer
☆303Updated 2 years ago
justinjfu / inverse_rl
☆274Updated 7 years ago
WilsonWangTHU / mbbl
☆392Updated 6 years ago
hengyuan-hu / rainbow
A PyTorch implementation of Rainbow DQN agent
☆168Updated 7 years ago
mrkulk / hierarchical-deep-RL
Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstractions and Intrinsic Motivation
☆87Updated 7 years ago
vitchyr / multiworld
Multitask Environments for RL
☆278Updated 3 years ago