bahh723 / model-free-rl-algos

☆8

Alternatives and similar repositories for model-free-rl-algos

Users that are interested in model-free-rl-algos are comparing it to the libraries listed below

Sorting:

behaviorguidedRL / BGRL
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
☆24Updated 4 years ago
DartML / PPO-Stein-Control-Variate
Proximal Policy Optimization with Stein Control Variates:
☆33Updated 7 years ago
Hwhitetooth / lirpg
☆61Updated 6 years ago
zafarali / emdp
Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations
☆49Updated 3 years ago
ermongroup / CalibratedModelBasedRL
Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.
☆55Updated 6 years ago
iosband / TabulaRL
☆65Updated last year
alexis-jacq / LOLA_DiCE
Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)
☆95Updated 6 years ago
sebascuri / hucrl
☆30Updated last year
david-abel / rl_abstraction
Code for experimenting with state and action abstractions in reinforcement learning.
☆31Updated 4 years ago
xkianteb / dril
Disagreement-Regularized Imitation Learning
☆30Updated 3 years ago
facebookresearch / level-replay
This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …
☆86Updated 3 years ago
thanard / me-trpo
☆91Updated last year
arushijain94 / SafeOptionCritic
Safe Option-Critic: Learning Safety in the Option-Critic Architecture
☆20Updated 6 years ago
ruizhaogit / mep
Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)
☆23Updated 5 years ago
RLAgent / state-marginal-matching
Efficient Exploration via State Marginal Matching (2019)
☆68Updated 5 years ago
Santara / stochastic_value_gradient
Implementation of (Learning Continuous Control Policies by Stochastic Value Gradients)[https://arxiv.org/abs/1510.09142]
☆26Updated 3 years ago
nnaisense / MAGE
Learning Action-Value Gradients in Model-based Policy Optimization
☆31Updated 3 years ago
WilsonWangTHU / POPLIN
☆98Updated 2 years ago
brain-research / mirage-rl
Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.
☆17Updated 6 years ago
RomainLaroche / SPIBB
Safe Policy Improvement with Baseline Bootstrapping
☆26Updated 5 years ago
jonasrothfuss / model_ensemble_meta_learning
Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm
☆44Updated 6 years ago
aa14k / Exploration-in-RL
☆28Updated 11 months ago
mcmachado / options
☆43Updated 8 years ago
haoliuhl / taming-maml
Taming MAML: efficient unbiased meta-reinforcement learning
☆29Updated 2 years ago
manantomar / Mirror-Descent-Policy-Optimization
Mirror Descent Policy Optimization
☆38Updated 4 years ago
uber-research / D3G
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32Updated 5 years ago
mcmachado / count_based_exploration_sr
☆31Updated 5 years ago
stratisMarkou / sample-efficient-bayesian-rl
Source for the sample efficient tabular RL submission to the 2019 NIPS workshop on Biological and Artificial RL
☆25Updated 3 years ago
deep-skill-chaining / deep-skill-chaining
Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"
☆28Updated 5 years ago
011235813 / SEPT
Single Episode Policy Transfer in Reinforcement Learning
☆17Updated 2 years ago