kchua / handful-of-trialsLinks

Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"

☆453

Alternatives and similar repositories for handful-of-trials

Users that are interested in handful-of-trials are comparing it to the libraries listed below

Sorting:

WilsonWangTHU / mbbl
☆392Updated 6 years ago
justinjfu / inverse_rl
☆274Updated 7 years ago
jannerm / mbpo
Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"
☆507Updated 2 years ago
quanvuong / handful-of-trials-pytorch
Unofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"
☆191Updated 2 years ago
Kaixhin / PlaNet
Deep Planning Network: Control from pixels by latent planning with learned dynamics
☆370Updated 3 years ago
nrontsis / PILCO
Bayesian Reinforcement Learning in Tensorflow
☆332Updated 4 years ago
vitchyr / multiworld
Multitask Environments for RL
☆279Updated 4 years ago
katerakelly / oyster
Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)
☆498Updated 2 years ago
denisyarats / pytorch_sac_ae
PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)
☆248Updated 5 years ago
alexlee-gk / slac
Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model
☆152Updated 4 years ago
iclavera / learning_to_adapt
Learning to Adapt in Dynamic, Real-World Environment through Meta-Reinforcement Learning
☆215Updated 2 years ago
juliusfrost / dreamer-pytorch
Dream to Control: Learning Behaviors by Latent Imagination, implemented in PyTorch.
☆305Updated last year
yrlu / irl-imitation
Implementation of Inverse Reinforcement Learning (IRL) algorithms in Python/Tensorflow. Deep MaxEnt, MaxEnt, LPIRL
☆642Updated last year
danijar / dreamer
Dream to Control: Learning Behaviors by Latent Imagination
☆547Updated 3 years ago
andrew-j-levy / Hierarchical-Actor-Critc-HAC-
This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.
☆260Updated 5 years ago
Farama-Foundation / D4RL-Evaluations
☆201Updated 2 years ago
anagabandi / nn_dynamics
☆345Updated 7 years ago
ikostrikov / pytorch-trpo
PyTorch implementation of Trust Region Policy Optimization
☆447Updated 6 years ago
denisyarats / pytorch_sac
PyTorch implementation of Soft Actor-Critic (SAC)
☆556Updated 3 years ago
thanard / me-trpo
☆92Updated last year
jonasrothfuss / ProMP
Implementation of Proximal Meta-Policy Search (ProMP) as well as related Meta-RL algorithm. Includes a useful experiment framework for Me…
☆238Updated 2 years ago
befelix / safe_learning
Safe reinforcement learning with stability guarantees
☆233Updated 3 years ago
google-research / realworldrl_suite
Real-World RL Benchmark Suite
☆355Updated 5 years ago
aravindr93 / mjrl
Reinforcement learning algorithms for MuJoCo tasks
☆418Updated 5 months ago
twni2016 / pomdp-baselines
Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022
☆327Updated last year
denisyarats / dmc2gym
OpenAI Gym wrapper for the DeepMind Control Suite
☆219Updated last year
rll-research / url_benchmark
☆351Updated 2 years ago
openai / safety-gym
Tools for accelerating safe exploration research.
☆550Updated 2 years ago
maximilianigl / DVRL
Deep Variational Reinforcement Learning
☆136Updated 3 years ago
AboudyKreidieh / h-baselines
A repository of high-performing hierarchical reinforcement learning models and algorithms.
☆316Updated 2 years ago