jannerm / mbpo

Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"

☆504

Alternatives and similar repositories for mbpo

Users who are interested in mbpo are comparing it to the libraries listed below.

nikhilbarhate99 / Hierarchical-Actor-Critic-HAC-PyTorch
PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments
☆318 Updated 3 years ago
AboudyKreidieh / h-baselines
A repository of high-performing hierarchical reinforcement learning models and algorithms.
☆316 Updated 2 years ago
andrew-j-levy / Hierarchical-Actor-Critc-HAC-
This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.
☆259 Updated 5 years ago
denisyarats / pytorch_sac
PyTorch implementation of Soft Actor-Critic (SAC)
☆555 Updated 3 years ago
katerakelly / oyster
Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)
☆496 Updated 2 years ago
TianhongDai / hindsight-experience-replay
PyTorch implementation of Hindsight Experience Replay (HER), with experiments on all Fetch robotic environments.
☆433 Updated 3 years ago
schroederdewitt / multiagent_mujoco
Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's MuJoCo Gym environments.
☆357 Updated 2 years ago
kchua / handful-of-trials
Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"
☆451 Updated 2 years ago
toshikwa / sac-discrete.pytorch
PyTorch implementation of SAC-Discrete.
☆307 Updated last year
aviralkumar2907 / CQL
Code for conservative Q-learning
☆450 Updated 3 years ago
sfujim / TD3_BC
Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL
☆366 Updated 3 years ago
WilsonWangTHU / mbbl
☆392 Updated 6 years ago
sfujim / BCQ
Author's PyTorch implementation of BCQ for continuous and discrete actions
☆636 Updated 4 years ago
BY571 / Soft-Actor-Critic-and-Extensions
PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…
☆288 Updated 4 years ago
lcswillems / torch-ac
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
☆203 Updated 2 years ago
denisyarats / pytorch_sac_ae
PyTorch implementation of Soft Actor-Critic + Autoencoder (SAC+AE)
☆248 Updated 5 years ago
twni2016 / pomdp-baselines
Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022
☆326 Updated 11 months ago
openai / safety-starter-agents
Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.
☆435 Updated 2 years ago
rlcode / per
Prioritized Experience Replay (PER) implementation in PyTorch
☆345 Updated 5 years ago
justinjfu / inverse_rl
☆274 Updated 7 years ago
ShawK91 / Evolutionary-Reinforcement-Learning
Codebase for Evolutionary Reinforcement Learning (ERL) from the paper "Evolution-Guided Policy Gradients in Reinforcement Learning" publi…
☆236 Updated 4 years ago
aravindr93 / mjrl
Reinforcement learning algorithms for MuJoCo tasks
☆416 Updated 4 months ago
schatty / oprl
A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing
☆134 Updated 2 weeks ago
quanvuong / handful-of-trials-pytorch
Unofficial PyTorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"
☆190 Updated 2 years ago
danijar / dreamer
Dream to Control: Learning Behaviors by Latent Imagination
☆547 Updated 3 years ago
ArnaudFickinger / gym-multigrid
Lightweight multi-agent gridworld Gym environment
☆209 Updated last year
Farama-Foundation / D4RL-Evaluations
☆199 Updated 2 years ago
openai / safety-gym
Tools for accelerating safe exploration research.
☆548 Updated 2 years ago
vitchyr / multiworld
Multitask Environments for RL
☆278 Updated 3 years ago
toshikwa / fqf-iqn-qrdqn.pytorch
PyTorch implementation of FQF, IQN and QR-DQN.
☆180 Updated last year