LucasAlegre / sac-plusLinks
Soft Actor-Critic implementation with SOTA model-free extension (REDQ) and SOTA model-based extension (MBPO).
☆15Updated 4 years ago
Alternatives and similar repositories for sac-plus
Users that are interested in sac-plus are comparing it to the libraries listed below
Sorting:
- Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments☆29Updated 5 years ago
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch.☆51Updated 4 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆137Updated 4 months ago
- PyTorch implementation of Stochastic Latent Actor-Critic(SLAC).☆94Updated last year
- NeurIPS Reproducibility Challenge 2019☆20Updated 5 years ago
- Soft Actor-Critic with advanced features☆50Updated last week
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation☆78Updated 3 years ago
- PyTorch IMPALA implementation☆27Updated 6 years ago
- Deep Reinforcement Learning for Continuous Control in PyTorch☆105Updated 4 years ago
- Unofficial re-implementation of "Learning Latent Dynamics for Planning from Pixels" (https://arxiv.org/abs/1811.04551 ) with PyTorch☆47Updated 5 years ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆90Updated 2 years ago
- ☆15Updated 6 years ago
- MVE: model-based value estimation☆11Updated 7 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆52Updated 8 months ago
- POMDP wrappers for OpenAI Gym☆15Updated 6 years ago
- Reading notes & PyTorch experiments on OpenAI's "Spinning Up in DRL" tutorial.☆40Updated 3 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated 2 years ago
- Unofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆194Updated 3 years ago
- ☆48Updated last month
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆48Updated 5 years ago
- Advantage weighted Actor Critic for Offline RL☆52Updated 3 years ago
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method☆66Updated 2 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆103Updated 3 years ago
- A general model-free off-policy actor-critic implementation. Continuous and Discrete Soft Actor-Critic with multimodal observations, data…☆40Updated last year
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆176Updated last year
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆41Updated 3 years ago
- This repository contains the implementation for the paper - Exploration via Hierarchical Meta Reinforcement Learning.☆62Updated 6 years ago
- Pytorch implementations of RL algorithms, focusing on model-based, lifelong, reset-free, and offline algorithms. Official codebase for Re…☆108Updated 3 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆183Updated 3 years ago
- PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.☆149Updated 4 years ago