zhihanyang2022 / off-policy-continuous-control
Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)
☆79Updated last year
Related projects ⓘ
Alternatives and complementary repositories for off-policy-continuous-control
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method☆62Updated last year
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆133Updated 3 months ago
- DecentralizedLearning☆21Updated last year
- Distributional Soft Actor Critic☆49Updated 4 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆38Updated 4 years ago
- Baseline implementation of recurrent PPO using truncated BPTT☆125Updated 6 months ago
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.☆62Updated last year
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆43Updated last year
- Source files to replicate experiments in my ICLR 2022 paper.☆62Updated 4 months ago
- behavior cloning from observation☆35Updated 3 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆152Updated last week
- IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARL☆35Updated 2 months ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 3 years ago
- PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)☆39Updated 2 years ago
- Official implementation of the paper `Augmenting GAIL with BC for sample efficient imitation learning` in PyTorch☆32Updated 3 years ago
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.☆60Updated last year
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆51Updated 5 months ago
- PyTorch implementation of Soft Actor-Critic(SAC).☆98Updated 4 years ago
- ☆38Updated last year
- PyTorch implementation of Stochastic Latent Actor-Critic(SLAC).☆87Updated 3 months ago
- Author's PyTorch implementation of TD7 for online and offline RL☆117Updated last year
- DSAC; Distributional Soft Actor-Critic☆113Updated 9 months ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆52Updated last year
- The implementation of LSTM-TD3.☆64Updated last year
- ☆61Updated last year
- Deep Reinforcement Learning for Continuous Control in PyTorch☆93Updated 2 years ago
- A library for building reinforcement learning and imitation learning agents in Pytorch☆58Updated 4 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆157Updated 2 years ago
- JAX and PZ RL envs + algorithms for swarms of CrazyFlies☆63Updated 2 months ago
- Advantage weighted Actor Critic for Offline RL☆47Updated 2 years ago