zhihanyang2022 / off-policy-continuous-control
Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)
☆90 Updated last year
Alternatives and similar repositories for off-policy-continuous-control
Users who are interested in off-policy-continuous-control are comparing it to the repositories listed below
- Baseline implementation of recurrent PPO using truncated BPTT ☆156 Updated last year
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing ☆137 Updated 2 months ago
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method ☆67 Updated 2 years ago
- DecentralizedLearning ☆25 Updated 2 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm. ☆173 Updated last year
- A curated list of awesome Model-based reinforcement learning resources ☆95 Updated 5 years ago
- ☆45 Updated 2 years ago
- PyTorch implementation of Soft Actor-Critic (SAC). ☆105 Updated 5 years ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research. ☆66 Updated last year
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch. ☆78 Updated 2 years ago
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems. ☆68 Updated 2 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016 ☆139 Updated last year
- ☆23 Updated last year
- Unofficial re-implementation of "Learning Latent Dynamics for Planning from Pixels" (https://arxiv.org/abs/1811.04551) with PyTorch ☆46 Updated 5 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros. ☆52 Updated 6 months ago
- Advantage weighted Actor Critic for Offline RL ☆50 Updated 3 years ago
- A general model-free off-policy actor-critic implementation. Continuous and Discrete Soft Actor-Critic with multimodal observations, data… ☆39 Updated last year
- Gym-like extensions for POMDP ☆57 Updated 4 years ago
- Soft Actor-Critic with advanced features ☆51 Updated 3 weeks ago
- We investigate the effect of populations on finding good solutions to the robust MDP ☆28 Updated 4 years ago
- PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL. ☆150 Updated 4 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations. ☆97 Updated 5 years ago
- Distributional Soft Actor Critic ☆59 Updated 5 years ago
- An OpenAI Gym environment for multi-agent car racing based on Gym's original car racing environment. ☆88 Updated 3 years ago
- Source files to replicate experiments in my ICLR 2022 paper. ☆70 Updated 3 months ago
- Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization ☆83 Updated 2 years ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory ☆194 Updated last year
- Implementation of the Option-Critic Architecture ☆40 Updated 6 years ago
- PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020) ☆43 Updated 3 years ago
- Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020] ☆53 Updated 4 years ago