Stepan-Makarenko / ICM-PPO-implementation
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM) on Pyramid env, Unity ML
☆13Updated 9 months ago
Related projects: ⓘ
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆87Updated 3 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆44Updated 3 years ago
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.☆60Updated 10 months ago
- Author's PyTorch implementation of TD7 for online and offline RL☆108Updated last year
- ☆39Updated 2 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆146Updated last year
- The implementation of LSTM-TD3.☆60Updated last year
- Model-Free Safe Reinforcement Learning through Neural Barrier Certificate☆28Updated 4 months ago
- behavior cloning from observation☆34Updated 3 years ago
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.☆61Updated last year
- Implementation of Off Policy Adversarial Inverse Reinforcement Learning☆20Updated 3 years ago
- Distributional Soft Actor Critic☆49Updated 4 years ago
- An implementation of deep reinforcement learning TD3 algorithm with prioritized experience replay (PER) buffer☆22Updated 5 years ago
- DSAC; Distributional Soft Actor-Critic☆108Updated 6 months ago
- There will be updates later☆79Updated 5 years ago
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment☆56Updated this week
- Official implementation of the paper `Augmenting GAIL with BC for sample efficient imitation learning` in PyTorch☆30Updated 3 years ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆75Updated 9 months ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆34Updated 4 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆70Updated 9 months ago
- RL Algorithms for Visual Continuous Control☆30Updated last year
- ppo-lstm-parallel☆42Updated 5 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆113Updated last month
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆50Updated last year
- PyTorch implementation of discrete version of Soft Actor-Critic.☆27Updated 3 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆139Updated 5 months ago
- This is the official repository for the paper "Guided Exploration with Proximal Policy Optimization using a Single Demonstration", https:…☆17Updated 2 years ago
- PyTorch implementation of the Q-Learning Algorithm Normalized Advantage Function for continuous control problems + PER and N-step Method☆26Updated 3 years ago
- [NeurIPS 2022] Code for Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments☆12Updated last year
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆41Updated 2 years ago