BY571 / Normalized-Advantage-Function-NAF-
PyTorch implementation of the Q-Learning Algorithm Normalized Advantage Function for continuous control problems + PER and N-step Method
☆27Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for Normalized-Advantage-Function-NAF-
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM) on Pyramid env, Unity ML☆14Updated 10 months ago
- The implementation of LSTM-TD3.☆62Updated last year
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆50Updated 4 months ago
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method☆62Updated last year
- Model-Free Safe Reinforcement Learning through Neural Barrier Certificate☆29Updated 6 months ago
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment☆58Updated last month
- RL Algorithms for Visual Continuous Control☆32Updated last year
- Implementation of Recovery RL: Safe Reinforcement Learning With Learned Recovery Zones.☆51Updated last year
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆77Updated 11 months ago
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment☆40Updated last month
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆92Updated 3 years ago
- An implementation of deep reinforcement learning TD3 algorithm with prioritized experience replay (PER) buffer☆23Updated 5 years ago
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.☆62Updated last year
- Official open-source implementation of ICML 2022 paper: Reachability Constrainted Reinforcement Learning.☆30Updated 2 years ago
- ☆131Updated 5 years ago
- behavior cloning from observation☆35Updated 3 years ago
- Repository containing the code for the paper "Safe Model-Based Reinforcement Learning using Robust Control Barrier Functions". Specifical…☆36Updated 2 years ago
- Pytorch implementation of "Safe Exploration in Continuous Action Spaces" [Dalal et al.]☆66Updated 5 years ago
- Implementation of Robust Reinforcement Learning using Offline Data [NeurIPS'22]☆22Updated last year
- An MPC algorithm which supports polytopic state and action constraints, using CEM optimisation.☆13Updated 5 years ago
- Code for the NeurIPS 2021 paper "Safe Reinforcement Learning by Imagining the Near Future"☆40Updated 2 years ago
- Baseline implementation of recurrent PPO using truncated BPTT☆123Updated 6 months ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆45Updated 4 years ago
- Pytorch version of the MPC in model-based reinforcement learning (MBRL), currently only test in the CartPole-swing-up environment☆77Updated 4 years ago
- Gym environment for cooperative multi-agent reinforcement learning in heterogeneous robot teams☆39Updated 2 years ago
- ☆13Updated 4 years ago
- Learning Safe Multi-Agent Control with Decentralized Neural Barrier Certificates☆70Updated last year
- Heterogeneous Multi-Robot Reinforcement Learning☆34Updated 2 months ago
- Transformer in RL for decision-making☆75Updated last year
- A DRL implementation repo☆19Updated last month