BY571 / Normalized-Advantage-Function-NAF-
PyTorch implementation of Normalized Advantage Function (NAF), a Q-learning algorithm for continuous control problems, with prioritized experience replay (PER) and the n-step method
☆29 Updated 4 years ago
Alternatives and similar repositories for Normalized-Advantage-Function-NAF-:
Users who are interested in Normalized-Advantage-Function-NAF- are comparing it to the repositories listed below
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research. ☆60 Updated 9 months ago
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM) on Pyramid env, Unity ML ☆15 Updated last year
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21) ☆83 Updated last year
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning) ☆106 Updated 3 years ago
- Baseline implementation of recurrent PPO using truncated BPTT ☆138 Updated 11 months ago
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment ☆43 Updated 6 months ago
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment ☆65 Updated 6 months ago
- Robust and safe deep reinforcement learning algorithms ☆13 Updated last year
- PyTorch implementation of "Safe Exploration in Continuous Action Spaces" [Dalal et al.] ☆70 Updated 5 years ago
- An implementation of the deep reinforcement learning algorithm TD3 with a prioritized experience replay (PER) buffer ☆23 Updated 5 years ago
- ☆48 Updated 3 years ago
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method