BY571 / Normalized-Advantage-Function-NAF-
PyTorch implementation of Normalized Advantage Function (NAF), a Q-learning algorithm for continuous control problems, with Prioritized Experience Replay (PER) and an N-step method
☆29 Updated 4 years ago
Alternatives and similar repositories for Normalized-Advantage-Function-NAF-:
Users who are interested in Normalized-Advantage-Function-NAF- are comparing it to the repositories listed below:
- Robust and safe deep reinforcement learning algorithms ☆11 Updated 10 months ago
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM) on Pyramid env, Unity ML ☆14 Updated last year
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method ☆64 Updated last year
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21) ☆81 Updated last year
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment ☆42 Updated 5 months ago
- Baseline implementation of recurrent PPO using truncated BPTT ☆134 Updated 9 months ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research. ☆55 Updated 8 months ago
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021) ☆31 Updated 2 years ago
- PyTorch implementation of "Safe Exploration in Continuous Action Spaces" [Dalal et al.] ☆69 Updated 5 years ago
- RL Algorithms for Visual Continuous Control ☆33 Updated last year
- Code for the paper "Delay-Aware Model-Based Reinforcement Learning for Continuous Control".