BY571 / Normalized-Advantage-Function-NAF-
PyTorch implementation of the Q-learning algorithm Normalized Advantage Function (NAF) for continuous control problems, with PER and the n-step method
☆28Updated 4 years ago
Alternatives and similar repositories for Normalized-Advantage-Function-NAF-
Users who are interested in Normalized-Advantage-Function-NAF- are comparing it to the repositories listed below
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆90Updated 2 years ago
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method☆67Updated 2 years ago
- Safe Multi-Agent Robosuite benchmark for safe multi-agent reinforcement learning research.☆25Updated last year
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆69Updated last year
- Formulating Model-based RL Dynamics as a continuous rather than one-step prediction☆36Updated 3 years ago
- Baseline implementation of recurrent PPO using truncated BPTT☆156Updated last year
- Model Predictive Actor-Critic Reinforcement Learning☆67Updated 4 years ago
- A curated list of awesome Model-based reinforcement learning resources☆95Updated 5 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆174Updated last year
- Novel Reinforcement Learning method for tackling goal-oriented robotics tasks with obstacles.☆38Updated 3 years ago
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)☆33Updated 3 years ago
- ☆46Updated last week
- behavior cloning from observation☆36Updated 4 years ago
- DecentralizedLearning☆24Updated 2 years ago
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.☆68Updated 2 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆137Updated 3 months ago
- RL Algorithms for Visual Continuous Control☆36Updated 2 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆112Updated 4 years ago
- PyTorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments☆29Updated 5 years ago
- Advantage weighted Actor Critic for Offline RL☆50Updated 3 years ago
- Implementation of Recovery RL: Safe Reinforcement Learning With Learned Recovery Zones.☆59Updated 2 years ago
- The Starcraft Multi-Agent challenge lite☆42Updated last year
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.☆80Updated 2 years ago
- JAX and PZ RL envs + algorithms for swarms of CrazyFlies☆82Updated last year
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment☆72Updated last year
- A simple implementation of Generative Adversarial Imitation Learning with PyTorch☆170Updated 3 years ago
- Gym-like extensions for POMDP☆57Updated 4 years ago
- A library for building reinforcement learning and imitation learning agents in PyTorch☆60Updated 5 years ago
- The repository is intended as a support tool for the report of the project "Sim to Real transfer of Reinforcement Learning Policies in Ro…☆13Updated 2 years ago
- DDPGfD: This is our implementation project for the Reinforcement Learning course in NCTU.☆35Updated 3 years ago