BY571 / Normalized-Advantage-Function-NAF-Links
PyTorch implementation of the Q-Learning Algorithm Normalized Advantage Function for continuous control problems + PER and N-step Method
☆28Updated 4 years ago
Alternatives and similar repositories for Normalized-Advantage-Function-NAF-
Users that are interested in Normalized-Advantage-Function-NAF- are comparing it to the libraries listed below
Sorting:
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆90Updated last year
- Safe Multi-Agent Robosuite benchmark for safe multi-agent reinforcement learning research.☆24Updated last year
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆66Updated last year
- Formulating Model-based RL Dynamics as a continuous rather then one step prediction☆36Updated 3 years ago
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method☆67Updated 2 years ago
- Implementation of Recovery RL: Safe Reinforcement Learning With Learned Recovery Zones.☆59Updated 2 years ago
- Baseline implementation of recurrent PPO using truncated BPTT☆155Updated last year
- Model Predictive Actor-Critic Reinforcement Learning☆66Updated 3 years ago
- JAX and PZ RL envs + algorithms for swarms of CrazyFlies☆81Updated last year
- A curated list of awesome Model-based reinforcement learning resources☆95Updated 5 years ago
- RL Algorithms for Visual Continuous Control☆34Updated 2 years ago
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)☆33Updated 3 years ago
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment☆72Updated last year
- Code for the paper "AlwaysSafe: Reinforcement Learning Without Safety Constraint Violations During Training"☆17Updated 3 years ago
- behavior cloning from observation☆36Updated 4 years ago
- A library for building reinforcement learning and imitation learning agents in Pytorch☆60Updated 5 years ago
- Advantage weighted Actor Critic for Offline RL☆50Updated 3 years ago
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.☆68Updated 2 years ago
- ☆40Updated 4 years ago
- DecentralizedLearning☆25Updated 2 years ago
- Code accompanying HAAR paper, NeurIPS 2019 - Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards☆32Updated 2 years ago
- The Starcraft Multi-Agent challenge lite☆41Updated last year
- Inverse Reinforcement Learning via State Marginal Matching, CoRL 2020☆45Updated 2 years ago
- using recurrent networks(LSTM) to solve POMDPs☆35Updated 7 years ago
- Codebase for Model-Based Meta-Reinforcement Learning for Flight with Suspended Payloads paper. Website: https://sites.google.com/view/met…☆33Updated 2 years ago
- ☆20Updated 2 years ago
- [ICML 2020] Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control☆122Updated 5 years ago
- Code for the NeurIPS 2021 paper "Safe Reinforcement Learning by Imagining the Near Future"☆48Updated 3 years ago
- Pytorch implementation of "Safe Exploration in Continuous Action Spaces" [Dalal et al.]☆73Updated 6 years ago
- ☆23Updated last year