BY571 / Normalized-Advantage-Function-NAF-
PyTorch implementation of the Q-Learning Algorithm Normalized Advantage Function for continuous control problems + PER and N-step Method
☆28Updated 4 years ago
Alternatives and similar repositories for Normalized-Advantage-Function-NAF-
Users who are interested in Normalized-Advantage-Function-NAF- are comparing it to the libraries listed below
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆90Updated 2 years ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆70Updated last year
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method☆66Updated 2 years ago
- Safe Multi-Agent Robosuite benchmark for safe multi-agent reinforcement learning research.☆26Updated last year
- Baseline implementation of recurrent PPO using truncated BPTT☆159Updated last year
- Formulating Model-based RL Dynamics as a continuous rather than one-step prediction☆36Updated 3 years ago
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.☆85Updated last week
- Model Predictive Actor-Critic Reinforcement Learning☆68Updated 4 years ago
- Advantage weighted Actor Critic for Offline RL☆52Updated 3 years ago
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment☆72Updated last year
- A library for building reinforcement learning and imitation learning agents in PyTorch☆61Updated 5 years ago
- RL Algorithms for Visual Continuous Control☆36Updated 2 years ago
- ☆48Updated last month
- Novel Reinforcement Learning method for tackling goal-oriented robotics tasks with obstacles.☆38Updated 3 years ago
- Implementation of Recovery RL: Safe Reinforcement Learning With Learned Recovery Zones.☆59Updated 2 years ago
- A curated list of awesome Model-based reinforcement learning resources☆95Updated 5 years ago
- Implementation of centralized training, centralized execution of Soft Actor-Critic (SAC) on a Tennis multiagent Unity environment.☆41Updated 4 years ago
- A modified benchmark for designing and controlling 2D Voxel-based Soft Robots☆38Updated 2 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆176Updated last year
- behavior cloning from observation☆38Updated 5 years ago
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)☆33Updated 3 years ago
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.☆71Updated 2 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆137Updated 4 months ago
- DDPGfD: This is our implementation project for the Reinforcement Learning course in NCTU.☆35Updated 3 years ago
- JAX and PZ RL envs + algorithms for swarms of CrazyFlies☆87Updated last year
- Robust and safe deep reinforcement learning algorithms☆16Updated last year
- IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARL☆45Updated 3 months ago
- Gym-like extensions for POMDP☆56Updated 4 years ago
- PyTorch code accompanying the paper "Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning" (NeurIPS 2020 spot…☆44Updated 2 years ago
- [ICML 2020] Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control☆125Updated 5 years ago