BY571 / Normalized-Advantage-Function-NAF-
PyTorch implementation of the Q-Learning Algorithm Normalized Advantage Function for continuous control problems + PER and N-step Method
☆27Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for Normalized-Advantage-Function-NAF-
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆51Updated 5 months ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM) on Pyramid env, Unity ML☆14Updated 11 months ago
- An implementation of deep reinforcement learning TD3 algorithm with prioritized experience replay (PER) buffer☆23Updated 5 years ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆79Updated last year
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method☆62Updated last year
- Model-Free Safe Reinforcement Learning through Neural Barrier Certificate☆30Updated 6 months ago
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment☆40Updated 2 months ago
- Model Predictive Actor-Critic Reinforcement Learning☆52Updated 3 years ago
- The implementation of LSTM-TD3.☆64Updated last year
- Pytorch implementation of "Safe Exploration in Continuous Action Spaces" [Dalal et al.]☆66Updated 5 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆95Updated 3 years ago
- IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARL☆35Updated 2 months ago
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)☆32Updated 2 years ago
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment☆58Updated 2 months ago
- Formulating Model-based RL Dynamics as a continuous rather then one step prediction☆35Updated 2 years ago
- Robust and safe deep reinforcement learning algorithms☆10Updated 7 months ago
- Official implementation of the algorithmic approach presented in the research paper entitled "Risk-Sensitive Policy with Distributional R…☆15Updated last year
- Advantage weighted Actor Critic for Offline RL☆47Updated 2 years ago
- JAX and PZ RL envs + algorithms for swarms of CrazyFlies☆63Updated 2 months ago
- Transformer in RL for decision-making☆75Updated last year
- Official open-source implementation of ICML 2022 paper: Reachability Constrainted Reinforcement Learning.☆31Updated 2 years ago
- Source files to replicate experiments in my ICLR 2022 paper.☆62Updated 4 months ago
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.☆60Updated last year
- Implementation of Recovery RL: Safe Reinforcement Learning With Learned Recovery Zones.☆52Updated last year
- An MPC algorithm which supports polytopic state and action constraints, using CEM optimisation.☆13Updated 5 years ago
- PyTorch Implementation of Hamilton-Jacobi DQN☆14Updated 3 years ago
- Baseline implementation of recurrent PPO using truncated BPTT☆125Updated 6 months ago
- RL Algorithms for Visual Continuous Control☆32Updated last year
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.☆62Updated last year
- Implementation of Off Policy Adversarial Inverse Reinforcement Learning☆21Updated 4 years ago