BY571 / D4PG
PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2RL which can be added to D4PG to improve its performance.
☆18Updated 3 years ago
Alternatives and similar repositories for D4PG:
Users that are interested in D4PG are comparing it to the libraries listed below
- Official implementation of the algorithmic approach presented in the research paper entitled "Risk-Sensitive Policy with Distributional R…☆15Updated 2 years ago
- A collection of environments and reference agents for planning and reinforcement learning research in partially observable, multi-agent …☆18Updated 2 months ago
- ☆19Updated 7 months ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"☆38Updated 3 years ago
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method☆64Updated last year
- DecentralizedLearning☆22Updated 2 years ago
- Prioritized Sequence Experience Replay☆10Updated 3 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 3 years ago
- Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments☆27Updated 4 years ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆79Updated last year
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆26Updated 2 years ago
- This repository has code for the paper "Model-based Safe Deep Reinforcement Learning via a Constrained Proximal Policy Optimization Algor…☆26Updated last year
- PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)☆39Updated 2 years ago
- Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement…☆20Updated 2 years ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN☆43Updated 4 years ago
- Code accompanying the paper "Action Robust Reinforcement Learning and Applications in Continuous Control" https://arxiv.org/abs/1901.0918…☆42Updated 5 years ago
- Generalized Proximal Policy Optimization with Sample Reuse (GePPO)☆20Updated last year
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆52Updated 7 months ago
- MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆44Updated 10 months ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆134Updated 5 months ago
- Model-based reinforcement learning using CEM, MPC and PETS☆16Updated 5 years ago
- PyTorch IMPALA implementation☆25Updated 5 years ago
- Implementations of SAILR, PDO, and CSC☆31Updated 6 months ago
- Model-based Policy Gradients☆30Updated 4 years ago
- soft q learning and soft actor critic☆15Updated 6 years ago
- using recurrent networks(LSTM) to solve POMDPs☆35Updated 6 years ago
- Distributional Soft Actor Critic☆50Updated 4 years ago
- RL-Toolkit: A Research Framework for Robotics☆19Updated last week
- Implementation of Tactical Optimistic and Pessimistic value estimation☆24Updated last year
- Delayed RL agent for non-Atari tasks, from "Acting in Delayed Environments with Non-Stationary Markov Policies", ICLR 2021.☆14Updated last year