AurelianTactics / dqnclipped_dqnreg_prelim_implementation
Implementing DQNClipped and DQNReg Algorithms
☆10Updated 4 years ago
Alternatives and similar repositories for dqnclipped_dqnreg_prelim_implementation
Users who are interested in dqnclipped_dqnreg_prelim_implementation are comparing it to the libraries listed below
- Revisiting Discrete Soft Actor-Critic Accepted by Transactions on Machine Learning Research (TMLR)☆20Updated 5 months ago
- A novel preference-driven multi-objective reinforcement learning algorithm using a single policy network that covers the entire preferenc…☆30Updated last year
- A clean and robust Pytorch implementation of SAC on discrete action space☆37Updated 6 months ago
- Implementation code for GraphMIX: Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning☆32Updated 4 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆52Updated 3 years ago
- ☆42Updated 3 years ago
- ☆27Updated 4 years ago
- ☆21Updated last year
- qmix☆22Updated 4 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆28Updated 2 years ago
- The code for AAMAS2022 《GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning》☆41Updated 3 years ago
- ☆39Updated 2 years ago
- Bayesian Soft Actor Critic☆14Updated 2 years ago
- Official Repository for "Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing" (ICML2021)☆21Updated 3 years ago
- Codes for the paper "HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism"☆22Updated 2 years ago
- Decision Transformer: A brand new Offline RL Pattern.☆36Updated 3 years ago
- ☆40Updated 3 years ago
- An implementation of the policy gradient RL algorithm in PyTorch☆38Updated 4 years ago
- Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"☆13Updated 3 years ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)☆61Updated last year
- Revisiting Discrete Gradient Estimation in MADDPG☆24Updated 2 years ago
- Implementation of DyMA-CL, MARL algorithm☆27Updated 5 years ago
- This is the official implementation of ERL-Re2.☆64Updated 10 months ago
- Official repository of the paper TransfQMix: Transformers for Leveraging the Graph Structure of Multi-Agent Reinforcement Learning Proble…☆49Updated last year
- Simple code for reinforcement learning☆20Updated 4 years ago
- Paper list for constrained policy optimization in reinforcement learning.☆72Updated last year
- ☆29Updated 2 years ago
- The official code release for the TJU RL lab's publications in the MARL field.☆77Updated 2 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆87Updated last year
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."☆58Updated 5 years ago