vwxyzjn / invalid-action-maskingLinks

Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms

☆159

Alternatives and similar repositories for invalid-action-masking

Users that are interested in invalid-action-masking are comparing it to the libraries listed below

Sorting:

cyanrain7 / TRPO-in-MARL
☆214Updated 2 years ago
RunzheYang / MORL
Multi-Objective Reinforcement Learning
☆280Updated 3 years ago
oxwhirl / wqmix
Code for Weighted QMIX
☆138Updated 4 years ago
matteokarldonati / Counterfactual-Multi-Agent-Policy-Gradients
PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."
☆59Updated 5 years ago
cycraig / MP-DQN
Source code for the dissertation: "Multi-Pass Deep Q-Networks for Reinforcement Learning with Parameterised Action Spaces"
☆219Updated 6 years ago
011235813 / hierarchical-marl
Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery
☆103Updated 3 years ago
toshikwa / sac-discrete.pytorch
PyTorch implementation of SAC-Discrete.
☆307Updated last year
gxywy / rl-plotter
A plotter for reinforcement learning (RL)
☆227Updated 3 years ago
zoeyuchao / mappo
This is the official implementation of Multi-Agent PPO.
☆112Updated 2 years ago
DKuan / MADDPG_torch
The code for maddpg using pytorch
☆170Updated 4 years ago
Bigpig4396 / PyTorch-Counterfactual-Multi-Agent-Policy-Gradients-COMA
☆76Updated 5 years ago
isp1tze / MAProj
Multi-agent project (commnet, bicnet, maddpg) in pytorch for Multi-Agent Particle Environment
☆117Updated 2 years ago
AboudyKreidieh / h-baselines
A repository of high-performing hierarchical reinforcement learning models and algorithms.
☆317Updated 2 years ago
deligentfool / dqn_zoo
The implement of all kinds of dqn reinforcement learning with Pytorch
☆93Updated 4 years ago
oxwhirl / facmac
☆101Updated 3 years ago
wjh720 / QPLEX
☆97Updated 4 years ago
axelabels / DynMORL
Code for Dynamic Weights in Multi-Objective Deep Reinforcement Learning
☆97Updated 2 years ago
thomashirtz / gym-hybrid
Collection of OpenAI parametrized action-space environments.
☆65Updated 4 months ago
Felhof / DiscreteSAC
☆40Updated 3 years ago
chauncygu / Multi-Agent-Constrained-Policy-Optimisation
Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).
☆181Updated last year
cts198859 / deeprl_network
multi-agent deep reinforcement learning for networked system control.
☆420Updated 4 years ago
TonghanWang / NDQ
Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)
☆81Updated 2 years ago
kevslinger / DTQN
Deep Transformer Q-Networks for Partially Observable Reinforcement Learning
☆165Updated last year
rhoowd / sched_net
☆89Updated 3 years ago
atavakol / action-branching-agents
(AAAI 2018) Action Branching Architectures for Deep Reinforcement Learning
☆116Updated 2 years ago
henrycharlesworth / multi_action_head_PPO
PPO with multi-head/autoregressive action outputs
☆42Updated 4 years ago
BY571 / SAC_discrete
PyTorch implementation of the discrete Soft-Actor-Critic algorithm.
☆53Updated 3 years ago
hsvgbkhgbv / SQDDPG
This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled …
☆119Updated 9 months ago
nikhilbarhate99 / Hierarchical-Actor-Critic-HAC-PyTorch
PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments
☆318Updated 3 years ago
Sonkyunghwan / QTRAN
There will be updates later
☆84Updated 6 years ago