yatharthgarg / Reinforcement-LearningLinks

Using WoLF (win or learn fast) PHC (policy hill climbing) algorithm to implement stochastic games

☆14

Alternatives and similar repositories for Reinforcement-Learning

Users that are interested in Reinforcement-Learning are comparing it to the libraries listed below

Sorting:

LeoZhengZLY / stackelberg-actor-critic-algos
☆41Updated 3 years ago
JohannesAck / tf2multiagentrl
Clean implementation of Multi-Agent Reinforcement Learning methods (MADDPG, MATD3, MASAC, MAD4PG) in TensorFlow 2.x
☆157Updated last year
skumar9876 / FCRL
Implementation of "Federated Control with Hierarchical Multi-Agent Deep Reinforcement Learning" (https://arxiv.org/pdf/1712.08266.pdf)
☆39Updated 6 years ago
LoveDoveDog / MAPPO
Lightweight multi-agent PPO for IEEE field.
☆14Updated 3 years ago
Metro1998 / P-DQN
Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space
☆50Updated 3 years ago
maohangyu / marl_demo
demo of multi-agent reinforcement learning algorithms, such as ATT-MADDPG (Modelling the Dynamic Joint Policy of Teammates with Attention…
☆60Updated 4 years ago
jianzhnie / RLToolkit
RLToolkit is a flexible and high-efficient reinforcement learning framework. Include implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG,…
☆17Updated last year
minruixu / MAFRL
code for
☆10Updated 4 years ago
ffelten / MASAC
Jax and Torch Multi-Agent SAC on PettingZoo API
☆86Updated 7 months ago
rhoowd / sched_net
☆86Updated 3 years ago
zouchangjie / RL-Nash-Q-learning
强化学习中纳什Qlearning 实现矩阵博弈
☆30Updated 6 years ago
TroddenSpade / Meta-Reinforcement-Learning
Code snippets of Meta Reinforcement Learning algorithms
☆38Updated last year
restorenode / mappo-competitive-reinforcement
🎾 Multi-Agent Proximal Policy Optimization approach to a competitive reinforcement learning problem
☆22Updated 2 years ago
FlickerNiko / SAC-QMIX
Algorithm that combines QMIX with SAC for Multi-Agent Reinforcement Learning.
☆50Updated 3 years ago
JohannesAck / MATD3implementation
Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics…
☆87Updated 4 years ago
X-I-N / my_PDQN
my code for paper Parameterized-DQN
☆22Updated 4 years ago
Happymic / SkyNetRL-Multi-Agent-Reinforcement-Learning-for-Space-Air-Ground-Networks
A multi-agent reinforcement learning framework for optimizing coverage and connectivity in Space-Air-Ground integrated networks. This pro…
☆25Updated 4 months ago
BIT-MCS / DRL-CEWS
[ICDE 2020] Curiosity-Driven Energy-Efficient Worker Scheduling in Vehicular Crowdsourcing: A Deep Reinforcement Learning Approach
☆17Updated 3 years ago
EnnaSachdeva / Recurrent-Multiagent-Deep-Deterministic-Policy-Gradient-with-Difference-Rewards
Deep Reinforcement Learning (DRL) algorithms have been successfully applied to a range of challenging simulated continuous control single…
☆51Updated 6 years ago
MISTCARRYYOU / MASA-QMIX
Codes for paper of 'Solving job scheduling problems in a resource preemption environment with multi-agent reinforcement learning'
☆46Updated 2 years ago
DRACOsource / draco
☆16Updated 3 years ago
BIT-MCS / GCRL-min-AoI
[INFOCOM 2022] AoI-minimal UAV Crowdsensing by Model-based Graph Convolutional Reinforcement Learning
☆54Updated 2 years ago
Zhuzzq / EdgeFed-MARL-MEC
☆73Updated 3 years ago
livey / scalable_maddpg
scalable multi agents reinforcement learning
☆62Updated 7 years ago
yerfor / Soft-DRGN
Official Pytorch implementation of Soft-DRGN (IEEE trans on Mobile Computing 2022)
☆37Updated 3 years ago
RobvanGastel / rnn-sac
Implementation of the Discrete Soft Actor-Critic algorithm with RNN policy in PyTorch
☆27Updated 2 years ago
bdvllrs / marl-patrolling-agents
Project on multi agent reinforcement learning applied on patrolling agents
☆40Updated 5 years ago
xiaoyandong08 / maddpg-mpe
Transplant a implementation of MADDPG to the environment provided by openAI (multiagent-particle-envs).
☆20Updated 7 years ago
sailik1991 / MarkovGameSolvers
This is code for finding the minimax/nash/stackelberg strategy of players in Markov Games.
☆25Updated 3 weeks ago
philtabor / Multi-Agent-Deep-Deterministic-Policy-Gradients
A Pytorch implementation of the multi agent deep deterministic policy gradients (MADDPG) algorithm
☆350Updated 4 years ago