theevann / MinimaxQ-LearningLinks

Applying minimaxQ learning algorithm to 2 agents games

☆33

Alternatives and similar repositories for MinimaxQ-Learning

Users that are interested in MinimaxQ-Learning are comparing it to the libraries listed below

Sorting:

BY571 / DQN-Atari-Agents
DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…
☆121Updated 4 years ago
MarcoMeter / episodic-transformer-memory-ppo
Clean baseline implementation of PPO using an episodic TransformerXL memory
☆183Updated last year
jcwleo / curiosity-driven-exploration-pytorch
Curiosity-driven Exploration by Self-supervised Prediction
☆139Updated 2 years ago
alirezakazemipour / PPO-RND
Random network distillation on Montezuma's Revenge and Super Mario Bros.
☆51Updated 2 months ago
toshikwa / soft-actor-critic.pytorch
PyTorch implementation of Soft Actor-Critic(SAC).
☆103Updated 5 years ago
adik993 / ppo-pytorch
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
☆143Updated 6 years ago
fshamshirdar / pytorch-rdpg
PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)
☆55Updated 2 years ago
jerrodparker20 / adaptive-transformers-in-rl
Adaptive Attention Span for Reinforcement Learning
☆133Updated 5 years ago
navuboy / gail_gym
Implementation of Generatve Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym.
☆89Updated 6 years ago
AntoineTheb / RNN-RL
Experiments with reinforcement learning and recurrent neural networks
☆114Updated last year
lweitkamp / feudalnets-pytorch
PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.
☆41Updated 5 years ago
rohan-sawhney / multi-agent-rl
☆77Updated 7 years ago
toshikwa / gail-airl-ppo.pytorch
PyTorch implementation of GAIL and AIRL based on PPO.
☆222Updated 4 years ago
schatty / oprl
A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing
☆134Updated 3 weeks ago
TonghanWang / NDQ
Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)
☆81Updated 2 years ago
watakandai / hiro_pytorch
Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)
☆111Updated 4 years ago
ermongroup / MA-AIRL
Multi-Agent Adversarial Inverse Reinforcement Learning, ICML 2019.
☆210Updated 6 years ago
dhruvramani / Transformers-RL
An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"
☆180Updated 2 years ago
ArnaudFickinger / gym-multigrid
Lightweight multi-agent gridworld Gym environment
☆209Updated last year
Improbable-AI / eipo
Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization
☆82Updated 2 years ago
skumar9876 / Hierarchical-DQN
Implementation of the paper Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation - https:/…
☆86Updated 7 years ago
fschur / DDQN-with-PyTorch-for-OpenAI-Gym
Implementation of Double DQN reinforcement learning for OpenAI Gym environments with PyTorch.
☆69Updated 2 months ago
011235813 / hierarchical-marl
Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery
☆105Updated 3 years ago
Xingyu-Lin / mbpo_pytorch
A pytorch reprelication of the model-based reinforcement learning algorithm MBPO
☆175Updated 3 years ago
gjp1203 / nui_in_madrl
Negative Update Intervals in Multi-Agent Deep Reinforcement Learning
☆33Updated 6 years ago
salesforce / sibling-rivalry
Code for Sibling Rivalry and experiments presented in associated paper
☆18Updated 3 months ago
ReinholdM / Offline-Pre-trained-Multi-Agent-Decision-Transformer
☆112Updated 2 years ago
xtma / simple-pytorch-rl
Reinforcement Learning Methods with PyTorch
☆39Updated 5 years ago
johannah / bootstrap_dqn
Implementation of Bootstrap DQN and Randomized Prior Functions on ALE
☆54Updated 4 months ago
TianhongDai / distributed-ppo
This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).
☆62Updated 7 years ago