theevann / MinimaxQ-LearningLinks
Applying minimaxQ learning algorithm to 2 agents games
☆33Updated 8 years ago
Alternatives and similar repositories for MinimaxQ-Learning
Users that are interested in MinimaxQ-Learning are comparing it to the libraries listed below
Sorting:
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆121Updated 4 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆52Updated 7 months ago
- Curiosity-driven Exploration by Self-supervised Prediction☆145Updated 2 years ago
- Lightweight multi-agent gridworld Gym environment☆212Updated 2 years ago
- PyTorch implementation of Soft Actor-Critic(SAC).☆105Updated 5 years ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)☆56Updated 3 years ago
- ☆77Updated 7 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆41Updated 5 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆107Updated 3 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆147Updated 6 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆84Updated 4 years ago
- Code for "Proximal Distilled Evolutionary Reinforcement Learning", accepted at AAAI 2020☆55Updated last year
- Multi-Agent Adversarial Inverse Reinforcement Learning, ICML 2019.☆218Updated 6 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆331Updated 4 years ago
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆85Updated 3 years ago
- Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotli…☆137Updated 4 years ago
- Negative Update Intervals in Multi-Agent Deep Reinforcement Learning☆34Updated 6 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆112Updated 4 years ago
- Level-based Foraging (LBF): A multi-agent environment for RL☆198Updated last year
- Combining Evolutionary Algorithms and deep RL in various ways☆107Updated 5 years ago
- A collection of recent MARL papers☆99Updated last year
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆130Updated 2 years ago
- ☆115Updated 2 years ago
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"☆183Updated 2 years ago
- [ICML 2021] DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning☆32Updated 2 years ago
- Gridworld for MARL experiments☆143Updated 4 years ago
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…☆54Updated last month
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC☆100Updated 6 years ago
- Github repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options☆46Updated 4 years ago
- Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization☆83Updated 2 years ago