theevann / MinimaxQ-Learning
Applying the minimax-Q learning algorithm to two-agent games
☆33Updated 8 years ago
Alternatives and similar repositories for MinimaxQ-Learning
Users that are interested in MinimaxQ-Learning are comparing it to the libraries listed below
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆122Updated 5 years ago
- Curiosity-driven Exploration by Self-supervised Prediction☆146Updated 2 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆53Updated 8 months ago
- Code for "Proximal Distilled Evolutionary Reinforcement Learning", accepted at AAAI 2020☆55Updated last year
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆86Updated 3 years ago
- Implementation of Generative Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym.☆89Updated 7 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆132Updated 2 years ago
- ☆116Updated 2 years ago
- PyTorch implementation of Soft Actor-Critic(SAC).☆105Updated 5 years ago
- This is a PyTorch implementation of Distributed Proximal Policy Optimization (DPPO).☆63Updated 7 years ago
- Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotli…☆139Updated 4 years ago
- ☆77Updated 8 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆147Updated 7 years ago
- Lightweight multi-agent gridworld Gym environment☆213Updated 2 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆112Updated 4 years ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)☆56Updated 3 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆103Updated 10 months ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆334Updated 4 years ago
- [ICML 2021] DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning☆32Updated 2 years ago
- Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization☆83Updated 2 years ago
- Adaptive Attention Span for Reinforcement Learning☆136Updated 5 years ago
- PyTorch implementation of GAIL and AIRL based on PPO.☆238Updated 5 years ago
- Learning Individual Intrinsic Reward in MARL☆63Updated 3 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023). Football game agent.☆67Updated 2 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆51Updated 3 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆41Updated 5 years ago
- Offline Multi-Agent Reinforcement Learning Implementations: Solving Overcooked Game with Data-Driven Method☆46Updated last year
- PyTorch implementation of FQF, IQN and QR-DQN.☆187Updated last year
- Level-based Foraging (LBF): A multi-agent environment for RL☆202Updated last year
- PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.☆149Updated 4 years ago