theevann / MinimaxQ-LearningLinks
Applying minimaxQ learning algorithm to 2 agents games
☆33Updated 7 years ago
Alternatives and similar repositories for MinimaxQ-Learning
Users that are interested in MinimaxQ-Learning are comparing it to the libraries listed below
Sorting:
- Curiosity-driven Exploration by Self-supervised Prediction☆142Updated 2 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆53Updated 5 months ago
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆82Updated 2 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆147Updated 6 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆41Updated 5 years ago
- Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems☆32Updated 7 years ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆193Updated last year
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆84Updated 4 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆103Updated 7 months ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆121Updated 4 years ago
- Pytorch implementation of distributed deep reinforcement learning☆76Updated 3 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆50Updated 2 years ago
- Adaptive Attention Span for Reinforcement Learning☆135Updated 5 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆46Updated 6 years ago
- ppo-lstm-parallel☆47Updated 6 years ago
- Learning Individual Intrinsic Reward in MARL☆63Updated 2 years ago
- ☆114Updated 2 years ago
- Lightweight multi-agent gridworld Gym environment☆211Updated 2 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆138Updated last year
- ☆26Updated 7 years ago
- Curriculum-guided Hindsight Experience Replay (NeurIPS-2019)☆67Updated 5 years ago
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"☆181Updated 2 years ago
- This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).☆63Updated 7 years ago
- ☆77Updated 7 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆30Updated 6 years ago
- ☆78Updated last year
- PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.☆148Updated 3 years ago
- Prioritized Experience Replay implementation with proportional prioritization☆84Updated 2 years ago
- Experiments with reinforcement learning and recurrent neural networks☆115Updated 2 years ago
- Implementation of Generatve Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym.☆89Updated 6 years ago