theevann / MinimaxQ-Learning
Applying the minimax-Q learning algorithm to two-agent games
☆33 · Updated 7 years ago
Alternatives and similar repositories for MinimaxQ-Learning
Users who are interested in MinimaxQ-Learning are comparing it to the repositories listed below.
- Curiosity-driven Exploration by Self-supervised Prediction ☆144 · Updated 2 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros. ☆52 · Updated 6 months ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,… ☆121 · Updated 4 years ago
- Lightweight multi-agent gridworld Gym environment ☆212 · Updated 2 years ago
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM) ☆146 · Updated 6 years ago
- PyTorch implementation of Soft Actor-Critic (SAC). ☆105 · Updated 5 years ago
- Code accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020) ☆84 · Updated 2 years ago
- Code for "Proximal Distilled Evolutionary Reinforcement Learning", accepted at AAAI 2020 ☆55 · Updated last year
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016) ☆46 · Updated 6 years ago
- [ICML 2019] Implementation of "Imitation Learning from Imperfect Demonstration" ☆48 · Updated 6 years ago
- Implementation of Generative Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym. ☆88 · Updated 6 years ago
- Curriculum-guided Hindsight Experience Replay (NeurIPS-2019) ☆68 · Updated 5 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016 ☆139 · Updated last year
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient) ☆56 · Updated 2 years ago
- ☆85 · Updated 6 years ago
- ☆78 · Updated last year
- Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems ☆32 · Updated 7 years ago
- Combining Evolutionary Algorithms and deep RL in various ways ☆106 · Updated 5 years ago
- ☆77 · Updated 7 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020. ☆84 · Updated 4 years ago
- This is a PyTorch implementation of Distributed Proximal Policy Optimization (DPPO). ☆63 · Updated 7 years ago
- Explorer is a PyTorch reinforcement learning framework for exploring new ideas. ☆97 · Updated 5 months ago
- Multi-Agent Adversarial Inverse Reinforcement Learning, ICML 2019. ☆214 · Updated 6 years ago
- Code for ICLR 2019 paper: Learning when to Communicate at Scale in Multiagent Cooperative and Competitive Tasks ☆219 · Updated 2 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing ☆137 · Updated 2 months ago
- GitHub repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options ☆46 · Updated 3 years ago
- This repository contains the implementation for the paper - Exploration via Hierarchical Meta Reinforcement Learning. ☆62 · Updated 6 years ago
- Experiments with reinforcement learning and recurrent neural networks ☆115 · Updated 2 years ago
- Level-based Foraging (LBF): A multi-agent environment for RL ☆197 · Updated last year
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations. ☆97 · Updated 5 years ago