theevann / MinimaxQ-Learning
Applying the minimax-Q learning algorithm to two-agent games
☆33 · Updated 8 years ago
Alternatives and similar repositories for MinimaxQ-Learning
Users interested in MinimaxQ-Learning are comparing it to the libraries listed below.
- Random network distillation on Montezuma's Revenge and Super Mario Bros. ☆52 · Updated 7 months ago
- Curiosity-driven Exploration by Self-supervised Prediction ☆145 · Updated 2 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND) ☆41 · Updated 6 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function" ☆132 · Updated 2 years ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient) ☆56 · Updated 3 years ago
- Code for "Proximal Distilled Evolutionary Reinforcement Learning", accepted at AAAI 2020 ☆55 · Updated last year
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020) ☆86 · Updated 3 years ago
- PyTorch implementation of Soft Actor-Critic (SAC). ☆105 · Updated 5 years ago
- Combining Evolutionary Algorithms and deep RL in various ways ☆107 · Updated 5 years ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,… ☆121 · Updated 5 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017. ☆41 · Updated 5 years ago
- PyTorch implementation of Never Give Up: Learning Directed Exploration Strategies ☆58 · Updated 4 years ago
- [ICML 2021] DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning ☆32 · Updated 2 years ago
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning" ☆183 · Updated 2 years ago
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM) ☆148 · Updated 6 years ago
- Lightweight multi-agent gridworld Gym environment ☆213 · Updated 2 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020. ☆84 · Updated 4 years ago
- An unofficial implementation for online decision transformer ☆41 · Updated 3 years ago
- Implementation of Generative Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym. ☆89 · Updated 7 years ago
- [ICML 2019] Implementation of "Imitation Learning from Imperfect Demonstration" ☆49 · Updated 6 years ago
- [ECCV2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations. ☆95 · Updated 2 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020. ☆103 · Updated 10 months ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery ☆109 · Updated 3 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016 ☆141 · Updated last year
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing ☆137 · Updated 4 months ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory ☆200 · Updated last year
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023), a football game agent ☆64 · Updated 2 years ago
- This repository contains the implementation for the paper - Exploration via Hierarchical Meta Reinforcement Learning. ☆62 · Updated 6 years ago
- Experiments with reinforcement learning and recurrent neural networks ☆114 · Updated 2 years ago
- Proximal policy optimization in PyTorch. Easy to read and understand. ☆51 · Updated 5 years ago