yz93 / Learn-to-Interpret-Atari-AgentsLinks
☆11Updated 5 years ago
Alternatives and similar repositories for Learn-to-Interpret-Atari-Agents
Users that are interested in Learn-to-Interpret-Atari-Agents are comparing it to the libraries listed below
Sorting:
- ☆201Updated 2 years ago
- Code for conservative Q-learning☆467Updated 4 years ago
- ☆359Updated 3 years ago
- ☆324Updated last year
- OPE Tools based on Empirical Study of Off Policy Policy Estimation paper.☆63Updated 3 years ago
- Benchmarking RL generalization in an interpretable way.☆173Updated last month
- Gridworld for MARL experiments☆144Updated 4 years ago
- OpenAI Gym wrapper for the DeepMind Control Suite☆226Updated last year
- The Implementation of "Machine Theory of Mind", ICML 2018☆26Updated 3 years ago
- A toolbox with the goal of speeding up research on bargaining in MARL (cooperation problems in MARL).☆32Updated 3 years ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆205Updated 3 years ago
- Submission for MAVEN: Multi-Agent Variational Exploration☆59Updated 3 years ago
- Conservative Q Learning on top of SAC☆132Updated 3 years ago
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)☆79Updated 3 years ago
- This is a minimal example to demonstrate how multi-agent reinforcement learning with differentiable communication channels and centralize…☆43Updated 2 years ago
- ☆44Updated 4 years ago
- Learning Invariant Representations for Reinforcement Learning without Reconstruction☆155Updated 4 years ago
- Keeping track of RL experiments☆165Updated 3 years ago
- This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.☆268Updated 5 years ago
- ☆52Updated 5 years ago
- Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games☆559Updated 2 years ago
- PyTorch implementation of DreamerV2 model-based RL algorithm☆236Updated 2 years ago
- Code for the paper "Phasic Policy Gradient"☆267Updated 2 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆139Updated last year
- ☆114Updated 2 years ago
- ☆246Updated last year
- OpenAI Gym wrapper for ViZDoom enviroments☆70Updated 4 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆190Updated 3 years ago
- Dream to Control: Learning Behaviors by Latent Imagination, implemented in PyTorch.☆321Updated last year
- Code for "On the Utility of Learning about Humans for Human-AI Coordination"☆110Updated 2 years ago