cathywu / rllab-multiagent
☆11 Updated 2 years ago
Alternatives and similar repositories for rllab-multiagent:
Users that are interested in rllab-multiagent are comparing it to the libraries listed below
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards ☆27 Updated 5 years ago
- Environments with IC3Net paper ☆12 Updated 6 years ago
- FLUIDS is a lightweight driving simulator for benchmarking Deep Reinforcement and Imitation learning algorithms. ☆23 Updated 5 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019. ☆55 Updated 5 years ago
- FEN Code ☆37 Updated 5 years ago
- Hierarchical deep reinforcement learning algorithms ☆41 Updated 7 years ago
- The Reinforcement-Learning-Related Papers of ICLR 2019 ☆47 Updated 5 years ago
- Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings ☆95 Updated 6 years ago
- TD-Regularized Actor-Critic Methods ☆34 Updated 5 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019) ☆17 Updated 2 years ago
- Robust policy search algorithms which train on model ensembles ☆28 Updated 8 years ago
- Deep Reinforcement Learning for Multi Agent Soccer ☆17 Updated 8 years ago
- Deep Recurrent Attention Reinforcement Learning in Atari ☆83 Updated 6 years ago
- A collection of multi-agent reinforcement learning OpenAI gym environments ☆45 Updated 4 years ago
- PyTorch implementation of CommNet ☆36 Updated 7 years ago
- NGSIM Driving RL/Imitation learning environment compatible with rllab ☆12 Updated 6 years ago
- Collaborative Deep Reinforcement Learning ☆32 Updated 7 years ago
- A Simple Example for Imitation Learning with Dataset Aggregation (DAGGER) on Torcs Env ☆70 Updated 7 years ago
- PyTorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge ☆93 Updated 2 years ago
- Jointly learning policies and latent representations for driver behavior. ☆15 Updated 7 years ago
- IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO) ☆23 Updated 2 years ago
- Code for training policies based on paper Coordinated Multi-Agent Imitation Learning ☆26 Updated 7 years ago
- Reimplementation of the DDPG algorithm using TensorFlow ☆38 Updated 8 years ago
- PGQ is an approach to combine Policy Gradient and Q-Learning. This repository will contain an implementation of PGQ. ☆15 Updated 7 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning ☆27 Updated 5 years ago
- ☆35 Updated 6 years ago
- Safe Policy Improvement with Baseline Bootstrapping ☆26 Updated 4 years ago
- Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstractions and Intrinsic Motivation ☆87 Updated 6 years ago
- Maximum Causal Entropy Inverse Reinforcement Learning ☆47 Updated 6 years ago
- Safe Option-Critic: Learning Safety in the Option-Critic Architecture ☆20 Updated 6 years ago