anh-nn01 / Lunar-Lander-Double-Deep-Q-Networks
An AI agent that use Double Deep Q-learning to teach itself to land a Lunar Lander on OpenAI universe
☆16Updated 4 years ago
Alternatives and similar repositories for Lunar-Lander-Double-Deep-Q-Networks:
Users that are interested in Lunar-Lander-Double-Deep-Q-Networks are comparing it to the libraries listed below
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics…☆81Updated 4 years ago
- Clean implementation of Multi-Agent Reinforcement Learning methods (MADDPG, MATD3, MASAC, MAD4PG) in TensorFlow 2.x☆148Updated last year
- UAV Logistics Environment for Multi-Agent Reinforcement Learning / Unity ML-Agents / Unity 3D☆93Updated last year
- Code for our paper: Scalable Multi-Agent Reinforcement Learning through Intelligent Information Aggregation☆108Updated 3 months ago
- ICML 2019 RL for Real Life Workshop: Recurrent MADDPG for Partially Observable and Limited Communication Settings☆48Updated 5 years ago
- Implementation of the Nash Q-Learning algorithm to solve simple MARL problems with two agents.☆22Updated 2 years ago
- Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space☆45Updated 3 years ago
- implementation of MADDPG using PyTorch and multiagent-particle-envs☆36Updated 2 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆51Updated 3 years ago
- A simple example of how to implement vector based DDPG for MARL tasks using PyTorch and a ML-Agents environment.☆37Updated 6 years ago
- Project on multi agent reinforcement learning applied on patrolling agents☆39Updated 5 years ago
- The official code releasement of publications in MARL field of TJU RL lab.☆74Updated 2 years ago
- Jax and Torch Multi-Agent SAC on PettingZoo API☆77Updated 5 months ago
- multi agent RL for traffic light control in Sumo using distributed PPO☆122Updated last year
- Code for Weighted QMIX☆135Updated 4 years ago
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆127Updated last year
- ☆47Updated 5 years ago
- scalable multi agents reinforcement learning☆61Updated 7 years ago
- ☆96Updated 3 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆50Updated 2 months ago
- ☆85Updated 3 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆89Updated last year
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."☆59Updated 4 years ago
- ☆11Updated 5 years ago
- PyTorch implementation of Constrained Policy Optimization☆53Updated 3 years ago
- QMIX implemented in TensorFlow 2☆17Updated 3 years ago
- A Pytorch implementation of the multi agent deep deterministic policy gradients (MADDPG) algorithm☆342Updated 4 years ago
- Algorithm that combines QMIX with SAC for Multi-Agent Reinforcement Learning.☆48Updated 2 years ago
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆65Updated 3 years ago
- RLToolkit is a flexible and high-efficient reinforcement learning framework. Include implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG,…☆17Updated last year