lukeluocn / dqn-breakout
Play Breakout with DQN in PyTorch.
☆11Updated 3 years ago
Alternatives and similar repositories for dqn-breakout:
Users that are interested in dqn-breakout are comparing it to the libraries listed below
- DQN with PyTorch on Breakout and SpaceInvaders☆25Updated 5 years ago
- A plotter for reinforcement learning (RL)☆218Updated 3 years ago
- PPO implementation for MuJoCo environments such as Ant-v2, Humanoid-v2, Hopper-v2, and HalfCheetah-v2.☆49Updated 4 years ago
- Re-implementations of SOTA RL algorithms.☆129Updated last year
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆149Updated last year
- A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch.☆22Updated last month
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆72Updated 2 years ago
- PPO, DDPG, and SAC implementations on MuJoCo environments☆96Updated 3 years ago
- Code for MADDPG using PyTorch☆165Updated 4 years ago
- Must-read papers on Reinforcement Learning (RL)☆45Updated 4 years ago
- Code accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks" (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …☆71Updated 2 months ago
- CS285 Homework☆26Updated 4 years ago
- Actor Critic model to play Cartpole game☆52Updated 6 years ago
- This repository is the official implementation of ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot Coordination. P…☆34Updated last month
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆122Updated 6 months ago
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te…☆164Updated last year
- PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms☆19Updated this week
- DQN to play Atari Pong☆113Updated 6 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆128Updated last year
- Code for "On the Utility of Learning about Humans for Human-AI Coordination"☆108Updated last year
- Half Field Offense in Robocup 2D Soccer with reinforcement learning☆34Updated 3 years ago
- DeepRL algorithm implementations that are easy to read and understand, in PyTorch and TensorFlow 2 (DQN, REINFORCE, VPG, A2C, TRPO, PPO, DD…☆328Updated last year
- Chinese documentation for Tianshou (天授)☆55Updated 2 months ago
- The official implementation of "ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind" (ICLR 2022).☆58Updated 3 months ago