lukeluocn / dqn-breakout
Play Breakout with DQN in pytorch.
☆11Updated 3 years ago
Alternatives and similar repositories for dqn-breakout:
Users who are interested in dqn-breakout are comparing it to the repositories listed below.
- DQN with PyTorch on Breakout and SpaceInvaders☆25Updated 5 years ago
- A plotter for reinforcement learning (RL)☆217Updated 3 years ago
- Re-implementations of SOTA RL algorithms.☆129Updated last year
- This repository is the official implementation of ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot Coordination. P…☆31Updated last week
- Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms☆149Updated last year
- ☆122Updated 3 years ago
- Implement many Sparse Reward algorithms in Gym Fetch environment☆86Updated 4 years ago
- Must-read papers on Reinforcement Learning (RL)☆41Updated 4 years ago
- RLA is a tool for managing your RL experiments automatically☆71Updated last year
- Assignments of Computer Science courses in SYSU☆29Updated last year
- Code for "On the Utility of Learning about Humans for Human-AI Coordination"☆107Updated last year
- ☆91Updated 4 years ago
- Proceedings of top conferences in 2023 on the topic of Reinforcement Learning (RL), including: AAAI, IJCAI, NeurIPS, ICML, ICLR, ICRA,…☆30Updated last year
- Meta RL codebase for Unstable Baselines☆21Updated 2 years ago
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆140Updated last year
- Benchmarked implementations of Offline Multi-Agent RL Algorithms based on PyMARL codebase.☆23Updated 3 months ago
- ☆97Updated 3 years ago
- Deep RL algorithm implementations that are easy to read and understand, in PyTorch and TensorFlow 2 (DQN, REINFORCE, VPG, A2C, TRPO, PPO, DD…☆326Updated last year
- Google Research Football MARL Benchmark and Research Toolkit☆37Updated 8 months ago
- The official implementation of "ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind" (ICLR 2022).☆58Updated 2 months ago
- Code for Weighted QMIX☆126Updated 4 years ago
- Implements the PPO algorithm on MuJoCo environments such as Ant-v2, Humanoid-v2, Hopper-v2, and HalfCheetah-v2.☆49Updated 4 years ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆85Updated last year
- Official implementation of Neural Episodic Control with State Abstraction☆13Updated last year
- ☆38Updated 2 years ago
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆72Updated 2 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆128Updated last year
- Deep Reinforcement Learning codes for study. Currently, there are only codes for algorithms: DQN, C51, QR-DQN, IQN, QUOTA.☆206Updated last year
- Play the Atari game Breakout with DRL: DQN, Noisy DQN, and A3C☆12Updated 4 years ago