lukeluocn / dqn-breakout
Play Breakout with DQN in pytorch.
☆11Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for dqn-breakout
- DQN with pytorch with on Breakout and SpaceInvaders☆25Updated 5 years ago
- PPO, DDPG, SAC implementation on mujoco environment☆90Updated 2 years ago
- Implement PPO algorithm on mujoco environment,such as Ant-v2, Humanoid-v2, Hopper-v2, Halfcheeth-v2.☆50Updated 4 years ago
- A plotter for reinforcement learning (RL)☆207Updated 2 years ago
- DQN to play Atari Pong☆111Updated 5 years ago
- ☆118Updated 3 months ago
- Code for "On the Utility of Learning about Humans for Human-AI Coordination"☆107Updated last year
- ☆97Updated 3 years ago
- Implement many Sparse Reward algorithms in Gym Fetch environment☆82Updated 4 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆95Updated 3 years ago
- ☆158Updated last year
- Paper Collection for Imitation Learning in RL.☆134Updated 2 years ago
- A collection of offline reinforcement learning algorithms.☆157Updated 5 months ago
- Code for the paper "Meta-Q-Learning"( ICLR 2020)☆102Updated 2 years ago
- ☆186Updated last year
- Play Atari(Breakout) Game by DRL - DQN, Noisy DQN and A3C☆13Updated 4 years ago
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆129Updated 10 months ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆117Updated 3 months ago
- Codes accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …☆69Updated last year
- Reinforcement Learning | tensorflow implementation of DQN, Dueling DQN and Double DQN performed on Atari Breakout☆88Updated 6 years ago
- The code for maddpg using pytorch☆162Updated 4 years ago
- Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms☆139Updated last year
- The offcial implementation of "ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind" (ICLR 2022) .☆55Updated 2 weeks ago
- Pytorch solutions for UC Berkeley's cs285 assignments☆121Updated 2 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆127Updated last year
- ☆88Updated 4 years ago
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆81Updated last year
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆84Updated last year
- This is the official implementation of Multi-Agent PPO.☆93Updated last year
- Tutorial for Reinforcement Learning☆172Updated 2 years ago