wetliu / dqn_pytorch
DQN with PyTorch on Breakout and SpaceInvaders
☆25Updated 5 years ago
Related projects
Alternatives and complementary repositories for dqn_pytorch
- Play Breakout with DQN in pytorch.☆11Updated 2 years ago
- Code accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks" (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …☆69Updated last year
- ☆186Updated last year
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆95Updated 3 years ago
- PPO, DDPG, and SAC implementations on MuJoCo environments☆90Updated 2 years ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆84Updated last year
- This is the official implementation of Multi-Agent PPO.☆93Updated last year
- Implements the PPO algorithm on MuJoCo environments such as Ant-v2, Humanoid-v2, Hopper-v2, and HalfCheetah-v2.☆50Updated 4 years ago
- Implements many sparse-reward algorithms in the Gym Fetch environments☆82Updated 4 years ago
- ☆88Updated 4 years ago
- A plotter for reinforcement learning (RL)☆207Updated 2 years ago
- ☆90Updated 3 years ago
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te…☆161Updated last year
- Code for Weighted QMIX☆123Updated 4 years ago
- An implementation of the policy gradient RL algorithm in PyTorch☆37Updated 3 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆117Updated 3 months ago
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆129Updated 10 months ago
- DQN to play Atari Pong☆111Updated 5 years ago
- A collection of offline reinforcement learning algorithms.☆157Updated 5 months ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆148Updated 4 months ago
- Model-based Offline Policy Optimization, re-implemented entirely in PyTorch☆27Updated last year
- Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotli…☆130Updated 3 years ago
- Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms☆139Updated last year
- Code accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆81Updated last year
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆123Updated 6 months ago
- ☆158Updated last year
- Reinforcement Learning | TensorFlow implementation of DQN, Dueling DQN and Double DQN performed on Atari Breakout☆88Updated 6 years ago
- The official implementation of "ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind" (ICLR 2022).☆55Updated 2 weeks ago
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO☆155Updated 2 years ago
- Multi-agent project (commnet, bicnet, maddpg) in pytorch for Multi-Agent Particle Environment☆111Updated last year