YangRui2015 / 2048_envLinks
2048 environment for Reinforcement Learning and DQN algorithm
☆40Updated 3 years ago
Alternatives and similar repositories for 2048_env
Users that are interested in 2048_env are comparing it to the libraries listed below
Sorting:
- ☆38Updated 2 years ago
- ☆165Updated last year
- Implement PPO algorithm on mujoco environment,such as Ant-v2, Humanoid-v2, Hopper-v2, Halfcheeth-v2.☆52Updated 4 years ago
- Implement many Sparse Reward algorithms in Gym Fetch environment☆88Updated 4 years ago
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆74Updated 2 years ago
- RLA is a tool for managing your RL experiments automatically☆71Updated 2 years ago
- OpenAI团队的深度强化学习教程中文版☆29Updated 5 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆27Updated 3 years ago
- ☆34Updated 4 years ago
- DQN to play Atari Pong☆114Updated 6 years ago
- ☆16Updated 2 years ago
- RLlib超参数详解(中文)☆18Updated 3 years ago
- ☆124Updated 3 years ago
- 天授中文文档☆58Updated 6 months ago
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te…☆172Updated last year
- This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).☆62Updated 6 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆131Updated last year
- My DRL library with tensorflow1.14 based on openai spinning-up☆61Updated 4 years ago
- The implement of all kinds of dqn reinforcement learning with Pytorch☆93Updated 4 years ago
- ☆32Updated 2 years ago
- ☆25Updated 2 years ago
- Re-implementations of SOTA RL algorithms.☆133Updated last year
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆51Updated 9 months ago
- A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch.☆26Updated 2 months ago
- Transplant a implementation of MADDPG to the environment provided by openAI (multiagent-particle-envs).☆21Updated 4 years ago
- Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty☆56Updated 6 years ago
- ☆52Updated 6 years ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆86Updated 2 years ago
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆81Updated 2 years ago
- ☆42Updated 2 years ago