sunnyswag / RL_notes_and_codes
学习强化学习过程中的笔记和代码
☆9Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for RL_notes_and_codes
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆28Updated 5 years ago
- The implement of all kinds of dqn reinforcement learning with Pytorch☆90Updated 3 years ago
- simple code to reinforcement learning☆19Updated 4 years ago
- Experimenting with meta-learning approaches to opponent modelling in MARL. Building upon previous public implementations of MADDPG and M3…☆15Updated 2 years ago
- ☆14Updated 3 years ago
- Experiments with transformer based RL algorithms☆22Updated 4 years ago
- Trading Robot based on LSTM-PPO☆24Updated 4 years ago
- ☆18Updated 4 years ago
- Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow☆27Updated 5 years ago
- Re-produce DQN, REINFORCE, REINFORCE with baseline, one-step AC, QAC, QAC with shared network, PPO2, DDPG, TD3, SAC, SAC discrete,A2C,A3C☆19Updated 4 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆26Updated 2 years ago
- ☆33Updated 6 years ago
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG☆63Updated 5 years ago
- RLlib超参数详解(中文)☆14Updated 2 years ago
- Decision Transformer: A brand new Offline RL Pattern.☆34Updated 2 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x☆62Updated 3 years ago
- ☆17Updated 2 years ago
- Stable Baselines官方文档中文版☆93Updated 3 years ago
- The implement of GAIL with pytorch☆14Updated 4 years ago
- ☆20Updated 6 years ago
- ☆16Updated 2 years ago
- ☆13Updated 4 years ago
- PyTorch Implementation of Ape-X (Distributed prioritized experience replay) architecture with DQN learner☆27Updated 4 years ago
- Experiments with reinforcement learning and recurrent neural networks☆113Updated last year
- A Multi-agent Learning Framework☆62Updated 3 years ago
- ☆158Updated last year
- Implement many Sparse Reward algorithms in Gym Fetch environment☆82Updated 4 years ago
- code implementation for 'Bi-level Actor-Critic for Multi-agent Coordination'(AAAI2020)☆55Updated 4 years ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)☆54Updated last year
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)☆53Updated last year