qiwihui / spinningupLinks
OpenAI团队的深度强化学习教程中文版
☆29Updated 5 years ago
Alternatives and similar repositories for spinningup
Users that are interested in spinningup are comparing it to the libraries listed below
Sorting:
- ☆165Updated last year
- rl-papers☆47Updated 2 years ago
- ☆124Updated 3 years ago
- RLlib超参数详解(中文)☆18Updated 3 years ago
- DSAC; Distributional Soft Actor-Critic☆129Updated 4 months ago
- Pytorch realization of multiple Deep Reinforcement Learning alogrithms(DQN,DDPG,TD3,PPO,A3C...) with openai gym☆56Updated 3 years ago
- ☆41Updated 3 years ago
- ☆51Updated 3 weeks ago
- Multi-agent project (commnet, bicnet, maddpg) in pytorch for Multi-Agent Particle Environment☆117Updated 2 years ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆55Updated last year
- ☆66Updated last year
- ☆99Updated 3 years ago
- 天授中文文档☆58Updated 6 months ago
- Implement reinforcement learning algorithms in Pytorch☆33Updated 4 years ago
- Python Implementation of Reinforcement Learning: An Introduction☆31Updated 5 years ago
- Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow☆28Updated 5 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆53Updated 3 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆91Updated last year
- DQN examples codes in chapter 4☆43Updated 2 years ago
- ☆209Updated 2 years ago
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆158Updated last year
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆177Updated last year
- 多智能体即时策略对抗方法与实践 苏炯铭 刘鸿福 陈少飞 项凤涛 编著 科学出版社 2019.11 随书代码☆31Updated 4 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆27Updated 3 years ago
- pytorch实现的一些MARL算法☆67Updated 4 years ago
- Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotli…☆132Updated 4 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆90Updated 2 years ago
- Code for "ALMA: Hierarchical Learning for Composite Multi-Agent Tasks" NeurIPS 2022☆29Updated 2 years ago
- TD3 in Pytorch☆34Updated 3 years ago
- ☆42Updated 2 years ago