qiwihui / spinningup
Chinese edition of the OpenAI team's deep reinforcement learning tutorial
☆24Updated 4 years ago
Related projects
Alternatives and complementary repositories for spinningup
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆92Updated 3 years ago
- DSAC; Distributional Soft Actor-Critic☆113Updated 8 months ago
- MARLToolkit: The Multi-Agent Reinforcement Learning Toolkit. Includes implementations of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT…☆109Updated 6 months ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆145Updated 6 months ago
- rl-papers☆42Updated last year
- A collection of resources on model-based reinforcement learning, including code links, papers, slides, and more.☆38Updated 4 years ago
- Reproductions of multi-agent reinforcement learning (MARL) algorithms, including QMIX, VDN, QTRAN, MAVEN, and more☆178Updated 2 years ago
- Code implementation for 'Bi-level Actor-Critic for Multi-agent Coordination' (AAAI 2020)☆55Updated 4 years ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆83Updated last year
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆129Updated 10 months ago
- Code for Weighted QMIX☆123Updated 3 years ago
- A clean and robust PyTorch implementation of PPO for continuous action spaces.☆120Updated 5 months ago
- Code accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks" (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …☆69Updated 11 months ago
- Deep recurrent Q-learning on the CartPole-v1 environment☆73Updated 9 months ago
- Python Implementation of Reinforcement Learning: An Introduction☆28Updated 5 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆82Updated last year
- There will be updates later☆81Updated 5 years ago
- A collection of offline reinforcement learning algorithms.☆158Updated 5 months ago
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)☆64Updated last year
- Implementations of many sparse-reward algorithms in the Gym Fetch environment☆81Updated 4 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆43Updated 3 years ago
- Multi-agent project (CommNet, BiCNet, MADDPG) in PyTorch for the Multi-Agent Particle Environment☆111Updated last year