datawhalechina / rl-papers
☆47Updated 2 years ago
Alternatives and similar repositories for rl-papers:
Users who are interested in rl-papers are comparing it to the repositories listed below.
- ☆63Updated last year
- ☆59Updated 3 months ago
- ☆102Updated 2 months ago
- The Code for Paper “Relay Hindsight Experience Replay: Self-Guided Continual Reinforcement Learning for Sequential Object Manipulation Ta…☆150Updated 9 months ago
- ☆41Updated last month
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆55Updated last year
- Transformer in RL for decision-making☆97Updated 2 years ago
- An easier PyTorch deep reinforcement learning library.☆207Updated 4 months ago
- ☆165Updated last year
- ☆12Updated 2 years ago
- NeurIPS 2024 DACER☆103Updated this week
- Chinese edition of the OpenAI team's deep reinforcement learning tutorial☆29Updated 4 years ago
- Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO).☆83Updated last year
- Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models"☆69Updated 3 months ago
- ☆23Updated 2 years ago
- ☆123Updated 3 years ago
- Introductions to deep reinforcement learning algorithms with PyTorch implementations☆53Updated 9 months ago
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te…☆166Updated last year
- ☆42Updated 3 years ago
- ☆90Updated 2 years ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆61Updated 10 months ago
- Solve BipedalWalkerHardcore-v2 with TD3☆88Updated last year
- Source Code☆182Updated last year
- DSAC; Distributional Soft Actor-Critic☆125Updated 2 months ago
- Python Implementation of Reinforcement Learning: An Introduction☆30Updated 5 years ago
- MARLToolkit: The Multi-Agent Reinforcement Learning Toolkit. Includes implementations of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆128Updated last year
- Adversarial Reinforcement Learning papers (single-agent setting and multi-agent setting)☆70Updated 2 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆169Updated last year
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆107Updated 3 years ago
- Implementations of reinforcement learning algorithms in PyTorch☆33Updated 3 years ago