SmallVagetable / reinforcement-learning
☆17Updated 5 years ago
Alternatives and similar repositories for reinforcement-learning:
Users that are interested in reinforcement-learning are comparing it to the libraries listed below
- My internship project in 𝖢𝖠𝖲𝖨𝖠. 🤗☆3Updated 5 years ago
- Alignment成为GPT类大模型微调的必须环节,深度强化学习是Alignment的核心。本项目是一个支持非gym环境训练、支持可视化配置的深度强化学习应用编程框架,30分钟上手强化学习编程。☆72Updated last year
- The state-of-the-art in multi-agent Reinforcement Learning is the MADDPG algorithm which utilises DDPG actor-critic neural networks where…☆26Updated 5 years ago
- ☆26Updated 4 years ago
- ☆20Updated 6 years ago
- A Pytorch Implementation of Multi Agent Soft Actor Critic☆38Updated 6 years ago
- The code for maddpg using pytorch☆164Updated 4 years ago
- meta-MADDPG (Python implementation)☆18Updated 6 years ago
- A novel DDPG method with prioritized experience replay (IEEE SMC 2017)☆49Updated 6 years ago
- Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow☆28Updated 5 years ago
- A gymnasium environment for the job shop problem using the disjunctive graph approach☆21Updated last month
- Tensorflow implementation of an Actor Critic algorithm using a Pointer Network to solve the TSP (algorithm from Neural Combinatorial Opti…☆43Updated 7 years ago
- I used this paper as inspiration https://arxiv.org/pdf/1904.03367.pdf☆31Updated 2 years ago
- Transplant a implementation of MADDPG to the environment provided by openAI (multiagent-particle-envs).☆20Updated 3 years ago
- ☆24Updated 4 years ago
- RL-code for beginners. Enjoying!☆110Updated 4 years ago
- Multi-agent Reinforcement Learning Algorithms(COMA, VDN, QMIX)☆13Updated 4 years ago
- simple code to reinforcement learning☆19Updated 4 years ago
- ☆26Updated 2 years ago
- ☆45Updated 5 years ago
- Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty☆56Updated 6 years ago
- Transfer Learning for Related Reinforcement Learning Tasks via Image-to-Image Translation☆49Updated 4 years ago
- using recurrent networks(LSTM) to solve POMDPs☆35Updated 6 years ago
- ☆9Updated 2 years ago
- 🎾 Multi-Agent Proximal Policy Optimization approach to a competitive reinforcement learning problem☆19Updated 2 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆48Updated 4 years ago
- Paper collection of reinforcement learning based combinatorial optimization☆48Updated 3 years ago
- Applying Deep Q-learning for Global Routing☆120Updated 4 years ago
- [动手学强化学习]系列,基于pytorch。☆54Updated 3 years ago
- scalable multi agents reinforcement learning☆54Updated 6 years ago