SmallVagetable / reinforcement-learning

☆17

Alternatives and similar repositories for reinforcement-learning:

Users that are interested in reinforcement-learning are comparing it to the libraries listed below

LovelyBuggies / Python_MADDPG_SC2LE
My internship project in 𝖢𝖠𝖲𝖨𝖠. 🤗
☆3Updated 5 years ago
zhaoyingjun / general
Alignment成为GPT类大模型微调的必须环节，深度强化学习是Alignment的核心。本项目是一个支持非gym环境训练、支持可视化配置的深度强化学习应用编程框架，30分钟上手强化学习编程。
☆72Updated last year
Remtasya / Distributional-Multi-Agent-Actor-Critic-Reinforcement-Learning-MADDPG-Tennis-Environment
The state-of-the-art in multi-agent Reinforcement Learning is the MADDPG algorithm which utilises DDPG actor-critic neural networks where…
☆26Updated 5 years ago
lucifer2859 / meta-RL
☆26Updated 4 years ago
onebula / Reinforcement_Learning_in_Action
☆20Updated 6 years ago
adithya-subramanian / Multi_Agent_Soft_Actor_Critic
A Pytorch Implementation of Multi Agent Soft Actor Critic
☆38Updated 6 years ago
DKuan / MADDPG_torch
The code for maddpg using pytorch
☆164Updated 4 years ago
liyiying / meta-MADDPG
meta-MADDPG (Python implementation)
☆18Updated 6 years ago
cardwing / Codes-for-RL-PER
A novel DDPG method with prioritized experience replay (IEEE SMC 2017)
☆49Updated 6 years ago
AndyYue1893 / Hands-On-Reinforcement-Learning-With-Python
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
☆28Updated 5 years ago
Alexander-Nasuta / graph-jsp-env
A gymnasium environment for the job shop problem using the disjunctive graph approach
☆21Updated last month
aurelienbibaut / Actor_CriticPointer_Network-TSP
Tensorflow implementation of an Actor Critic algorithm using a Pointer Network to solve the TSP (algorithm from Neural Combinatorial Opti…
☆43Updated 7 years ago
RvuvuzelaM / self-attention-ppo-pytorch
I used this paper as inspiration https://arxiv.org/pdf/1904.03367.pdf
☆31Updated 2 years ago
xuemei-ye / maddpg-mpe
Transplant a implementation of MADDPG to the environment provided by openAI (multiagent-particle-envs).
☆20Updated 3 years ago
zachary2wave / Torch-rl
☆24Updated 4 years ago
Teacher-Guo / RL_code
RL-code for beginners. Enjoying!
☆110Updated 4 years ago
Abluceli / Multi-agent-Reinforcement-Learning-Algorithms
Multi-agent Reinforcement Learning Algorithms(COMA, VDN, QMIX)
☆13Updated 4 years ago
feidieufo / RL-Implementation
simple code to reinforcement learning
☆19Updated 4 years ago
ling-pan / RES
☆26Updated 2 years ago
cristianoc20 / RL_learning
☆45Updated 5 years ago
ChengTsang / PPO-clip-and-PPO-penalty-on-Atari-Domain
Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty
☆56Updated 6 years ago
ShaniGam / RL-GAN
Transfer Learning for Related Reinforcement Learning Tasks via Image-to-Image Translation
☆49Updated 4 years ago
HaiyinPiao / pytorch-a2clstm-DRQN
using recurrent networks(LSTM) to solve POMDPs
☆35Updated 6 years ago
kazuhirobben / MADQN_for_Global_Routing
☆9Updated 2 years ago
terran6 / mappo-competitive-reinforcement
🎾 Multi-Agent Proximal Policy Optimization approach to a competitive reinforcement learning problem
☆19Updated 2 years ago
Jonathan-Pearce / DDPG_PER
Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)
☆48Updated 4 years ago
haiguanl / RLCO-Papers
Paper collection of reinforcement learning based combinatorial optimization
☆48Updated 3 years ago
haiguanl / DQN_GlobalRouting
Applying Deep Q-learning for Global Routing
☆120Updated 4 years ago
hangsz / reinforcement_learning
[动手学强化学习]系列，基于pytorch。
☆54Updated 3 years ago
livey / scalable_maddpg
scalable multi agents reinforcement learning
☆54Updated 6 years ago