jw1401 / PPO-Tensorflow-2.0

Proximal Policy Optimization with Tensorflow 2.0

☆30

Related projects: ⓘ

Bigpig4396 / PyTorch-Counterfactual-Multi-Agent-Policy-Gradients-COMA
☆70Updated 4 years ago
deligentfool / dqn_zoo
The implement of all kinds of dqn reinforcement learning with Pytorch
☆86Updated 3 years ago
cyoon1729 / Multi-agent-reinforcement-learning
Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG
☆63Updated 5 years ago
namidairo777 / Distributed-MADDPG
Distributed Multi-Agent Cooperation Algorithm based on MADDPG with prioritized batch data.
☆98Updated 3 years ago
wsjeon / maddpg-rllib
MADDPG in Ray/RLlib
☆50Updated 4 years ago
hsvgbkhgbv / SQDDPG
This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled …
☆110Updated 2 years ago
TianhongDai / distributed-ppo
This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).
☆62Updated 6 years ago
011235813 / hierarchical-marl
Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery
☆91Updated 2 years ago
ac-93 / soft-actor-critic
Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.
☆92Updated 4 years ago
ZhongZ-Wang / Model-Based-RL
这是一个关于基于模型的强化学习的资料，包括一些代码地址、paper、slide等。
☆34Updated 4 years ago
alirezakazemipour / Discrete-SAC-PyTorch
PyTorch implementation of discrete version of Soft Actor-Critic.
☆27Updated 3 years ago
Felhof / DiscreteSAC
☆39Updated 2 years ago
matteokarldonati / Counterfactual-Multi-Agent-Policy-Gradients
PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."
☆52Updated 4 years ago
Sonkyunghwan / QTRAN
There will be updates later
☆79Updated 5 years ago
LilTwo / DRL-using-PyTorch
PyTorch implementation of Deep Reinforcement Algorithm
☆30Updated 2 years ago
livey / scalable_maddpg
scalable multi agents reinforcement learning
☆53Updated 6 years ago
adik993 / ppo-pytorch
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
☆128Updated 5 years ago
Jonathan-Pearce / DDPG_PER
Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)
☆44Updated 3 years ago
rpatrik96 / AttA2C
Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning
☆25Updated 4 years ago
KornbergFresnel / CommNet
an implementation of CommNet
☆29Updated 6 years ago
oxwhirl / wqmix
Code for Weighted QMIX
☆119Updated 3 years ago
Bigpig4396 / PyTorch-Deep-Recurrent-Q-Learning-DRQN
☆41Updated 5 years ago
polixir / OfflineRL
A collection of offline reinforcement learning algorithms.
☆153Updated 3 months ago
watakandai / hiro_pytorch
Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)
☆87Updated 3 years ago
011235813 / cm3
Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning
☆54Updated 2 years ago
XinJingHao / PPO-Continuous-Pytorch
A clean and robust Pytorch implementation of PPO on continuous action space.
☆115Updated 3 months ago
nikhilbarhate99 / TD3-PyTorch-BipedalWalker-v2
Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment
☆101Updated 5 years ago
schatty / oprl
A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing
☆133Updated last month
wwxFromTju / maddpg-tf
use tensorflow to implement the MADDPG(simple_tag)
☆17Updated 6 years ago
apourchot / CEM-RL
Combining Evolutionary Algorithms and deep RL in various ways
☆98Updated 3 years ago