CoderAT13 / BipedalWalkerHardcore-SACLinks

BipedalWalker & BipedalWalkerHardcore solved by SAC

☆25

Alternatives and similar repositories for BipedalWalkerHardcore-SAC

Users that are interested in BipedalWalkerHardcore-SAC are comparing it to the libraries listed below

Sorting:

Jonathan-Pearce / DDPG_PER
Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)
☆51Updated 5 months ago
ZhongZ-Wang / Model-Based-RL
这是一个关于基于模型的强化学习的资料，包括一些代码地址、paper、slide等。
☆44Updated 4 years ago
keep9oing / DRQN-Pytorch-CartPole-v1
Deep recurrent Q learning on CartPole-v1 environment
☆91Updated last year
LinghengMeng / LSTM-TD3
The implementation of LSTM-TD3.
☆82Updated 2 years ago
XinJingHao / TD3-BipedalWalkerHardcore-v2
Solve BipedalWalkerHardcore-v2 with TD3
☆91Updated 2 years ago
ZYunfeii / DRL_algorithm_library
This is a reinforcement learning algorithm library. The code takes into account both performance and simplicity, with little dependence.
☆102Updated 3 years ago
XinJingHao / SAC-Continuous-Pytorch
a clean and robust Pytorch implementation of SAC on continuous action space
☆83Updated 3 months ago
chauncygu / Multi-Agent-Constrained-Policy-Optimisation
Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).
☆181Updated last year
Jingliang-Duan / DSAC-v1
DSAC; Distributional Soft Actor-Critic
☆129Updated 5 months ago
ZifanWu / Coordinated-PPO
Code accompanying paper "Coordinated Proximal Policy Optimization"
☆11Updated 3 years ago
watakandai / hiro_pytorch
Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)
☆111Updated 4 years ago
AlgTUDelft / WCSAC
Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"
☆58Updated 2 years ago
rosewang2008 / rmaddpg
ICML 2019 RL for Real Life Workshop: Recurrent MADDPG for Partially Observable and Limited Communication Settings
☆48Updated 5 years ago
BY571 / SAC_discrete
PyTorch implementation of the discrete Soft-Actor-Critic algorithm.
☆53Updated 3 years ago
AgrawalAmey / safe-explorer
Pytorch implementation of "Safe Exploration in Continuous Action Spaces" [Dalal et al.]
☆72Updated 6 years ago
LxzGordon / Deep-Reinforcement-Learning-with-pytorch
Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…
☆91Updated 4 years ago
oxwhirl / facmac
☆101Updated 3 years ago
JohannesAck / MATD3implementation
Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics…
☆86Updated 4 years ago
cardwing / Codes-for-RL-PER
A novel DDPG method with prioritized experience replay (IEEE SMC 2017)
☆50Updated 6 years ago
XinJingHao / PPO-Continuous-Pytorch
A clean and robust Pytorch implementation of PPO on continuous action space.
☆159Updated last year
ffelten / MASAC
Jax and Torch Multi-Agent SAC on PettingZoo API
☆86Updated 8 months ago
ammarhydr / SAC-Lagrangian
PyTorch implementation of Constrained Reinforcement Learning for Soft Actor Critic Algorithm
☆51Updated 3 years ago
akjayant / PPO_Lagrangian_PyTorch
Implementation of PPO Lagrangian in PyTorch
☆50Updated 2 years ago
BY571 / Soft-Actor-Critic-and-Extensions
PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…
☆290Updated 4 years ago
jingranburangyongzhongwen / torchMARL
pytorch实现的一些MARL算法
☆67Updated 4 years ago
Wen2chao / RL-Algorithm
Hello😜
☆31Updated 4 years ago
isp1tze / MAProj
Multi-agent project (commnet, bicnet, maddpg) in pytorch for Multi-Agent Particle Environment
☆117Updated 2 years ago
gxywy / rl-plotter
A plotter for reinforcement learning (RL)
☆227Updated 3 years ago
DKuan / MADDPG_torch
The code for maddpg using pytorch
☆170Updated 4 years ago
dobro12 / CPO
Constrained Policy Optimization implementation on Safety Gym
☆28Updated 3 years ago