bikcrum / ppo_transformerLinks
Implementation of Proximal Policy Optimization using Transformer
☆10Updated 2 years ago
Alternatives and similar repositories for ppo_transformer
Users that are interested in ppo_transformer are comparing it to the libraries listed below
Sorting:
- 深度强化学习各算法介绍与Pytorch实现☆70Updated last year
- PPO, DDPG, SAC implementation on mujoco environment☆119Updated 3 years ago
- ☆87Updated 3 months ago
- ☆105Updated 3 months ago
- Robust and safe deep reinforcement learning algorithms☆15Updated last year
- reinforcement learning algorithm for mapless navigation☆71Updated 4 years ago
- Implementation of Soft Actor-Critic with Hindsight Experience Replay☆20Updated 5 years ago
- ☆54Updated 4 months ago
- A Reinforcement Learning Project using PPO + LSTM☆96Updated 2 years ago
- A Reinforcement Learning Project using PPO + Transformer☆74Updated 2 years ago
- Exploring the performance of Prioritized Experience Replay (PER) with the DDPG+HER scheme on the Fetch Robotics Environemnt☆14Updated 4 years ago
- DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic☆404Updated 3 months ago
- ☆16Updated 3 years ago
- Official Github Repository for "Trust Region-Based Safe Distributional Reinforcement Learning for Multiple Constraints". (NeurIPS 2023)☆19Updated 11 months ago
- a clean and robust Pytorch implementation of SAC on continuous action space☆88Updated 6 months ago
- A simple implementation of Generative Adversarial Imitation Learning with PyTorch☆169Updated 3 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆54Updated 8 months ago
- The Code for Paper “Relay Hindsight Experience Replay: Self-Guided Continual Reinforcement Learning for Sequential Object Manipulation Ta…☆155Updated last year
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆112Updated 4 years ago
- ☆114Updated 2 years ago
- ☆23Updated 2 years ago
- ☆63Updated 3 months ago
- NeurIPS 2023: Safe Policy Optimization: A benchmark repository for safe reinforcement learning algorithms☆378Updated last year
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM) on Pyramid env, Unity ML☆20Updated last year
- Official code of the paper "Multi-Task Reinforcement Learning with Mixture of Orthogonal Experts" at ICLR2024☆34Updated 11 months ago
- Official implementation for the UOF paper (algorithm & environment)☆33Updated 2 years ago
- Code for "Temporal Difference Learning for Model Predictive Control"☆468Updated last year
- Intelligent control algorithm and simulation environment.☆17Updated 5 years ago
- DSAC; Distributional Soft Actor-Critic☆132Updated 8 months ago
- SAC, PPO, A2C implementation on Mujoco environments : Humanoid-v4, Ant-v4, Cheetah-v4 . Includes reward manipulation.☆31Updated 2 months ago