alexanderbaumann99 / PPO-Algorithms
Experiments with the three PPO algorithms (PPO, clipped PPO, PPO with KL penalty) proposed by John Schulman et al. on the 'CartPole-v1' environment.
☆13 · Updated 4 years ago
Alternatives and similar repositories for PPO-Algorithms
Users who are interested in PPO-Algorithms compare it to the repositories listed below.
- (ICML 2023) The official code for RACE: Improve Multi-Agent Reinforcement Learning with Representation Asymmetry and Collaborative Evolut… ☆41 · Updated 2 years ago
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment ☆72 · Updated last year
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning" ☆62 · Updated 2 years ago
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment. ☆75 · Updated 3 years ago
- PyTorch implementation of Constrained Reinforcement Learning for Soft Actor Critic Algorithm ☆59 · Updated 3 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L). ☆207 · Updated last year
- [IEEE Transactions on Intelligent Transportation Systems] Curricular Subgoal for Inverse Reinforcement Learning ☆17 · Updated 2 years ago
- ☆27 · Updated last year
- Code for our paper: Scalable Multi-Agent Reinforcement Learning through Intelligent Information Aggregation ☆137 · Updated 5 months ago
- Implementation of PPO Lagrangian in PyTorch ☆54 · Updated 3 years ago
- Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning ☆58 · Updated 3 years ago
- Pytorch implementation of "Safe Exploration in Continuous Action Spaces" [Dalal et al.] ☆74 · Updated 6 years ago
- Communication-efficient MARL for CACC ☆27 · Updated 2 years ago
- Implementation of centralized training, centralized execution of Soft Actor-Critic (SAC) on a Tennis multiagent Unity environment. ☆40 · Updated 4 years ago
- The official code release of publications in the MARL field from the TJU RL lab. ☆83 · Updated 3 years ago
- jinxinglim / Game-Theoretical-Approaches-in-Multi-Agent-Reinforcement-Learning-Policy-Space-Response-Oracles ☆16 · Updated 6 years ago
- Distributed Heuristic Multi-Agent Path Finding with Communication - ICRA 2021 ☆72 · Updated 4 years ago
- [Neurocomputing, 2023] Personalized Robotic Control via Constrained Multi-Objective Reinforcement Learning ☆28 · Updated last year
- A novel Hierarchical Imitation Learning algorithm based on AIRL. ☆23 · Updated 2 years ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC) ☆71 · Updated 2 years ago
- This is the official repository for the paper "Guided Exploration with Proximal Policy Optimization using a Single Demonstration", https:… ☆19 · Updated 4 years ago
- DSAC; Distributional Soft Actor-Critic ☆134 · Updated 10 months ago
- Deep recurrent Q learning on CartPole-v1 environment ☆96 · Updated last year
- [IEEE T-ASE] [IROS 2022] Unified Automatic Control of Vehicular Systems With Reinforcement Learning ☆42 · Updated 2 years ago
- ☆77 · Updated 2 years ago
- Basic reinforcement learning algorithms. Including: DQN, Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic, DDPG, D… ☆96 · Updated 4 years ago
- ICML 2019 RL for Real Life Workshop: Recurrent MADDPG for Partially Observable and Limited Communication Settings ☆50 · Updated 5 years ago
- Code for paper Feasible Actor-Critic: Constrained Reinforcement Learning for Ensuring Statewise Safety. ☆20 · Updated 3 years ago
- A pytorch implementation of Constrained Reinforcement Learning Algorithm, including Constrained Soft Actor Critic (Soft Actor Critic Lagr… ☆44 · Updated 2 years ago
- RLToolkit is a flexible and high-efficient reinforcement learning framework. Include implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG,… ☆17 · Updated 2 years ago