NiranjanBhujel / Pendulum_PPOLinks
Implementation of Proximal Policy Optimization (PPO) for continuous action space (`Pendulum-v1` from gym) using tensorflow2.x and pytorch.
☆10Updated 3 years ago
Alternatives and similar repositories for Pendulum_PPO
Users that are interested in Pendulum_PPO are comparing it to the libraries listed below
Sorting:
- a clean and robust Pytorch implementation of SAC on continuous action space☆89Updated 9 months ago
- A clean and robust Pytorch implementation of SAC on discrete action space☆42Updated last year
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆54Updated 10 months ago
- Implement reinforcement learning algorithms in Pytorch☆34Updated 4 years ago
- Hybrid Action PPO in stable-baselines3☆15Updated last year
- Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space☆60Updated 3 years ago
- PyTorch implementation of Constrained Reinforcement Learning for Soft Actor Critic Algorithm☆61Updated 3 years ago
- ☆48Updated 3 years ago
- [ICDE 2023] Exploring both Individuality and Cooperation for Air-Ground Spatial Crowdsourcing by Multi-Agent Deep Reinforcement Learning☆24Updated 3 years ago
- implementation of MADDPG using PettingZoo and PyTorch☆163Updated 2 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆218Updated last year
- DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic☆431Updated last month
- Code for our paper: Scalable Multi-Agent Reinforcement Learning through Intelligent Information Aggregation☆141Updated 6 months ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆169Updated last year
- Jax and Torch Multi-Agent SAC on PettingZoo API☆99Updated last year
- Algorithm that combines QMIX with SAC for Multi-Agent Reinforcement Learning.☆57Updated 3 years ago
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆154Updated last year
- my code for paper Parameterized-DQN☆25Updated 4 years ago
- ☆62Updated 5 years ago
- PyTorch implementation of Constrained Policy Optimization☆56Updated 4 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆96Updated 2 years ago
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics…☆92Updated 5 years ago
- ☆55Updated 7 months ago
- ☆20Updated 2 years ago
- ☆106Updated 6 months ago
- Simple and efficient implementation of DQN DDPG TD3 SAC PPO MADDPG MATD3 MASAC MAAC IPPO MAPPO HAPPO MAT MORL☆147Updated last month
- Implementation of the Discrete Soft Actor-Critic algorithm with RNN policy in PyTorch☆26Updated 3 years ago
- demo of multi-agent reinforcement learning algorithms, such as ATT-MADDPG (Modelling the Dynamic Joint Policy of Teammates with Attention…☆64Updated 4 years ago
- DSAC; Distributional Soft Actor-Critic☆136Updated 11 months ago
- Use Multi-agent Twin Delayed Deep Deterministic Policy Gradient(TD3) algorithm to find reasonable paths for ships☆65Updated 3 years ago