A clean and robust Pytorch implementation of PPO on Discrete action space
☆72 · Jun 8, 2024 · Updated last year
Alternatives and similar repositories for PPO-Discrete-Pytorch
Users that are interested in PPO-Discrete-Pytorch are comparing it to the libraries listed below.
- A clean and robust Pytorch implementation of TD3 on continuous action space ☆32 · Jun 8, 2024 · Updated last year
- Actor-Sharer-Learner training framework for off-policy DRL algorithms ☆22 · Dec 29, 2024 · Updated last year
- A clean and robust Pytorch implementation of SAC on discrete action space ☆43 · Oct 23, 2024 · Updated last year
- A clean and robust Pytorch implementation of PPO on continuous action space. ☆175 · Jun 8, 2024 · Updated last year
- A clean and robust Pytorch implementation of SAC on continuous action space ☆94 · Apr 13, 2025 · Updated last year
- Using deep reinforcement learning to play the Snake game. The algorithm used is PPO for discrete action spaces. It has the brilliant performance in the fi… ☆34 · Nov 3, 2025 · Updated 6 months ago
- ☆40 · Nov 17, 2021 · Updated 4 years ago
- Concise PyTorch implementations of DRL algorithms, including REINFORCE, A2C, DQN, PPO (discrete and continuous), DDPG, TD3, SAC. ☆1,475 · Mar 29, 2023 · Updated 3 years ago
- Explainability of Deep RL algorithms using graph networks and layer-wise relevance propagation. ☆12 · Aug 20, 2024 · Updated last year
- Solve the pursuit-evasion problem with multi-agent deep reinforcement learning ☆13 · Sep 9, 2020 · Updated 5 years ago
- PyTorch implementation of the Q-Learning Algorithm Normalized Advantage Function for continuous control problems + PER and N-step Method ☆28 · Feb 16, 2021 · Updated 5 years ago
- Implementation of some algorithms for text clustering ☆14 · Sep 5, 2018 · Updated 7 years ago
- Here is our algorithm for Pursuit Problem based on the Distributed Reinforcement Learning for Cooperative Multi-robot Pursuit ☆10 · Apr 17, 2019 · Updated 7 years ago
- This repository contains the Python implementation of our submitted paper titled "Deep Reinforcement Learning for Joint Trajectory and Co… ☆15 · Jun 29, 2024 · Updated last year
- Seq2seq model with attention mechanism for QA and NMT (plus text generation using an LSTM network) ☆13 · May 24, 2019 · Updated 7 years ago
- ☆11 · Mar 28, 2026 · Updated 2 months ago
- A clean Pytorch implementation of DDPG on continuous action space. ☆31 · Jun 8, 2024 · Updated last year
- Reinforcement Learning based Condition-oriented Maintenance Scheduling for Flow Line Systems ☆13 · Sep 30, 2021 · Updated 4 years ago
- Collection of OpenAI parametrized action-space environments. ☆69 · Mar 19, 2025 · Updated last year
- Generalized Proximal Policy Optimization with Sample Reuse (GePPO) ☆29 · Jul 24, 2023 · Updated 2 years ago
- ☆22 · Mar 7, 2021 · Updated 5 years ago
- A decentralized and privacy preserving Mobile Crowdsensing system based on Blockchain Oracles. ☆10 · May 23, 2021 · Updated 5 years ago
- Material associated with Physics Report "Data science applications to string theory" ☆12 · Jun 20, 2023 · Updated 2 years ago
- The visualization of a multi-agent reinforcement learning (MARL)-based strategy with efficient exploration strategy. ☆20 · Oct 28, 2022 · Updated 3 years ago
- PyTorch implementation of discrete version of Soft Actor-Critic. ☆37 · Sep 19, 2021 · Updated 4 years ago
- ☆11 · Apr 26, 2021 · Updated 5 years ago
- Master's thesis project at the University of Electronic Science and Technology of China ☆10 · Jun 4, 2019 · Updated 6 years ago
- A PyTorch implementation of "Importance Weighted Actor-Learner Architectures" https://arxiv.org/abs/1802.01561 ☆12 · Jan 6, 2021 · Updated 5 years ago
- ☆12 · Apr 17, 2023 · Updated 3 years ago
- ☆10 · Apr 2, 2023 · Updated 3 years ago
- An AI agent that uses Deep Q-Networks and the DDPG algorithm to learn trajectory optimization in a customized gym environment. ☆13 · Oct 30, 2021 · Updated 4 years ago
- ☆12 · Apr 26, 2023 · Updated 3 years ago
- Trust Management for Vehicular Networks ☆11 · Aug 6, 2025 · Updated 9 months ago
- Multi-workflow scheduling ☆15 · Dec 30, 2021 · Updated 4 years ago
- ☆15 · Mar 26, 2024 · Updated 2 years ago
- A Simple RealTime PathFinding Robot Based on Implementation of DQN Algorithm on Xilinx Zynq ARM Cortex-A Hard Processor (My B.Sc. Thesi… ☆12 · Dec 7, 2019 · Updated 6 years ago
- Solve BipedalWalkerHardcore-v2 with TD3 ☆99 · May 21, 2023 · Updated 3 years ago
- ☆10 · Dec 10, 2021 · Updated 4 years ago
- Blockchain Based Approach for Trust Management in Intelligent Transportation Systems with Smart Contracts ☆13 · Jul 19, 2022 · Updated 3 years ago