PyTorch implementation of PPO algorithm
☆22Dec 26, 2019Updated 6 years ago
Alternatives and similar repositories for PyTorch-PPO
Users that are interested in PyTorch-PPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Nov 12, 2023Updated 2 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019☆10Mar 24, 2023Updated 3 years ago
- [RAL 2023] transformer + reinforcement learning for navigation + POMPD☆15Jul 19, 2023Updated 2 years ago
- ☆17Sep 23, 2022Updated 3 years ago
- Implementation of papers in 101 lines of code.☆18Nov 12, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Windows 💻 RobustVideoMatting with ONNXRuntime/MNN/TNN C++/Python☆12Mar 10, 2022Updated 4 years ago
- The source code of team 🥇Schaferct in 2nd Bandwidth Prediction of MMSys'24.☆15May 13, 2024Updated last year
- ICRA 2024☆16Mar 13, 2024Updated 2 years ago
- Electroplating simulation environment☆20Sep 26, 2024Updated last year
- Reinforcement learning for deep brain stimulation (DBS) modeling☆26Feb 17, 2022Updated 4 years ago
- A simple implementation of the LRFU cache eviction policy in Python.☆10Feb 1, 2015Updated 11 years ago
- We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting☆12Mar 9, 2018Updated 8 years ago
- Submission code of UEFDRL team to NeurIPS 2019 MineRL challenge (5th place)☆13Nov 13, 2020Updated 5 years ago
- This repository contains the code for paper Li, Ran, et al. "Decision-oriented learning for future power system decision-making under unc…☆28Apr 13, 2025Updated 11 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Describing How to Enable OpenVINO Execution Provider for ONNX Runtime☆20Jun 29, 2020Updated 5 years ago
- Source code for the paper titled: "Unlocking the full potential of smart charging: Addressing paused and delayed charging problems in ele…☆11May 22, 2024Updated last year
- Reward Guided Latent Consistency Distillation☆27Oct 9, 2024Updated last year
- ☆18Jul 7, 2020Updated 5 years ago
- Wolf Pack Algorithm☆15May 16, 2021Updated 4 years ago
- Code and Data for Real-time Human-Centric Segmentation for Complex Video Scenes☆17Feb 8, 2024Updated 2 years ago
- Backup repo for "MD-VQA: Multi-Dimensional Quality Assessment for UGC Live Videos"☆14Feb 16, 2024Updated 2 years ago
- The goal of this project is to develop a program for planetary soft landings using lossless convexification of non convex control bounds.☆12Mar 25, 2022Updated 4 years ago
- Companion code to the paper "Transient Stability of Droop-Controlled Inverter Networks with Operating Constraints".☆10Aug 27, 2020Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- This is the code project for optimal bidding staregy of virtual poewr plant considering uncertainty with IGDT and DRO☆12Mar 18, 2024Updated 2 years ago
- ☆11Nov 29, 2021Updated 4 years ago
- Fast Online Adaptive Neural MPC via Meta-Learning☆25Feb 23, 2026Updated last month
- Code implementation of “Flexible Coordination of Wind Generators and Energy Storages in joint Energy and Frequency Regulation Market“☆11Sep 26, 2023Updated 2 years ago
- ☆10Mar 31, 2021Updated 4 years ago
- Program used to control and configure some of the ENSTA Bretagne UGVs, USVs, UUVs, UAVs used in WRSC, SAUC-E and euRathlon/ERL competitio…☆11Dec 13, 2025Updated 3 months ago
- A multi-robot version of the ROS explore package☆15May 20, 2020Updated 5 years ago
- IPython notebooks that illustrate the Pyomo optimization modeling software☆15Aug 5, 2015Updated 10 years ago
- M.Sc Thesis: Robotic Navigation under Partial Observability with Actor-Critic Methods DDPG, SAC, PPO. Environment, Lidar, and Kinematic M…☆20Jun 9, 2025Updated 9 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Application to test inference frameworks for Android☆17Nov 28, 2022Updated 3 years ago
- 基于Eigen运算库的深度学习框架(支持CUDA加速)☆18Jan 12, 2022Updated 4 years ago
- ☆16May 11, 2017Updated 8 years ago
- DistFlow Safe Reinforcement Learning Algorithm for Voltage Magnitude Regulation in Distribution Networks☆13Jul 9, 2025Updated 8 months ago
- ☆18May 21, 2016Updated 9 years ago
- Simulation and analysis of a typical European electricity distribution network with variations in EV adoption and PV penetration rate☆11Jul 19, 2020Updated 5 years ago
- TaskMet Task-driven Metric Learning for Model Learning☆19Feb 9, 2024Updated 2 years ago