Proximal policy optimization in PyTorch. Easy to read and understand.
☆51Oct 30, 2020Updated 5 years ago
Alternatives and similar repositories for ppo-pytorch
Users that are interested in ppo-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch☆2,343Jul 9, 2024Updated last year
- Trading Robot based on LSTM-PPO☆29Dec 27, 2019Updated 6 years ago
- ☆20Apr 10, 2018Updated 8 years ago
- Simple example of DQN for Unity using Keras☆13Dec 22, 2018Updated 7 years ago
- Deep RL agents with PyTorch☆36Sep 25, 2021Updated 4 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Implementation of modular composition network from https://arxiv.org/pdf/1711.11289.pdf☆25Dec 30, 2017Updated 8 years ago
- Random Network Distillation(RND) algo in Pytorch☆51Feb 26, 2019Updated 7 years ago
- Simple change of a3c to a2c☆15Jun 18, 2017Updated 8 years ago
- ☆10Aug 17, 2018Updated 7 years ago
- Pytorch implementation of DreamerV2: MASTERING ATARI WITH DISCRETE WORLD MODELS☆53Jan 28, 2022Updated 4 years ago
- Simulink Reference example for modeling smart trucks with the intelligence to form a platoon based on certain criteria.☆26Nov 26, 2019Updated 6 years ago
- PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)☆256May 3, 2020Updated 6 years ago
- Actor-critic trained w PPO on OpenAI's Procgen Benchmark (PyTorch). Built from scratch.☆99Feb 1, 2020Updated 6 years ago
- An OpenAI Gym interface to The Legend of Zelda on the NES.☆26Jul 7, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆28Dec 19, 2022Updated 3 years ago
- This is MPE-pytorch, fix some bugs.☆11Apr 26, 2020Updated 6 years ago
- A SUMO simulator for platooning maneuvers in mixed traffic scenarios☆33Mar 6, 2019Updated 7 years ago
- Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning / ICLR 2020☆56Apr 27, 2020Updated 6 years ago
- Official Repository for Can Language Models be Instructed to Protect Personal Information?☆13Oct 8, 2023Updated 2 years ago
- Bidirectionally-Coordinated Net Implements with PyTorch 1.0☆15Apr 10, 2019Updated 7 years ago
- code for paper: [1] M. Hu, X. Wang, Y. Bian, D. Cao, and H. Wang, “Disturbance Observer-Based Cooperative Control of Vehicle Platoons Sub…☆34Jan 28, 2024Updated 2 years ago
- A traffic control system which counts the number of vehicles and detects passage of any emergency vehicle. This will help in dynamic traf…☆11Apr 13, 2017Updated 9 years ago
- Implementation of PPO in Pytorch☆41Dec 6, 2017Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Modified versions of the Soft Actor-Critic algorithm for Atari games from https://github.com/ac-93/soft-actor-critic.☆20May 18, 2020Updated 5 years ago
- Part of a research scholarship. I built a basic 2d driving sim with simulated lidar data to train Deep Q Neural Network. So far after abo…☆11Feb 15, 2017Updated 9 years ago
- Code for "Auxiliary Tasks Speed Up Learning PointGoal Navigation"☆19Nov 27, 2020Updated 5 years ago
- Curiosity-driven Exploration by Self-supervised Prediction☆147Mar 12, 2023Updated 3 years ago
- PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinfor…☆3,898May 29, 2022Updated 3 years ago
- Minimalistic implementation of Vanilla Policy Gradient with PyTorch☆18Jun 18, 2019Updated 6 years ago
- This is pytorch version of maddpg.☆10Jun 23, 2020Updated 5 years ago
- A multi-task deep reinforcement learning model for trading futures contracts using the Interactive Brokers API and TensorFlow☆15Feb 8, 2023Updated 3 years ago
- Example implemention of the Proximal Policy Optimization algorithm☆17Jul 25, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆14Mar 21, 2021Updated 5 years ago
- ☆10Apr 24, 2021Updated 5 years ago
- A simple baseline for mountain-car @ gym☆12Jan 15, 2020Updated 6 years ago
- My implementation of the Proximal Policy Optisation algorithm using Keras as a backend☆88Nov 15, 2019Updated 6 years ago
- Semantic-Aware Fine-Grained Correspondence, at ECCV 2022 (Oral)☆14Oct 29, 2022Updated 3 years ago
- Online Resource Repository: Datasets, Simulation Platforms, and Empirical Research on Emerging Mixed Traffic of Automated Vehicles and Hu…☆16Nov 29, 2023Updated 2 years ago
- similarity between graph nodes based on local information with PySpark☆10Sep 30, 2022Updated 3 years ago