Proximal policy optimization in PyTorch. Easy to read and understand.
☆51 · Oct 30, 2020 · Updated 5 years ago
Alternatives and similar repositories for ppo-pytorch
Users that are interested in ppo-pytorch are comparing it to the libraries listed below.
- Pytorch implementation of intrinsic curiosity module with proximal policy optimization ☆55 · Dec 20, 2018 · Updated 7 years ago
- Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch ☆2,356 · Jul 9, 2024 · Updated last year
- ☆20 · Apr 10, 2018 · Updated 8 years ago
- Simple example of DQN for Unity using Keras ☆13 · Dec 22, 2018 · Updated 7 years ago
- Code for Generalization Guarantees for (Multi-Modal) Imitation Learning ☆11 · Jul 14, 2022 · Updated 3 years ago
- Deep RL agents with PyTorch ☆35 · Sep 25, 2021 · Updated 4 years ago
- Random Network Distillation(RND) algo in Pytorch ☆51 · Feb 26, 2019 · Updated 7 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM) ☆148 · Jan 12, 2019 · Updated 7 years ago
- Simple change of a3c to a2c ☆15 · Jun 18, 2017 · Updated 8 years ago
- ☆10 · Aug 17, 2018 · Updated 7 years ago
- Building an Intrusion detection system using KDD Cup 99 Dataset ☆14 · May 11, 2020 · Updated 6 years ago
- Pytorch implementation of DreamerV2: MASTERING ATARI WITH DISCRETE WORLD MODELS ☆53 · Jan 28, 2022 · Updated 4 years ago
- Simulink Reference example for modeling smart trucks with the intelligence to form a platoon based on certain criteria. ☆26 · Nov 26, 2019 · Updated 6 years ago
- Actor-critic trained with PPO on OpenAI's Procgen Benchmark (PyTorch). Built from scratch. ☆99 · Feb 1, 2020 · Updated 6 years ago
- An OpenAI Gym interface to The Legend of Zelda on the NES. ☆26 · May 18, 2026 · Updated 3 weeks ago
- ☆28 · Dec 19, 2022 · Updated 3 years ago
- This repository contains the source code for a PyTorch implementation of PPO for solving OpenAI Gym environments. ☆21 · Oct 9, 2020 · Updated 5 years ago
- A SUMO simulator for platooning maneuvers in mixed traffic scenarios ☆33 · Mar 6, 2019 · Updated 7 years ago
- Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning / ICLR 2020 ☆56 · Apr 27, 2020 · Updated 6 years ago
- Bidirectionally-Coordinated Net Implements with PyTorch 1.0 ☆15 · Apr 10, 2019 · Updated 7 years ago
- code for paper: [1] M. Hu, X. Wang, Y. Bian, D. Cao, and H. Wang, “Disturbance Observer-Based Cooperative Control of Vehicle Platoons Sub… ☆35 · Jan 28, 2024 · Updated 2 years ago
- A traffic control system which counts the number of vehicles and detects passage of any emergency vehicle. This will help in dynamic traf… ☆11 · Apr 13, 2017 · Updated 9 years ago
- Wrapper for OpenAI Retro envs for parallel execution ☆27 · Dec 22, 2018 · Updated 7 years ago
- Implementation of PPO in Pytorch ☆41 · Dec 6, 2017 · Updated 8 years ago
- Modified versions of the Soft Actor-Critic algorithm for Atari games from https://github.com/ac-93/soft-actor-critic. ☆20 · May 18, 2020 · Updated 6 years ago
- Development and evaluation of different approaches for fibre tracking of diffusion weighted MRI data. ☆10 · May 9, 2022 · Updated 4 years ago
- Part of a research scholarship. I built a basic 2d driving sim with simulated lidar data to train Deep Q Neural Network. So far after abo… ☆11 · Feb 15, 2017 · Updated 9 years ago
- Code for "Auxiliary Tasks Speed Up Learning PointGoal Navigation" ☆20 · Nov 27, 2020 · Updated 5 years ago
- PyTorch Implementation for the paper "Let Me Help You! Neuro-Symbolic Short-Context Action Anticipation" accepted to RA-L'24. ☆12 · Nov 27, 2024 · Updated last year
- PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinfor… ☆3,900 · May 29, 2022 · Updated 4 years ago
- Minimalistic implementation of Vanilla Policy Gradient with PyTorch ☆18 · Jun 18, 2019 · Updated 6 years ago
- This is pytorch version of maddpg. ☆10 · Jun 23, 2020 · Updated 5 years ago
- ☆12 · Dec 26, 2024 · Updated last year
- Sumo OSM short usage tutorial ☆15 · Feb 7, 2018 · Updated 8 years ago
- ☆63 · Jun 22, 2018 · Updated 7 years ago
- ☆14 · Mar 21, 2021 · Updated 5 years ago
- ☆10 · Apr 24, 2021 · Updated 5 years ago
- A simple baseline for mountain-car @ gym ☆12 · Jan 15, 2020 · Updated 6 years ago
- My implementation of the Proximal Policy Optimisation algorithm using Keras as a backend ☆88 · Nov 15, 2019 · Updated 6 years ago