Pytorch implementation of intrinsic curiosity module with proximal policy optimization
☆55Dec 20, 2018Updated 7 years ago
Alternatives and similar repositories for pytorch_ppo_rl
Users that are interested in pytorch_ppo_rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆147Jan 12, 2019Updated 7 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM) on Pyramid env, Unity ML☆20Dec 17, 2023Updated 2 years ago
- The implement of the policy gradient RL algorithm with pytorch☆41Dec 7, 2020Updated 5 years ago
- A repository for implementation of deep reinforcement learning lectured at Samsung☆110Sep 20, 2021Updated 4 years ago
- ☆69Nov 30, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Proximal policy optimization in PyTorch. Easy to read and understand.☆51Oct 30, 2020Updated 5 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Jun 19, 2019Updated 6 years ago
- Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.☆82Oct 25, 2020Updated 5 years ago
- ☆40Jul 29, 2019Updated 6 years ago
- Sumo OSM short usage tutorial☆15Feb 7, 2018Updated 8 years ago
- ☆12Jan 3, 2022Updated 4 years ago
- ☆49Apr 15, 2019Updated 7 years ago
- ☆22Oct 14, 2019Updated 6 years ago
- Offline Multi-Agent Reinforcement Learning Implementations: Solving Overcooked Game with Data-Driven Method☆47Sep 11, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- code for "Data Might be Enough: Bridge Real-World Traffic Signal Control Using Offline Reinforcement Learning"☆11May 2, 2024Updated 2 years ago
- Reproduction of Curiosity-driven Exploration by Self-supervised Prediction in PyTorch☆13Jun 10, 2019Updated 6 years ago
- A reinforcement learning based behaviour planner for autonomous driving agents☆20Jan 8, 2021Updated 5 years ago
- ☆14Nov 24, 2022Updated 3 years ago
- RUDDER: Return Decomposition for Delayed Rewards☆48Sep 17, 2020Updated 5 years ago
- [NeurIPS 2022] Compositional Generalization in Unsupervised Compositional Representation Learning: A Study on Disentanglement and Emergen…☆13Oct 7, 2022Updated 3 years ago
- TensorFlow KR에 소개된 reddit 글 구현☆11Sep 26, 2018Updated 7 years ago
- Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO)☆371Aug 1, 2019Updated 6 years ago
- This study is to investigate the optimal control strategies at crosswalks using traffic signal controllers. A multi-agent reinforcement l…☆12Jan 3, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- IQL, QMIX, VDN, COMA, QTRAN (QTRAN-Base and QTRAN-Alt), MAVEN, CommNet, DYMA-Cl, G2ANet, and MADDPG☆19Dec 6, 2021Updated 4 years ago
- MSc Informatics dissertation project - University of Edinburgh: Curiosity in Multi-Agent Reinforcement Learning☆13Aug 16, 2019Updated 6 years ago
- Basic reinforcement learning implementation with tensorflow version 2.0☆52Apr 20, 2020Updated 6 years ago
- codes for paper 《Neighborhood Cooperative Multiagent Reinforcement Learning for Adaptive Traffic Signal Control in Epidemic Regions》☆14Apr 3, 2022Updated 4 years ago
- clear single-file JAX implementations of common RL algorithms☆15Sep 5, 2021Updated 4 years ago
- ppo+action mask for atari tennis agent☆12Mar 2, 2023Updated 3 years ago
- Deep RL agents with PyTorch☆36Sep 25, 2021Updated 4 years ago
- A Test-Implementation of the IMPALA algorithm (by deepmind 2018)☆35Mar 16, 2018Updated 8 years ago
- ☆20Jan 22, 2020Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- An Implementation of Recurrent Experience Replay in Distributed Reinforcement Learning (Kapturowski et al. 2019) in PyTorch☆54Jul 19, 2022Updated 3 years ago
- PyTorch implementation of R2D2 (Recurrent Replay Distributed DPG (not DQN))☆14Mar 22, 2019Updated 7 years ago
- Reimplementation of Policy Optimization with Demonstrations (POfD) from ICML 2018.☆16Jun 5, 2019Updated 6 years ago
- PyTorch implementation of deep reinforcement learning algorithms☆488Nov 19, 2021Updated 4 years ago
- Distributed Priortized Experience Replay☆10Aug 8, 2018Updated 7 years ago
- Structural implementation of RL key algorithms☆517Apr 8, 2023Updated 3 years ago
- Proximal Policy Optimization with Stein Control Variates:☆33Feb 12, 2018Updated 8 years ago