Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM) on Pyramid env, Unity ML
☆20Dec 17, 2023Updated 2 years ago
Alternatives and similar repositories for ICM-PPO-implementation
Users that are interested in ICM-PPO-implementation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Intrinsic Curiosity Module (ICM) + PPO on the Pyramid and PushBlock environment.☆12Sep 3, 2019Updated 6 years ago
- Pytorch implementation of intrinsic curiosity module with proximal policy optimization☆55Dec 20, 2018Updated 7 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆147Jan 12, 2019Updated 7 years ago
- Code for Paper "Gradient Informed Proximal Policy Optimization" (NeurIPS 2023)☆27Dec 18, 2023Updated 2 years ago
- An implementation of deep reinforcement learning TD3 algorithm with prioritized experience replay (PER) buffer☆25Aug 14, 2019Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆12Aug 4, 2023Updated 2 years ago
- ☆18Jan 30, 2025Updated last year
- ☆10Sep 13, 2025Updated 8 months ago
- Non-official implementation of paper "In-context Reinforcement Learning with Algorithm Distillation"☆12Aug 15, 2024Updated last year
- Implement reinforcement learning algorithms to realize highway decision making of autonomous vehicles☆12Apr 27, 2023Updated 3 years ago
- ROBOTICS - OBSTACLE AVOIDANCE☆11Dec 5, 2023Updated 2 years ago
- Robot Learning from Expert Demonstration Using IRL☆13Mar 21, 2021Updated 5 years ago
- Decoupled Q-Chunking☆68May 3, 2026Updated 3 weeks ago
- bvh_broadcaster: broadcasting bvh motion capture as tf in ROS☆10Apr 8, 2021Updated 5 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- rl_collision_avoidance test in gazebo simulator☆12Feb 15, 2021Updated 5 years ago
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"☆12Nov 24, 2021Updated 4 years ago
- ☆18Jan 1, 2025Updated last year
- Code and files from a project regarding UAV path planning in a SAR situation. The project was done for the 8th semester of the Operations…☆11Dec 8, 2021Updated 4 years ago
- Author's PyTorch implementation of paper "Provably Good Batch Reinforcement Learning Without Great Exploration"☆11Oct 22, 2020Updated 5 years ago
- multi-agent crafter for cooperative tasks☆13Aug 2, 2025Updated 9 months ago
- Implementation of Neurips 2023 Paper "Multi Time Scale World Models"☆17Nov 8, 2024Updated last year
- Code of Truly Batch Model-Free Inverse Reinforcement Learning about Multiple Intentions☆13May 22, 2023Updated 3 years ago
- Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)"☆11Oct 2, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official Github Repository for "Trust Region-Based Safe Distributional Reinforcement Learning for Multiple Constraints". (NeurIPS 2023)☆20Nov 30, 2025Updated 5 months ago
- This project aims to use a combination of imitation learning and reinforcement learning in order to play Asseto Corsa by learning new pol…☆23Sep 10, 2020Updated 5 years ago
- Extending PRD to MAPPO with soft and semi-hard attention mechanisms☆14May 26, 2022Updated 4 years ago
- Latent World Models For Intrinsically Motivated Exploration | Official repository☆23Apr 28, 2021Updated 5 years ago
- A bachelor thesis project about autonomous car maneuver around roundabout using RL-DQN☆14Nov 26, 2023Updated 2 years ago
- Application of an LSTM-based policy gradient on an RL agent☆15Aug 24, 2022Updated 3 years ago
- Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Worksho…☆21Feb 27, 2023Updated 3 years ago
- Model-based reinforcement learning (generative simulator models and planning agents)☆16Mar 13, 2026Updated 2 months ago
- ☆13Apr 25, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- It simulates Rosbot movement on Gazebo and trains a rainforcement learning model DQN.☆20Apr 5, 2022Updated 4 years ago
- ☆49Apr 22, 2013Updated 13 years ago
- Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.☆82Oct 25, 2020Updated 5 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35May 21, 2024Updated 2 years ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023☆24Dec 29, 2023Updated 2 years ago
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.☆16Sep 24, 2019Updated 6 years ago
- This repository contains the implementation of reinforcement learning algorithms like PPO and A2C, to solve the problem: Dynamic Obstacle…☆19Jan 17, 2022Updated 4 years ago