Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM) on Pyramid env, Unity ML
☆20 · Dec 17, 2023 · Updated 2 years ago
Alternatives and similar repositories for ICM-PPO-implementation
Users that are interested in ICM-PPO-implementation are comparing it to the libraries listed below.
- Intrinsic Curiosity Module (ICM) + PPO on the Pyramid and PushBlock environments. ☆12 · Sep 3, 2019 · Updated 6 years ago
- PyTorch implementation of intrinsic curiosity module with proximal policy optimization ☆55 · Dec 20, 2018 · Updated 7 years ago
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM) ☆148 · Jan 12, 2019 · Updated 7 years ago
- ☆11 · Jan 20, 2025 · Updated last year
- ☆10 · Sep 13, 2025 · Updated 9 months ago
- ROBOTICS - OBSTACLE AVOIDANCE ☆11 · Dec 5, 2023 · Updated 2 years ago
- ☆13 · Jun 1, 2020 · Updated 6 years ago
- Decoupled Q-Chunking ☆72 · May 3, 2026 · Updated last month
- bvh_broadcaster: broadcasting bvh motion capture as tf in ROS ☆10 · Apr 8, 2021 · Updated 5 years ago
- rl_collision_avoidance test in gazebo simulator ☆12 · Feb 15, 2021 · Updated 5 years ago
- Multi-Robot Collision Avoidance using Reinforcement Learning ☆13 · Jun 17, 2018 · Updated 7 years ago
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator" ☆12 · Nov 24, 2021 · Updated 4 years ago
- #UAI2020 Codes for PAC-Bayesian Contrastive Unsupervised Representation Learning ☆14 · May 23, 2022 · Updated 4 years ago
- ☆19 · Jan 1, 2025 · Updated last year
- Code and files from a project regarding UAV path planning in a SAR situation. The project was done for the 8th semester of the Operations… ☆11 · Dec 8, 2021 · Updated 4 years ago
- Author's PyTorch implementation of paper "Provably Good Batch Reinforcement Learning Without Great Exploration" ☆11 · Oct 22, 2020 · Updated 5 years ago
- multi-agent crafter for cooperative tasks ☆13 · Aug 2, 2025 · Updated 10 months ago
- Implementation of NeurIPS 2023 Paper "Multi Time Scale World Models" ☆17 · Nov 8, 2024 · Updated last year
- rao-blackwellized particle filter implemented on grid map ☆18 · Sep 6, 2020 · Updated 5 years ago
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM) ☆18 · Apr 15, 2022 · Updated 4 years ago
- ☆11 · Apr 23, 2024 · Updated 2 years ago
- A simulation of the robot navigation problem in Gymnasium. ☆20 · Jul 12, 2025 · Updated 11 months ago
- Extending PRD to MAPPO with soft and semi-hard attention mechanisms ☆14 · May 26, 2022 · Updated 4 years ago
- Latent World Models For Intrinsically Motivated Exploration | Official repository ☆23 · Apr 28, 2021 · Updated 5 years ago
- Application of an LSTM-based policy gradient on an RL agent ☆15 · Aug 24, 2022 · Updated 3 years ago
- Implementation of Proximal Policy Optimization using Transformer ☆12 · Jul 4, 2023 · Updated 2 years ago
- Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Worksho… ☆21 · Feb 27, 2023 · Updated 3 years ago
- Model-based reinforcement learning (generative simulator models and planning agents) ☆16 · Mar 13, 2026 · Updated 3 months ago
- ☆13 · Apr 25, 2023 · Updated 3 years ago
- Simulates Rosbot movement in Gazebo and trains a DQN reinforcement learning model. ☆20 · Apr 5, 2022 · Updated 4 years ago
- ☆49 · Apr 22, 2013 · Updated 13 years ago
- Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch. ☆82 · Oct 25, 2020 · Updated 5 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according … ☆35 · May 21, 2024 · Updated 2 years ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023 ☆25 · Dec 29, 2023 · Updated 2 years ago
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019. ☆16 · Sep 24, 2019 · Updated 6 years ago
- This repository contains the implementation of reinforcement learning algorithms like PPO and A2C, to solve the problem: Dynamic Obstacle… ☆19 · Jan 17, 2022 · Updated 4 years ago
- A curated list of awesome memory in reinforcement learning research materials ☆24 · Sep 5, 2021 · Updated 4 years ago
- Deep recurrent Q learning on CartPole-v1 environment ☆95 · Jan 15, 2024 · Updated 2 years ago
- PyTorch implementation of Sample Efficient Actor-Critic with Experience Replay(ACER) ☆16 · Oct 7, 2020 · Updated 5 years ago