Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM)
☆18 · Apr 15, 2022 · Updated 4 years ago
Alternatives and similar repositories for icmppo
Users that are interested in icmppo are comparing it to the libraries listed below.
- ☆16 · Dec 23, 2024 · Updated last year
- Code and additional information for our paper entitled 'Scene Augmentation Methods for Interactive Embodied AI Tasks' ☆10 · Apr 25, 2023 · Updated 3 years ago
- The Laser Learning Environment (LLE) is a cooperative MARL grid-world ☆13 · Jun 26, 2026 · Updated last week
- ☆27 · Jul 29, 2025 · Updated 11 months ago
- ☆17 · Oct 18, 2022 · Updated 3 years ago
- ☆27 · Jun 16, 2026 · Updated 2 weeks ago
- Modified versions of the Soft Actor-Critic algorithm for Atari games from https://github.com/ac-93/soft-actor-critic. ☆20 · May 18, 2020 · Updated 6 years ago
- This repository is an implementation of "MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer"… ☆23 · Jul 6, 2023 · Updated 2 years ago
- COOM: Benchmarking Continual Reinforcement Learning on Doom ☆27 · Mar 5, 2026 · Updated 3 months ago
- A minimal example of Abductive Learning ☆19 · Dec 6, 2023 · Updated 2 years ago
- Memory Augmented Neural Networks (Pytorch) ☆14 · Sep 2, 2018 · Updated 7 years ago
- [RAL-25] SIGN: Safety-Aware Image-Goal Navigation for Autonomous Drones via Reinforcement Learning ☆50 · May 13, 2026 · Updated last month
- Logic Reinforcement Learning ☆21 · Oct 20, 2025 · Updated 8 months ago
- Code for simulations in "Computational mechanisms of curiosity and goal-directed exploration" ☆11 · May 22, 2020 · Updated 6 years ago
- A paper list of our recent survey on continual learning, and other useful resources in this field. ☆108 · Feb 21, 2024 · Updated 2 years ago
- DAR introduces the diagonal scanning order for next-token prediction and proposes a direction-aware autoregressive transformer framework. ☆19 · Apr 16, 2025 · Updated last year
- Transcribing long blocks of speech using Watson Speech To Text. ☆11 · Sep 24, 2020 · Updated 5 years ago
- Promoss Topic Modelling Toolbox ☆11 · Jan 21, 2019 · Updated 7 years ago
- Code for "Auxiliary Tasks Speed Up Learning PointGoal Navigation" ☆20 · Nov 27, 2020 · Updated 5 years ago
- ☆28 · Jun 24, 2026 · Updated last week
- ☆11 · Mar 9, 2018 · Updated 8 years ago
- Neural variational inference and learning in undirected graphical models http://www.stanford.edu/~kuleshov/papers/nips2017.pdf ☆17 · Apr 25, 2018 · Updated 8 years ago
- The source code is related to our work- Shreyas Seshadri, Ulpu Remes and Okko Rasanen: "Dirichlet process mixture models for clustering i… ☆10 · Aug 18, 2017 · Updated 8 years ago
- Exploring techniques to generate diverse conventions in multi-agent settings ☆15 · Nov 14, 2023 · Updated 2 years ago
- A project that aims to run on a Raspberry Pi, take voice commands, and answer them using the ChatGPT API ☆19 · May 24, 2023 · Updated 3 years ago
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM) on Pyramid env, Unity ML ☆20 · Dec 17, 2023 · Updated 2 years ago
- Official Implementation of Mamba Adaptive Anomaly Transformer (MAAT) - paper is now available in Engineering Applications of Artificial I… ☆31 · Nov 14, 2025 · Updated 7 months ago
- ☆12 · Mar 25, 2025 · Updated last year
- Code release for the paper "Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control" ☆17 · Apr 9, 2024 · Updated 2 years ago
- Variational Auto-Regressive Gaussian Processes for Continual Learning ☆22 · Jun 15, 2021 · Updated 5 years ago
- ☆13 · Jun 18, 2024 · Updated 2 years ago
- Proximal Policy Optimization (Continuous Version) in PyTorch. ☆28 · May 12, 2025 · Updated last year
- Do the YOLOv5 model inference by OpenCV/OpenVINO based on onnx model format ☆13 · Feb 6, 2023 · Updated 3 years ago
- Navigation agent with Bayesian relational memory in the House3D environment ☆30 · Sep 13, 2019 · Updated 6 years ago
- MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research ☆43 · Jul 14, 2025 · Updated 11 months ago
- ☆11 · Jun 15, 2019 · Updated 7 years ago
- Source code for paper "Trajectory of Alternating Direction Method of Multipliers and Adaptive Acceleration" of NeurIPS 2019 ☆10 · Jan 25, 2024 · Updated 2 years ago
- Emotional First Aid Raw Dataset, 心理咨询问答原始语料库 ☆23 · Mar 6, 2026 · Updated 3 months ago
- ☆27 · Mar 21, 2025 · Updated last year