Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
☆18Apr 15, 2022Updated 4 years ago
Alternatives and similar repositories for icmppo
Users that are interested in icmppo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16Dec 23, 2024Updated last year
- A Continual Learning Library in PyTorch and JAX☆13Apr 18, 2023Updated 3 years ago
- Official codebase for "TAU-106K: A New Dataset for Comprehensive Understanding of Traffic Accident"☆20Apr 19, 2025Updated last year
- [TIV 2024] PolarPoint-BEV: Bird-Eye-View Perception in Polar Points for Explainable End-to-End Autonomous Driving☆23Mar 28, 2026Updated last month
- This is the official implementation of WiseAD.☆26Apr 22, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Model-based Policy Gradients☆32Mar 12, 2020Updated 6 years ago
- ☆27Jul 29, 2025Updated 9 months ago
- ☆17Oct 18, 2022Updated 3 years ago
- ☆19Jan 9, 2025Updated last year
- ☆24May 12, 2025Updated 11 months ago
- Modified versions of the Soft Actor-Critic algorithm for Atari games from https://github.com/ac-93/soft-actor-critic.☆20May 18, 2020Updated 5 years ago
- ☆13Dec 12, 2022Updated 3 years ago
- This repository is an implementation of "MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer"…☆23Jul 6, 2023Updated 2 years ago
- COOM: Benchmarking Continual Reinforcement Learning on Doom☆24Mar 5, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Memory Augmented Neural Networks (Pytorch)☆14Sep 2, 2018Updated 7 years ago
- Code for simulations in "Computational mechanisms of curiosity and goal-directed exploration"☆11May 22, 2020Updated 5 years ago
- Pytorch Implementation of Deepmind's 'Hybrid computing using a neural network with dynamic external memory' (Differentiable Neural Comput…☆20Dec 9, 2017Updated 8 years ago
- DAR introduces the diagonal scanning order for next-token prediction and proposes a direction-aware autoregressive transformer framework.☆19Apr 16, 2025Updated last year
- Transcribing long blocks of speech using Watson Speech To Text.☆11Sep 24, 2020Updated 5 years ago
- ☆24Apr 21, 2026Updated last week
- ☆11Mar 9, 2018Updated 8 years ago
- Leveraging BEV Maps in LLMs for Robust Closed-Loop Driving☆65Mar 11, 2026Updated last month
- Epi: An Open Humanoid Platform☆17Jun 18, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Neural variational inference and learning in undirected graphical models http://www.stanford.edu/~kuleshov/papers/nips2017.pdf☆17Apr 25, 2018Updated 8 years ago
- The source code is related to our work- Shreyas Seshadri, Ulpu Remes and Okko Rasanen: "Dirichlet process mixture models for clustering i…☆10Aug 18, 2017Updated 8 years ago
- Learning Kinematic Feasibility through Reinforcement Leanring: http://rl.uni-freiburg.de/research/kinematic-feasibility-rl☆24Jan 27, 2021Updated 5 years ago
- Reinforcement Learning papers on exploration methods.☆19Jun 27, 2021Updated 4 years ago
- Exploring techniques to generate diverse conventions in multi-agent settings☆15Nov 14, 2023Updated 2 years ago
- Codes for the paper "HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism"☆26Oct 22, 2022Updated 3 years ago
- project that aims to run on raspberry pi and take voice commands and answering them using chatgpt api☆19May 24, 2023Updated 2 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM) on Pyramid env, Unity ML☆20Dec 17, 2023Updated 2 years ago
- Occupancy grid mapping based on 2D Lidar data assuming perfect knowledge of a robot's trajectory.☆11May 23, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code release for the paper "Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control"☆17Apr 9, 2024Updated 2 years ago
- Proximal Policy Optimization (Continuous Version) in PyTorch.☆29May 12, 2025Updated 11 months ago
- ☆13Jun 18, 2024Updated last year
- Do the YOLOv5 model inference by OpenCV/OpenVINO based on onnx model format☆13Feb 6, 2023Updated 3 years ago
- Github page for the preprint paper "InfoCatVAE: Representation Learning with Categorical Variational Autoencoders"☆14Oct 23, 2020Updated 5 years ago
- Navigation agent with Bayesian relational memory in the House3D environment☆30Sep 13, 2019Updated 6 years ago
- MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research☆42Jul 14, 2025Updated 9 months ago