Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations
☆49 · Apr 1, 2022 · Updated 3 years ago
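To make "exact calculations" concrete: when the transition dynamics of a small MDP are available as an explicit matrix, the value of a fixed policy can be obtained by solving the linear system (I - γP)v = r rather than by sampling. The sketch below is only an illustration of that idea and does not use emdp's API; the three-state chain, its rewards, the discount factor, and the helper names `solve` and `policy_value` are all assumptions made for this example.

```python
from fractions import Fraction as F


def solve(A, b):
    """Solve A x = b exactly by Gauss-Jordan elimination over Fractions."""
    n = len(A)
    # Build the augmented matrix [A | b].
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        # Find a row with a non-zero pivot and swap it into place.
        pivot = next(r for r in range(col, n) if M[r][col] != 0)
        M[col], M[pivot] = M[pivot], M[col]
        # Scale the pivot row so the pivot becomes 1.
        inv = 1 / M[col][col]
        M[col] = [x * inv for x in M[col]]
        # Eliminate this column from every other row.
        for r in range(n):
            if r != col and M[r][col] != 0:
                f = M[r][col]
                M[r] = [x - f * y for x, y in zip(M[r], M[col])]
    return [M[i][n] for i in range(n)]


def policy_value(P, r, gamma):
    """Exact state values of a fixed policy: solve (I - gamma * P) v = r."""
    n = len(P)
    A = [[F(int(i == j)) - gamma * P[i][j] for j in range(n)] for i in range(n)]
    return solve(A, r)


# Hypothetical 3-state chain under an "always move right" policy:
# each move succeeds with probability 4/5, otherwise the agent stays put.
# State 2 is absorbing and yields no further reward.
P = [
    [F(1, 5), F(4, 5), F(0)],
    [F(0), F(1, 5), F(4, 5)],
    [F(0), F(0), F(1)],
]
# Expected immediate reward per state: reaching state 2 from state 1 pays 1.
r = [F(0), F(4, 5), F(0)]
gamma = F(9, 10)

v = policy_value(P, r, gamma)
print(v)
```

Using rational arithmetic keeps the result free of rounding error, which is practical only because the state space is tiny; for larger grids a floating-point linear solver would be the usual choice.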
Alternatives and similar repositories for emdp
Users that are interested in emdp are comparing it to the libraries listed below
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019) ☆24 · May 30, 2019 · Updated 6 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning. ☆51 · Jan 16, 2019 · Updated 7 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019. ☆54 · May 15, 2019 · Updated 6 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid ☆12 · Jun 19, 2019 · Updated 6 years ago
- A collection of code investigating the use of information theory for abstractions in RL ☆16 · Nov 14, 2018 · Updated 7 years ago
- A simple Gridworld environment for OpenAI Gym ☆25 · Jun 10, 2018 · Updated 7 years ago
- The implementation of Discriminator Soft Actor Critic ☆15 · Jan 25, 2020 · Updated 6 years ago
- Repository for ML Reproducibility Challenge 2020 for the NeurIPS paper, "The Value Equivalence Principle for Model-Based Reinforcement Le… ☆18 · Apr 13, 2021 · Updated 4 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization ☆32 · Sep 7, 2021 · Updated 4 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019) ☆17 · Dec 8, 2022 · Updated 3 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm. ☆176 · Nov 14, 2024 · Updated last year
- Code for Sibling Rivalry and experiments presented in associated paper ☆17 · May 1, 2025 · Updated 10 months ago
- A place for YARP devices ☆11 · Oct 28, 2025 · Updated 4 months ago
- Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M … ☆44 · Dec 11, 2021 · Updated 4 years ago
- Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper. ☆37 · Oct 14, 2020 · Updated 5 years ago
- JavaScript bindings for YARP! ☆10 · Jun 17, 2024 · Updated last year
- Elastic foundation contact model for rigid body dynamics. ☆10 · Jun 11, 2020 · Updated 5 years ago
- Bayesian active RL (BARL) and trajectory information planning (TIP) ☆26 · Oct 11, 2022 · Updated 3 years ago
- krazy grid world ☆25 · Mar 2, 2020 · Updated 6 years ago
- ☆28 · Jan 11, 2021 · Updated 5 years ago
- Constrained Exploration and Recovery from Experience Shaping ☆22 · Apr 18, 2019 · Updated 6 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation ☆25 · Jul 18, 2023 · Updated 2 years ago
- modules required for running Roboy at fairs ☆12 · Sep 24, 2018 · Updated 7 years ago
- Attentional mechanism incorporated in DeepMind's Asynchronous Advantage Actor-Critic (A3C/A2C) ☆10 · Jan 9, 2018 · Updated 8 years ago
- SymPP: A Symbolic Library that compiles itself ☆13 · Nov 23, 2020 · Updated 5 years ago
- Avoiding catastrophic failures in reinforcement learning by learning to shape rewards. ☆10 · Nov 13, 2017 · Updated 8 years ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019. ☆10 · Jan 10, 2019 · Updated 7 years ago
- OSP C co-simulation API ☆13 · Aug 15, 2025 · Updated 6 months ago
- Learning bisimulation metrics for control, particularly suited to sparse reward settings ☆10 · Feb 28, 2023 · Updated 3 years ago
- Yarp modules and devices for autonomous navigation ☆10 · Jan 16, 2026 · Updated last month
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization ☆24 · Jun 24, 2020 · Updated 5 years ago
- Explorer is a PyTorch reinforcement learning framework for exploring new ideas. ☆97 · Jun 19, 2025 · Updated 8 months ago
- Positive-pressure medical ventilator system using Simscape™ ☆12 · May 7, 2024 · Updated last year
- Awesome OpenAI Gym environments ☆12 · Aug 6, 2019 · Updated 6 years ago
- Example code ☆10 · Feb 4, 2021 · Updated 5 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction" ☆46 · Sep 20, 2023 · Updated 2 years ago
- Comp 781 Project ☆10 · Jan 2, 2026 · Updated 2 months ago
- Exploration of motion capture using an OptiTrack NatNet system ☆14 · Apr 20, 2021 · Updated 4 years ago
- Unofficial PyTorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models" ☆197 · Dec 8, 2022 · Updated 3 years ago