zafarali/emdp

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zafarali/emdp)

zafarali / emdp

Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations

☆49

Alternatives and similar repositories for emdp

Users that are interested in emdp are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ruizhaogit / mep
View on GitHub
Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)
☆24May 30, 2019Updated 7 years ago
google-research / policy-learning-landscape
View on GitHub
Explore the optimization landscape for direct policy learning reinforcement learning.
☆51Jan 16, 2019Updated 7 years ago
rlai-lab / Regularized-GradientTD
View on GitHub
Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper.
☆38Oct 14, 2020Updated 5 years ago
jinnaiyuu / Optimal-Options-ICML-2019
View on GitHub
Code for generating options for planning and reinforcement learning
☆12Feb 18, 2021Updated 5 years ago
ermongroup / CalibratedModelBasedRL
View on GitHub
Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.
☆54May 15, 2019Updated 7 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ankitkv / TD-VAE
View on GitHub
TD-VAE in PyTorch
☆10May 28, 2019Updated 7 years ago
nnaisense / MAGE
View on GitHub
Learning Action-Value Gradients in Model-based Policy Optimization
☆32Sep 7, 2021Updated 4 years ago
jeappen / gym-grid
View on GitHub
A simple Gridworld environment for Open AI gym
☆25Jun 10, 2018Updated 8 years ago
fusion-ml / trajectory-information-rl
View on GitHub
Bayesian active RL (BARL) and trajectory information planning (TIP)
☆26Oct 11, 2022Updated 3 years ago
bonniesjli / DQN_SR
View on GitHub
Count based exploration with the successor representation for Unity ML's Pyramid
☆12Jun 19, 2019Updated 7 years ago
metekemertas / RobustBisimulation
View on GitHub
Learning bisimulation metrics for control, particularly suited to sparse reward settings
☆11Feb 28, 2023Updated 3 years ago
RajGhugare19 / VE-principle-for-model-based-RL
View on GitHub
Repository for ML Reproducibility Challenge 2020 for the Neurips paper, "The Value Equivalence Principle for Model-Based Reinforcement Le…
☆18Apr 13, 2021Updated 5 years ago
dnishio / DSAC
View on GitHub
The implementation of Discriminator Soft Actor Critic
☆15Jan 25, 2020Updated 6 years ago
bstadie / krazyworld
View on GitHub
krazy grid world
☆26Mar 2, 2020Updated 6 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
watchernyu / REDQ
View on GitHub
Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.
☆185Nov 14, 2024Updated last year
flowersteam / rl-difference-testing
View on GitHub
Simple tools for statistical analyses in RL experiments
☆67Jun 21, 2018Updated 8 years ago
zackchase / intrinsic-fear-dqn
View on GitHub
Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.
☆10Nov 13, 2017Updated 8 years ago
shamanez / VUSFA-Variational-Universal-Successor-Features-Approximator
View on GitHub
This repository contains implementations of the paper VUSFA
☆14Mar 31, 2021Updated 5 years ago
ChunyuanLI / RAS
View on GitHub
AISTATS 2019: Reference-based Adversarial Sampling & Its applications to Soft Q-learning
☆15Jan 21, 2019Updated 7 years ago
quanvuong / Supervised_Policy_Update
View on GitHub
Code to reproduce Supervised Policy Update (ICLR 2019)
☆17Dec 8, 2022Updated 3 years ago
david-abel / rl_info_theory
View on GitHub
A collection of code investigating the use of information theory for abstractions in RL
☆16Nov 14, 2018Updated 7 years ago
ming93 / Safe_reinforcement_learning
View on GitHub
Convergent Policy Optimization for Safe Reinforcement Learning
☆11Oct 26, 2019Updated 6 years ago
schmidtdominik / Rainbow
View on GitHub
Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M …
☆44Dec 11, 2021Updated 4 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
IBM / constrained-rl
View on GitHub
Constrained Exploration and Recovery from Experience Shaping
☆22Apr 18, 2019Updated 7 years ago
gsastry / human-rl
View on GitHub
Code for human intervention reinforcement learning
☆35Jan 8, 2018Updated 8 years ago
kvfrans / powderworld
View on GitHub
Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions
☆74Aug 31, 2024Updated last year
lcalem / reproduction-soft-qlearning-mutual-information
View on GitHub
Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.
☆10Jan 10, 2019Updated 7 years ago
ben-eysenbach / info_geometry
View on GitHub
Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"
☆20Oct 6, 2021Updated 4 years ago
qlan3 / Explorer
View on GitHub
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
☆98Jul 17, 2026Updated last week
ssingh82 / rl_nav
View on GitHub
This is the accompannying code for the paper "SLAM-Safe Planner: Preventing Monocular SLAM Failure using Reinforcement Learning" and "Dat…
☆18Sep 15, 2017Updated 8 years ago
rcheng805 / CORE-RL
View on GitHub
Code implementing the CORE-RL algorithm with DDPG, PPO, and TRPO. See the paper "Control Regularization for Reduced Variance Reinforcemen…
☆32Jan 7, 2021Updated 5 years ago
salesforce / sibling-rivalry
View on GitHub
Code for Sibling Rivalry and experiments presented in associated paper
☆18May 1, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ubisoft / ubisoft-laforge-asaf
View on GitHub
Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization
☆16Dec 10, 2020Updated 5 years ago
AlgTUDelft / AlwaysSafe
View on GitHub
Code for the paper "AlwaysSafe: Reinforcement Learning Without Safety Constraint Violations During Training"
☆17May 9, 2022Updated 4 years ago
DartML / PPO-Stein-Control-Variate
View on GitHub
Proximal Policy Optimization with Stein Control Variates:
☆33Feb 12, 2018Updated 8 years ago
behaviorguidedRL / BGRL
View on GitHub
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
☆24Jun 24, 2020Updated 6 years ago
art-ai / pypsdd
View on GitHub
The Python PSDD Package
☆19Jul 20, 2025Updated last year
bhairavmehta95 / ant-env
View on GitHub
Ant Gather and Ant Maze envs, separated from RLLab
☆11Aug 2, 2018Updated 7 years ago
ryanelandt / PressureFieldContact.jl
View on GitHub
Elastic foundation contact model for rigid body dynamics.
☆10Jun 11, 2020Updated 6 years ago