makokal/MDPN

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/makokal/MDPN)

makokal / MDPN

Unified notation for Markov Decision Processes PO(MDP)s

☆24

Alternatives and similar repositories for MDPN

Users that are interested in MDPN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

urbanslug / graphite
View on GitHub
A varitation graph tool
☆10Dec 23, 2019Updated 6 years ago
dirkweissenborn / qa_network
View on GitHub
Implementation of QA Networks
☆10Jul 14, 2016Updated 10 years ago
IDSIA / lmtool-fwp
View on GitHub
PyTorch Language Modeling Toolkit for Fast Weight Programmers
☆22Jun 11, 2025Updated last year
emmaajordan / EvaluationOfRLAlgs
View on GitHub
This repository contains the code used in the paper Evaluating the Performance of Reinformcent Learning Algorithms
☆27Aug 14, 2021Updated 4 years ago
rl-lang-grounding / rl-lang-ground
View on GitHub
Tensorflow code for WACV 2019 paper "Attention Based Natural Language Grounding by Navigating Virtual Environment" - https://arxiv.org/ab…
☆17Nov 7, 2018Updated 7 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
quantified-uncertainty / ai-safety-papers
View on GitHub
☆22Sep 9, 2021Updated 4 years ago
sebastien-forestier / NIPS2016
View on GitHub
Autonomous exploration, active learning and human guidance with open-source Poppy humanoid robot platform and Explauto library
☆18May 22, 2018Updated 8 years ago
rltheorybook / rltheorybook.github.io
View on GitHub
☆29Jun 27, 2026Updated 3 weeks ago
andnp / PyExpUtils
View on GitHub
Experiment utility code, specifically designed for use with Compute Canada.
☆11Jan 27, 2025Updated last year
uclnlp / adversarial-nli
View on GitHub
Code and data for the CoNLL 2018 paper "Adversarially Regularising Neural NLI Models to Integrate Logical Background Knowledge."
☆25Jan 21, 2019Updated 7 years ago
google-research / dice_rl
View on GitHub
☆114Jul 3, 2026Updated 3 weeks ago
carpedm20 / RCMN
View on GitHub
Recurrent Convolutional Memory Network (in progress)
☆29Apr 16, 2016Updated 10 years ago
ondrejbiza / racetrack
View on GitHub
An environment for tabular Reinforcement Learning agents.
☆14Jun 13, 2018Updated 8 years ago
bhairavmehta95 / ant-env
View on GitHub
Ant Gather and Ant Maze envs, separated from RLLab
☆11Aug 2, 2018Updated 7 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
psclklnk / spdl
View on GitHub
Source code for the Self-Paced Deep Reinforcement Learning Experiments
☆31Mar 24, 2023Updated 3 years ago
mschulth / rhc
View on GitHub
Implementation of Receding Horizon Curiosity Algrithm
☆13Mar 24, 2023Updated 3 years ago
MLD3 / OfflineRL_ModelSelection
View on GitHub
[MLHC 2021] Model Selection for Offline RL: Practical Considerations for Healthcare Settings. https://arxiv.org/abs/2107.11003
☆11Oct 6, 2022Updated 3 years ago
mtrazzi / gym-alttp-gridworld
View on GitHub
A gym environment for Stuart Armstrong's model of a treacherous turn.
☆18Jul 28, 2018Updated 7 years ago
MLD3 / OfflineRL_FactoredActions
View on GitHub
[NeurIPS 2022] Leveraging Factored Action Spaces for Efficient Offline RL in Healthcare. https://arxiv.org/abs/2305.01738
☆11Nov 27, 2022Updated 3 years ago
dtak / POPCORN-POMDP
View on GitHub
Implementation of "POPCORN: Partially Observed Prediction Constrained Reinforcement Learning" (Futoma, Hughes, Doshi-Velez, AISTATS 2020)
☆11May 19, 2021Updated 5 years ago
rlai-lab / Regularized-GradientTD
View on GitHub
Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper.
☆38Oct 14, 2020Updated 5 years ago
MLD3 / RL4BG
View on GitHub
Public code release for "Deep Reinforcement Learning for Closed-Loop Blood Glucose Control" (Ian Fox et al.), MLHC 2020. https://arxiv.or…
☆13Feb 5, 2021Updated 5 years ago
clinicalml / trajectory-inspection
View on GitHub
Code for "Trajectory Inspection: A Method for Iterative Clinician-Driven Design of Reinforcement Learning Studies"
☆16Oct 15, 2020Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
suinleelab / cxr_adv
View on GitHub
Repository for the paper "An Adversarial Approach for the Robust Classification of Pneumonia from Chest Radiographs"
☆19Jan 14, 2020Updated 6 years ago
ericmedvet / 2dhmsr
View on GitHub
Java framework for experimenting with a 2-D version of the voxel-based soft robots.
☆20Mar 31, 2023Updated 3 years ago
JonathanCrabbe / Symbolic-Pursuit
View on GitHub
Github for the NIPS 2020 paper "Learning outside the black-box: at the pursuit of interpretable models"
☆14Sep 7, 2022Updated 3 years ago
two2tee / WorldModelPlanning
View on GitHub
☆17Mar 21, 2021Updated 5 years ago
alistairewj / icu-model-transfer
View on GitHub
Evaluating methods to improve model transfer for intensive care unit models
☆16Jul 6, 2023Updated 3 years ago
kristychoi / pixel_exploration
View on GitHub
PyTorch implementation of Count-Based Exploration with Neural Density Models
☆10Mar 22, 2018Updated 8 years ago
facebookarchive / NACS
View on GitHub
Jump to better conclusions: SCAN both left and right
☆11Jan 24, 2019Updated 7 years ago
radekosmulski / presidential
View on GitHub
☆11Feb 12, 2018Updated 8 years ago
openai / gym-wikinav
View on GitHub
Wikipedia navigation environment for OpenAI Gym
☆40Apr 2, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
lil-lab / blocks
View on GitHub
Blocks World -- Simulator, Code, and Models (Misra et al. EMNLP 2017)
☆40Feb 7, 2019Updated 7 years ago
ryonakamura / parlai_agents
View on GitHub
# ParlAI Agent examples with PyTorch, Chainer and TensorFlow
☆46Jan 19, 2018Updated 8 years ago
poppingtonic / dl-studies
View on GitHub
Notebooks and notes on data-driven experiments derived from my studies with fast.ai's courses.
☆10Sep 3, 2022Updated 3 years ago
brain-research / mirage-rl
View on GitHub
Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.
☆17Aug 2, 2018Updated 7 years ago
RobertCsordas / onion_representations
View on GitHub
☆13Aug 19, 2024Updated last year
Cranial-XIX / metric-residual-network
View on GitHub
Official PyTorch Implementation for Metric Residual Networks for Sample Efficient Goal-Conditioned Reinforcement Learning
☆20Jan 11, 2023Updated 3 years ago
futurulus / coop-nets
View on GitHub
Scalable learning with pragmatics
☆11Mar 31, 2018Updated 8 years ago