Paul-543NA/matrix-mdp-gym

Paul-543NA / matrix-mdp-gym

A reinforcement learning environment for discrete MDPs.

☆25

Alternatives and similar repositories for matrix-mdp-gym

Users that are interested in matrix-mdp-gym are comparing it to the libraries listed below.

FragileTech / plangym
View on GitHub
Library that provides environments for planning problems
☆17Apr 24, 2026Updated 3 months ago
cor3bit / bertsekas-marl
View on GitHub
PyTorch Implementation of the Sequential Multiagent Rollout algorithm
☆11Jun 28, 2024Updated 2 years ago
RLG-Leiden / edugym
View on GitHub
☆15Sep 22, 2023Updated 2 years ago
damat-le / gym-simplegrid
View on GitHub
Simple Grid Environment for Gymnasium
☆65Mar 1, 2026Updated 5 months ago
QianJaneXie / PandoraBayesOpt
View on GitHub
Cost-aware Bayesian optimization via the Pandora's box Gittins index
☆13Aug 8, 2025Updated 11 months ago
zohannn / HUMP
View on GitHub
This is a Human-like Upper-limb Motion Planner (HUMP) for the generation of arm-hand movements in humanoid robots.
☆12Mar 4, 2022Updated 4 years ago
DavidRother / cooking_zoo
View on GitHub
CookingZoo: a gym-cooking derivative to simulate a complex cooking environment
☆22Dec 6, 2024Updated last year
lasgroup / aceirl
View on GitHub
Implementation of "Active Exploration for Inverse Reinforcement Learning" (AceIRL), NeurIPS 2022.
☆14Oct 12, 2022Updated 3 years ago
harish-kamath / rqae
View on GitHub
Residual Quantization Autoencoder, used for interpreting LLMs
☆14Jan 1, 2025Updated last year
mechanism-learning-research / two-player-auctions
View on GitHub
JAX/Haiku implementation of "Auction Learning as a Two-Player Game"
☆11Jul 6, 2024Updated 2 years ago
eleyng / table-carrying-ai
View on GitHub
An environment for table-carrying, a joint-action cooperative task.
☆10Jan 8, 2024Updated 2 years ago
facebookresearch / qEUBO
View on GitHub
Reproducible code for the paper "qEUBO: A Decision-Theoretic Acquisition Function for Preferential Bayesian Optimization" from AISTATS 2023
☆23Mar 24, 2023Updated 3 years ago
osrf / servicesim
View on GitHub
Service Robot Simulator
☆11May 3, 2020Updated 6 years ago
Psi-Prod / ppx_system
View on GitHub
ppx_system is a syntax extension to know the operating system at compile time
☆12May 9, 2023Updated 3 years ago
ArthurConmy / MishformerLens
View on GitHub
MishformerLens intends to be a drop-in replacement for TransformerLens that AST patches HuggingFace Transformers rather than implementing…
☆10Oct 7, 2024Updated last year
Livioni / Reinforcement-Learning-of-Sequential-Price-Mechanisms
View on GitHub
Reproduction of the paper "Reinforcement Learning of Sequential Price Mechanisms"
☆12Nov 3, 2022Updated 3 years ago
NathanGavenski / IL-Datasets
View on GitHub
This is a project for creating and using IL datasets based on HuggingFace weights with multithreads for performance, and benchmarking
☆13Jun 23, 2026Updated last month
Butanium / monte-carlo-tree-search-TSP
View on GitHub
Monte Carlo tree search for the travelling salesman problem (MCTS for the TSP)
☆12Jun 18, 2022Updated 4 years ago
ninell-oldenburg / social-contracts
View on GitHub
☆13Mar 12, 2024Updated 2 years ago
MetaCell / nwb-explorer
View on GitHub
NWB Explorer is a web application to visualise and analyse the content of NWB:N 2 files
☆27Aug 28, 2025Updated 11 months ago
facebookresearch / preference-exploration
View on GitHub
Code for replicating experiments from the paper, Preference Exploration for Efficient Bayesian Optimization with Multiple Outcomes, publi…
☆14Jun 22, 2023Updated 3 years ago
aryandeshwal / BODi
View on GitHub
☆12Mar 17, 2024Updated 2 years ago
wujian16 / TwoStep-BayesOpt
View on GitHub
NeurIPS 2019 Paper
☆12Dec 9, 2019Updated 6 years ago
WeihaoTan / gym-macro-overcooked
View on GitHub
☆16May 11, 2023Updated 3 years ago
Tim-ats-d / Macron
View on GitHub
A powerful keybind library and daemon for Linux.
☆11Jul 24, 2022Updated 4 years ago
QueraTeam / mattermost-vazirmatn
View on GitHub
Change Mattermost font to Vazirmatn
☆15Sep 29, 2023Updated 2 years ago
robostac / coders-strike-back-referee
View on GitHub
Brutaltester compatible referee for coders strike back
☆13Jun 1, 2026Updated 2 months ago
shadowkiller33 / Contrast-Instruction
View on GitHub
☆19Oct 2, 2023Updated 2 years ago
tianyilim / RRTx
View on GitHub
An implementation of the RRTx Algorithm in Python
☆11Apr 16, 2024Updated 2 years ago
benavoli / SkewGP
View on GitHub
Skew Gaussian Processes by Alessio Benavoli, Dario Azzimonti and Dario Piga
☆16Aug 5, 2025Updated 11 months ago
drbenvincent / darc_toolbox
View on GitHub
Run adaptive decision making experiments
☆16Nov 9, 2021Updated 4 years ago
jbkjr / train-procgen-pytorch
View on GitHub
PyTorch implementation of OpenAI's Procgen ppo-baseline, built from scratch.
☆14May 17, 2024Updated 2 years ago
Sadie-Zhao / Zero-Sum-Stochastic-Stackelberg-Games-NeurIPS
View on GitHub
This is the code repository for the paper "Zero-Sum Stochastic Stackelberg Games".
☆18Oct 12, 2022Updated 3 years ago
CMACH508 / 2020-GNN-MCTS-TSP
View on GitHub
☆13Jun 30, 2020Updated 6 years ago
x35f / model_based_rl
View on GitHub
model based reinforcement learning algorithms for unstable baselines
☆15May 9, 2023Updated 3 years ago
asaran / VeSSAL
View on GitHub
This repository contains code used to conduct experiments reported in the paper "Streaming Active Learning with Deep Neural Networks" acc…
☆14Mar 7, 2025Updated last year
mattbdean / Helium
View on GitHub
A companion website for DataJoint
☆10Feb 13, 2026Updated 5 months ago
IBM / forbiditerative
View on GitHub
ForbidIterative planners for top-k, top-quality, and diverse planning problems
☆23Oct 4, 2025Updated 9 months ago
moratodpg / imp_marl
View on GitHub
IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARL
☆46May 18, 2026Updated 2 months ago