A reinforcement learning environment for discrete MDPs.
☆25Nov 10, 2024Updated last year
Alternatives and similar repositories for matrix-mdp-gym
Users who are interested in matrix-mdp-gym are comparing it to the libraries listed below.
- Library that provides environments for planning problems☆16Updated this week
- PyTorch Implementation of the Sequential Multiagent Rollout algorithm☆11Jun 28, 2024Updated last year
- ☆15Sep 22, 2023Updated 2 years ago
- Preprint | Previously at GenBio ICML 2025☆19Aug 20, 2025Updated 7 months ago
- Simple Grid Environment for Gymnasium☆66Mar 1, 2026Updated last month
- This is a Human-like Upper-limb Motion Planner (HUMP) for the generation of arm-hand movements in humanoid robots.☆11Mar 4, 2022Updated 4 years ago
- Code for the paper "Learning to Do or Learning While Doing: Reinforcement Learning and Bayesian Optimisation for Online Continuous Tuning…☆13Nov 15, 2023Updated 2 years ago
- Residual Quantization Autoencoder, used for interpreting LLMs☆14Jan 1, 2025Updated last year
- CookingZoo: a gym-cooking derivative to simulate a complex cooking environment☆21Dec 6, 2024Updated last year
- ☆15May 11, 2023Updated 2 years ago
- @ngrok/mantle ui component library | https://develop.mantle.ngrok.com☆13Updated this week
- Monte Carlo tree search for the travelling salesman problem (MCTS for the TSP)☆12Jun 18, 2022Updated 3 years ago
- This is the code repository for the paper "Zero-Sum Stochastic Stackelberg Games".☆17Oct 12, 2022Updated 3 years ago
- ☆25Feb 23, 2026Updated last month
- ☆13Feb 24, 2023Updated 3 years ago
- A library for training crosscoders☆17May 28, 2025Updated 10 months ago
- This is a project for creating and using IL datasets based on HuggingFace weights with multithreads for performance, and benchmarking☆13Mar 10, 2026Updated last month
- Course materials for the Modern Methods for Quantifying Behavior Course!☆13Nov 10, 2023Updated 2 years ago
- Code for replicating experiments from the paper, Preference Exploration for Efficient Bayesian Optimization with Multiple Outcomes, publi…☆13Jun 22, 2023Updated 2 years ago
- A realtime multicellular organism evolution simulator with Verlet integration☆12May 30, 2021Updated 4 years ago
- A powerful keybind library and daemon for Linux.☆11Jul 24, 2022Updated 3 years ago
- ☆11Mar 17, 2024Updated 2 years ago
- ☆14Jun 10, 2022Updated 3 years ago
- Brutaltester compatible referee for coders strike back☆12Nov 27, 2018Updated 7 years ago
- Scala Native 3 bindings for SFML library☆15Jul 9, 2023Updated 2 years ago
- ☆18Nov 10, 2023Updated 2 years ago
- Structure refinement software for total scattering data☆14Updated this week
- Code for the paper "Harnessing Discrete Representations for Continual Reinforcement Learning"☆16Jun 16, 2024Updated last year
- Skew Gaussian Processes by Alessio Benavoli, Dario Azzimonti and Dario Piga☆16Aug 5, 2025Updated 8 months ago
- The AI Arena: A framework for distributed multi-agent reinforcement learning☆14Aug 5, 2022Updated 3 years ago
- Metrics for spike sorting validation/quality control☆15Sep 8, 2021Updated 4 years ago
- Tools, algorithms, and frameworks for managing and analyzing neural data.☆20Mar 21, 2022Updated 4 years ago
- Home of NumPy user and reference documentation☆13Apr 10, 2026Updated last week
- SpikeGLX postprocessing [tshift filter CAR fix join extract]☆15Apr 6, 2026Updated last week
- This code is used to verify the availability of the dataset and to verify the link between watermelon image sound and sweetness☆16Jul 13, 2025Updated 9 months ago
- Coding course materials for Brain-like computation and intelligence☆17Updated this week
- PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind)☆20Jan 19, 2025Updated last year
- This repository contains code used to conduct experiments reported in the paper "Streaming Active Learning with Deep Neural Networks" acc…☆14Mar 7, 2025Updated last year
- Subject of the hackathon 42☆12Nov 9, 2022Updated 3 years ago