A reinforcement learning environment for discrete MDPs.
☆25Nov 10, 2024Updated last year
Alternatives and similar repositories for matrix-mdp-gym
Users that are interested in matrix-mdp-gym are comparing it to the libraries listed below.
- Library that provides environments for planning problems☆16Apr 24, 2026Updated last month
- PyTorch Implementation of the Sequential Multiagent Rollout algorithm☆11Jun 28, 2024Updated last year
- ☆15Sep 22, 2023Updated 2 years ago
- Simple Grid Environment for Gymnasium☆66Mar 1, 2026Updated 2 months ago
- Cost-aware Bayesian optimization via the Pandora's box Gittins index☆14Aug 8, 2025Updated 9 months ago
- Code for the paper "Learning to Do or Learning While Doing: Reinforcement Learning and Bayesian Optimisation for Online Continuous Tuning…☆13Nov 15, 2023Updated 2 years ago
- Residual Quantization Autoencoder, used for interpreting LLMs☆14Jan 1, 2025Updated last year
- CookingZoo: a gym-cooking derivative to simulate a complex cooking environment☆22Dec 6, 2024Updated last year
- Reproducible code for paper "qEUBO A Decision-Theoretic Acquisition Function for Preferential Bayesian Optimization" from AISTATS 2023☆22Mar 24, 2023Updated 3 years ago
- An environment for table-carrying, a joint-action cooperative task.☆10Jan 8, 2024Updated 2 years ago
- ppx_system is a syntax extension to know the operating system at compile time☆12May 9, 2023Updated 3 years ago
- MishformerLens intends to be a drop-in replacement for TransformerLens that AST patches HuggingFace Transformers rather than implementing…☆10Oct 7, 2024Updated last year
- Monte Carlo tree search for the travelling salesman problem (MCTS for the TSP)☆12Jun 18, 2022Updated 3 years ago
- This is the code repository for the paper "Zero-Sum Stochastic Stackelberg Games".☆17Oct 12, 2022Updated 3 years ago
- JAX/Haiku implementation of "Auction Learning as a Two-Player Game"☆11Jul 6, 2024Updated last year
- ☆13Mar 12, 2024Updated 2 years ago
- ☆27Feb 23, 2026Updated 3 months ago
- NWB Explorer is a web application to visualise and analyse the content of NWB:N 2 files☆27Aug 28, 2025Updated 9 months ago
- ☆13Jun 30, 2020Updated 5 years ago
- Runtime library and schema compiler for the Avro serialization format☆21Dec 13, 2021Updated 4 years ago
- A realtime multicellular organism evolution simulator with Verlet integration☆12May 30, 2021Updated 5 years ago
- A powerful keybind library and daemon for Linux.☆11Jul 24, 2022Updated 3 years ago
- ☆12Mar 17, 2024Updated 2 years ago
- Brutaltester compatible referee for coders strike back☆12Nov 27, 2018Updated 7 years ago
- Scala Native 3 bindings for SFML library☆15Jul 9, 2023Updated 2 years ago
- Collection of tools to handle Neuropixel 1.0 and 2.0 data☆19May 15, 2026Updated 2 weeks ago
- ☆18Nov 10, 2023Updated 2 years ago
- ☆18Dec 10, 2025Updated 5 months ago
- Code for the paper "Harnessing Discrete Representations for Continual Reinforcement Learning"☆16Jun 16, 2024Updated last year
- Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.☆14May 17, 2024Updated 2 years ago
- Structure refinement software for total scattering data☆15May 21, 2026Updated last week
- Create Custom GYM Environment for SUMO and reinforcement learning agent☆15May 5, 2023Updated 3 years ago
- Skew Gaussian Processes by Alessio Benavoli, Dario Azzimonti and Dario Piga☆16Aug 5, 2025Updated 9 months ago
- Tools for optimizing steering vectors in LLMs.☆22Apr 10, 2025Updated last year
- Run adaptive decision making experiments☆16Nov 9, 2021Updated 4 years ago
- Metrics for spike sorting validation/quality control☆15Sep 8, 2021Updated 4 years ago
- ☆17Jul 9, 2025Updated 10 months ago
- Mesoscale activity ephys ingest schema☆11Jul 18, 2023Updated 2 years ago
- SpikeGLX postprocessing [tshift filter CAR fix join extract]☆16May 5, 2026Updated 3 weeks ago