A Python package to design and debug RL agents.
☆33Jan 15, 2026Updated last month
Alternatives and similar repositories for mdp-playground
Users who are interested in mdp-playground are comparing it to the libraries listed below.
- Code for reproducing the results of our paper: On the importance of Hyperparameter Optimization for Model-based Reinforcement …☆16Aug 19, 2021Updated 4 years ago
- Advanced_Data_Integration_Project☆11Jul 31, 2018Updated 7 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019☆10Mar 24, 2023Updated 2 years ago
- ☆16Jul 13, 2022Updated 3 years ago
- Release doc/tutorial/wheels for poseidon-tf☆10Jan 18, 2018Updated 8 years ago
- Distributed DRL by Ray and TensorFlow Tutorial.☆10Dec 26, 2019Updated 6 years ago
- ☆12Jun 17, 2022Updated 3 years ago
- Code repository accompanying the Heuristic Guided RL NeurIPS'21 paper☆17Jan 3, 2022Updated 4 years ago
- A collection of matrix games in JAX☆13Nov 28, 2024Updated last year
- ☆18Jul 25, 2024Updated last year
- Your solution for stiffness problems☆24Apr 5, 2025Updated 11 months ago
- ☆91Jan 27, 2026Updated last month
- The Starcraft Multi-Agent challenge lite☆47Sep 13, 2024Updated last year
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works☆18Mar 2, 2021Updated 5 years ago
- We propose an evolution-based approach to meta-learn synthetic neural environments and reward neural networks for reinforcement learning.☆21Feb 23, 2023Updated 3 years ago
- CFR implementation of a poker bot.☆12Feb 17, 2023Updated 3 years ago
- Getting Started with NIMBUS-CORE☆10Dec 16, 2023Updated 2 years ago
- A categorised list of Multi-Agent Reinforcement Learning (MARL) papers☆56Jan 20, 2023Updated 3 years ago
- ☆25Sep 23, 2024Updated last year
- Reinforcement Learning | Multi-Agent RL | Self-Play | Proximal Policy Optimization Algorithm (PPO) agent | Unity Tennis environment☆20Dec 2, 2025Updated 3 months ago
- Docker containers of baseline agents for the Crafter environment☆30Dec 14, 2021Updated 4 years ago
- StarCraft 2 Imitation Learning☆29Jul 2, 2021Updated 4 years ago
- A tool to automate installing Atari ROMs for the Arcade Learning Environment☆79Aug 11, 2025Updated 6 months ago
- Experiment code for testing effect of various action space transformations in reinforcement learning☆30May 26, 2020Updated 5 years ago
- ☆30Aug 20, 2021Updated 4 years ago
- (AAAI'2019) The codes, models, logs, and data for an extended paper of the original paper "On Reinforcement Learning for Full-length Game…☆31Oct 5, 2022Updated 3 years ago
- Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems"☆34Dec 9, 2022Updated 3 years ago
- Official code repository for the MICCAI 2025 paper "UltraRay: Introducing Full-Path Ray Tracing in Physics-Based Ultrasound Simulation"☆17Aug 13, 2025Updated 6 months ago
- Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M …☆44Dec 11, 2021Updated 4 years ago
- An interactive framework to visualize and analyze your AutoML process in real-time.☆94Updated this week
- ☆78Apr 3, 2024Updated last year
- A QA system based on k8s-specific knowledge, built on ChatGLM2-6B and served by Ray.☆10Sep 14, 2023Updated 2 years ago
- ☆12Jul 5, 2025Updated 8 months ago
- SineKAN: Kolmogorov-Arnold Networks Using Sinusoidal Activation Functions☆15Dec 19, 2024Updated last year
- 🌿 Quickly generate a folder directory structure; supports defining the directory depth and writing the output to a markdown file.☆13Oct 19, 2022Updated 3 years ago
- Datacenter simulation toolkit for the OpenDC project☆10Aug 24, 2020Updated 5 years ago
- A Texas Hold'em poker framework written in C++20.☆11Apr 23, 2023Updated 2 years ago
- ☆21Jul 2, 2025Updated 8 months ago
- [NeurIPS 2022] Open source code for reusing prior computational work in RL.☆100Jul 5, 2023Updated 2 years ago