A python package to design and debug RL agents.
☆33Apr 2, 2026Updated last month
Alternatives and similar repositories for mdp-playground
Users that are interested in mdp-playground are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is the code of reproducing the results of our paper: On the importance of Hyperparameter Optimization for Model-based Reinforcement …☆16Aug 19, 2021Updated 4 years ago
- This repository may contain useful scripts for debugging on a remote slurm cluster.☆10Nov 8, 2024Updated last year
- Your solution for stiffness problems☆25Apr 5, 2025Updated last year
- ☆91Jan 27, 2026Updated 3 months ago
- We propose an evolution-based approach to meta-learn synthetic neural environments and reward neural networks for reinforcement learning.☆21Feb 23, 2023Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Advanced_Data_Integration_Project☆11Jul 31, 2018Updated 7 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019☆10Mar 24, 2023Updated 3 years ago
- ☆16Jul 13, 2022Updated 3 years ago
- Release doc/tutorial/wheels for poseidon-tf☆10Jan 18, 2018Updated 8 years ago
- ☆18Jul 25, 2024Updated last year
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Oct 6, 2021Updated 4 years ago
- A collection of matrix games in JAX☆13Apr 13, 2026Updated 3 weeks ago
- ☆12Jun 17, 2022Updated 3 years ago
- Code repository accompanying the Heuristic Guided RL NeurIPS'21 paper☆17Jan 3, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Distributed DRL by Ray and TensorFlow Tutorial.☆10Dec 26, 2019Updated 6 years ago
- The Starcraft Multi-Agent challenge lite☆47Sep 13, 2024Updated last year
- Implementing different learning algorithms and analyzing their performance in a Markov game model called the Soccer Game☆23Jan 29, 2023Updated 3 years ago
- Launching and monitoring Slurm experiments in Python☆26Mar 30, 2026Updated last month
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works☆18Mar 2, 2021Updated 5 years ago
- ☆30Aug 20, 2021Updated 4 years ago
- ☆77Apr 3, 2024Updated 2 years ago
- Minimal codes for "Task-Oriented Dexterous Hand Pose Synthesis Using Differentiable Grasp Wrench Boundary Estimator [IROS 2024]"☆15Feb 12, 2025Updated last year
- A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers☆58Jan 20, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆14Jul 5, 2025Updated 10 months ago
- PErception and Robotic Learning System v2☆12Mar 31, 2023Updated 3 years ago
- A comprehensive list of papers related to Physics-based and Learning-based Differentiable Simulation for Robotics, including papers, code…☆11Apr 7, 2023Updated 3 years ago
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…☆87Nov 27, 2023Updated 2 years ago
- Python implementation of tabular asynchronous actor critic☆11May 3, 2016Updated 10 years ago
- ViViDex implementation under the SAPIEN simulator, ICRA 2025☆18Apr 9, 2025Updated last year
- Experiment code for testing effect of various action space transformations in reinforcement learning☆30May 26, 2020Updated 5 years ago
- Benchmark for evaluating the generalization capabilities of Multi-Objective Reinforcement Learning (MORL) algorithms.☆26Jun 6, 2025Updated 11 months ago
- Repository for (for now) filing bug reports about PLAI.☆15Jul 5, 2025Updated 10 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆13May 25, 2023Updated 2 years ago
- A tool to automate installing Atari ROMs for the Arcade Learning Environment☆79Aug 11, 2025Updated 8 months ago
- MuJoCo motion planning library.☆22Apr 27, 2026Updated last week
- [IROS 2025] Official repository of GRIP: A General Robotic Incremental Potential Contact Simulation Dataset for Unified Deformable-Rigid …☆19Jul 10, 2025Updated 9 months ago
- Illustration of counterfactual inference following Ferenc Huszar example☆13Aug 15, 2025Updated 8 months ago
- This repo contains active learning query strategies as introduced in our GCPR 2013 paper.☆12Aug 12, 2013Updated 12 years ago
- JVRC1 model files for MuJoCo☆10Mar 24, 2026Updated last month