A python package to design and debug RL agents.
☆33Apr 2, 2026Updated 2 months ago
Alternatives and similar repositories for mdp-playground
Users that are interested in mdp-playground are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is the code of reproducing the results of our paper: On the importance of Hyperparameter Optimization for Model-based Reinforcement …☆16Aug 19, 2021Updated 4 years ago
- This repository may contain useful scripts for debugging on a remote slurm cluster.☆10Nov 8, 2024Updated last year
- A distributed GPU-centric experience replay system for large AI models.☆19Aug 1, 2023Updated 2 years ago
- Your solution for stiffness problems☆25Apr 5, 2025Updated last year
- ☆91Jan 27, 2026Updated 4 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- We propose an evolution-based approach to meta-learn synthetic neural environments and reward neural networks for reinforcement learning.☆21Feb 23, 2023Updated 3 years ago
- Advanced_Data_Integration_Project☆11Jul 31, 2018Updated 7 years ago
- ☆16Jul 13, 2022Updated 3 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019☆10Mar 24, 2023Updated 3 years ago
- ☆18Jul 25, 2024Updated last year
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Oct 6, 2021Updated 4 years ago
- ☆25Sep 23, 2024Updated last year
- A collection of matrix games in JAX☆14Apr 13, 2026Updated 2 months ago
- ☆12Jun 17, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code repository accompanying the Heuristic Guided RL NeurIPS'21 paper☆17Jan 3, 2022Updated 4 years ago
- Release doc/tutorial/wheels for poseidon-tf☆10Jan 18, 2018Updated 8 years ago
- I read and summarized an academic paper every day for a year.☆11Dec 27, 2020Updated 5 years ago
- Distributed DRL by Ray and TensorFlow Tutorial.☆10Dec 26, 2019Updated 6 years ago
- The Starcraft Multi-Agent challenge lite☆48Sep 13, 2024Updated last year
- Implementing different learning algorithms and analyzing their performance in a Markov game model called the Soccer Game☆23Jan 29, 2023Updated 3 years ago
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works☆18Mar 2, 2021Updated 5 years ago
- ☆30Aug 20, 2021Updated 4 years ago
- ☆77Apr 3, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Minimal codes for "Task-Oriented Dexterous Hand Pose Synthesis Using Differentiable Grasp Wrench Boundary Estimator [IROS 2024]"☆16Feb 12, 2025Updated last year
- ☆15Jul 5, 2025Updated 11 months ago
- PErception and Robotic Learning System v2☆12Mar 31, 2023Updated 3 years ago
- A comprehensive list of papers related to Physics-based and Learning-based Differentiable Simulation for Robotics, including papers, code…☆11Apr 7, 2023Updated 3 years ago
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…☆88Nov 27, 2023Updated 2 years ago
- Python implementation of tabular asynchronous actor critic☆11May 3, 2016Updated 10 years ago
- Experiment code for testing effect of various action space transformations in reinforcement learning☆30May 26, 2020Updated 6 years ago
- Benchmark for evaluating the generalization capabilities of Multi-Objective Reinforcement Learning (MORL) algorithms.☆28Jun 6, 2025Updated last year
- Repository for (for now) filing bug reports about PLAI.☆15Jul 5, 2025Updated 11 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆22Jul 2, 2025Updated 11 months ago
- A tool to automate installing Atari ROMs for the Arcade Learning Environment☆79Aug 11, 2025Updated 10 months ago
- MuJoCo motion planning library.☆22Jun 8, 2026Updated last week
- Illustration of counterfactual inference following Ferenc Huszar example☆13Aug 15, 2025Updated 10 months ago
- This repo contains active learning query strategies as introduced in our GCPR 2013 paper.☆12Aug 12, 2013Updated 12 years ago
- This library provides expression trees for representation of geometric expressions and automatic differentiation of these expressions. Th…☆14Aug 24, 2023Updated 2 years ago
- [CoRL 2024] ScissorBot: Learning Generalizable Scissor Skill for Paper Cutting via Simulation, Imitation, and Sim2Real☆15Dec 25, 2024Updated last year