thiagopbueno / rddlgym
A toolkit for working with RDDL domains in Python3.
☆17Updated 4 years ago
Alternatives and similar repositories for rddlgym:
Users that are interested in rddlgym are comparing it to the libraries listed below
- A toolkit for auto-generation of OpenAI Gym environments from RDDL description files.☆76Updated this week
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆48Updated 2 years ago
- On the model-based stochastic value gradient for continuous reinforcement learning☆55Updated last year
- Probabilistic planning in continuous state-action MDPs in TensorFlow.☆12Updated 2 years ago
- A collection of RL algorithms written in JAX.☆95Updated 2 years ago
- ☆30Updated last year
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization☆33Updated 2 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 4 years ago
- This repository contains the code for RL for POMDPs through learning an Approximate Information State.☆20Updated 3 years ago
- AGAC: Adversarially Guided Actor-Critic☆48Updated 3 years ago
- Deep RL agents with PyTorch☆35Updated 3 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆85Updated 3 years ago
- Disagreement-Regularized Imitation Learning☆30Updated 3 years ago
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch.☆50Updated 3 years ago
- ☆34Updated 4 years ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆22Updated 10 months ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆36Updated 2 years ago
- ☆10Updated 3 years ago
- ☆47Updated 4 years ago
- Pytorch code for "Learning Belief Representations for Imitation Learning in POMDPs" (UAI 2019)☆18Updated 2 years ago
- Intrinsic Motivation and Automatic Curricula via Asymmetric Self-Play☆14Updated 6 years ago
- Safe Policy Improvement with Baseline Bootstrapping☆26Updated 4 years ago
- ☆28Updated 4 years ago
- Baselines for gymnax 🤖☆63Updated last year
- Learning Action-Value Gradients in Model-based Policy Optimization☆31Updated 3 years ago
- Code for generating options for planning and reinforcement learning☆11Updated 4 years ago
- ☆35Updated 2 years ago
- A Library of MDP algorithms for Artificial Intelligence☆18Updated 5 years ago
- Implementation of the Box-World environment from the paper "Relational Deep Reinforcement Learning"☆45Updated last year
- Deep Reinforcement Learning Framework done with PyTorch☆32Updated this week