thiagopbueno / rddlgym
A toolkit for working with RDDL domains in Python3.
☆17 · Updated 4 years ago
Alternatives and similar repositories for rddlgym
Users who are interested in rddlgym are comparing it to the libraries listed below.
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm ☆44 · Updated 6 years ago
- On the model-based stochastic value gradient for continuous reinforcement learning ☆55 · Updated last year
- Reinforcement Learning framework for Temporal Goals ☆11 · Updated 2 years ago
- Code for "Dream and Search to Control: Latent Space Planning for Continuous Control" ☆11 · Updated 3 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining" ☆28 · Updated 5 years ago
- A toolkit for auto-generation of OpenAI Gym environments from RDDL description files. ☆84 · Updated last week
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization ☆24 · Updated 4 years ago
- AGAC: Adversarially Guided Actor-Critic ☆49 · Updated 3 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization ☆31 · Updated 3 years ago
- Recurrent continuous reinforcement learning algorithms implemented in PyTorch. ☆51 · Updated 4 years ago
- PyTorch code for "Learning Belief Representations for Imitation Learning in POMDPs" (UAI 2019) ☆18 · Updated 2 years ago
- Deep Reinforcement Learning Framework done with PyTorch ☆36 · Updated 2 months ago
- ☆30 · Updated last year
- Mirror Descent Policy Optimization ☆38 · Updated 4 years ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024 ☆23 · Updated last year
- This repository contains the code for RL for POMDPs through learning an Approximate Information State. ☆21 · Updated 3 years ago
- ☆29 · Updated 4 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023); a football game agent ☆13 · Updated 2 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations ☆49 · Updated 3 years ago
- ☆47 · Updated 4 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation ☆26 · Updated last year
- ☆29 · Updated 2 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation ☆40 · Updated 7 months ago
- Code for generating options for planning and reinforcement learning ☆12 · Updated 4 years ago
- Modular Single-file Reinforcement Learning Algorithms Library ☆37 · Updated 2 years ago
- Implementation of Relational Deep Reinforcement Learning ☆25 · Updated 5 years ago
- Model-based reinforcement learning in TensorFlow ☆56 · Updated 3 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238 ☆47 · Updated 4 years ago
- On-policy optimization baselines for deep reinforcement learning ☆30 · Updated 5 years ago
- 📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331) ☆24 · Updated 3 years ago