ubisoft / DirectBehaviorSpecificationLinks
Code to reproduce the Arena environment experiments from Direct Behavior Specification via Constrained Reinforcement Learning.
☆22Updated 3 years ago
Alternatives and similar repositories for DirectBehaviorSpecification
Users that are interested in DirectBehaviorSpecification are comparing it to the libraries listed below
Sorting:
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.☆85Updated 2 weeks ago
- Implementation of Trajectory Transformer with attention caching and batched beam search☆116Updated 2 years ago
- Inverse Reinforcement Learning via State Marginal Matching, CoRL 2020☆45Updated 2 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆183Updated 3 years ago
- Deep Hierarchical Planning from Pixels☆113Updated 3 years ago
- ☆202Updated 2 years ago
- Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022☆32Updated 2 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆176Updated last year
- ☆48Updated 2 months ago
- OpenAI Gym wrapper for the DeepMind Control Suite☆226Updated last year
- Pytorch version of Dreamer, which follows the original TF v2 codes.☆139Updated 3 years ago
- Learning Invariant Representations for Reinforcement Learning without Reconstruction☆155Updated 4 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆125Updated 3 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆137Updated 4 months ago
- A pytorch implementation of Dreamer☆24Updated 2 years ago
- Author's PyTorch implementation of TD7 for online and offline RL☆162Updated 2 years ago
- pytorch-implementation of Dreamer (Model-based Image RL Algorithm)☆167Updated last year
- Benchmarking RL generalization in an interpretable way.☆174Updated 2 months ago
- behavior cloning from observation☆38Updated 5 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆191Updated 3 years ago
- ☆54Updated 2 years ago
- Representation Learning for RL☆130Updated 2 years ago
- ☆32Updated 4 years ago
- ☆59Updated 2 years ago
- Simple maze environments using mujoco-py☆58Updated 2 years ago
- Author's PyTorch implementation of LAP and PAL with TD3 and DDQN☆38Updated 4 years ago
- ☆19Updated 3 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆99Updated 5 years ago
- A general model-free off-policy actor-critic implementation. Continuous and Discrete Soft Actor-Critic with multimodal observations, data…☆40Updated last year
- Source files to replicate experiments in my ICLR 2022 paper.☆71Updated 6 months ago