facebookresearch / online-dtLinks
Online Decision Transformer
☆272Updated last year
Alternatives and similar repositories for online-dt
Users that are interested in online-dt are comparing it to the libraries listed below
Sorting:
- Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in Open…☆280Updated 3 years ago
- Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"☆518Updated 3 years ago
- ☆289Updated 3 years ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆164Updated 2 years ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆374Updated 3 years ago
- Code for conservative Q-learning☆461Updated 3 years ago
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆218Updated last year
- An elegant PyTorch offline reinforcement learning library for researchers.☆363Updated 3 months ago
- A collection of offline reinforcement learning algorithms.☆203Updated 11 months ago
- ☆114Updated 2 years ago
- ☆355Updated 2 years ago
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"☆181Updated 2 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆141Updated last year
- Implementation of Trajectory Transformer with attention caching and batched beam search☆116Updated 2 years ago
- Author's PyTorch implementation of TD7 for online and offline RL☆151Updated 2 years ago
- PyTorch implementation of DreamerV2 model-based RL algorithm☆230Updated 2 years ago
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022☆337Updated last year
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆179Updated 3 years ago
- Multi Task RL Baselines☆255Updated 3 years ago
- Re-implementations of SOTA RL algorithms.☆135Updated 2 years ago
- Official code repository for Prompt-DT.☆116Updated 3 years ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆193Updated last year
- ☆240Updated 11 months ago
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.☆363Updated 2 years ago
- Datasets with baselines for Offline MARL.☆182Updated this week
- Prioritized Experience Replay implementation with proportional prioritization☆84Updated 2 years ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆170Updated last year
- PyTorch implementation of GAIL and AIRL based on PPO.☆227Updated 4 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆188Updated 3 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆92Updated last year