facebookresearch / online-dtLinks
Online Decision Transformer
☆267Updated last year
Alternatives and similar repositories for online-dt
Users that are interested in online-dt are comparing it to the libraries listed below
Sorting:
- Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in Open…☆279Updated 3 years ago
- Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"☆516Updated 2 years ago
- ☆282Updated 3 years ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆163Updated last year
- Code for conservative Q-learning☆455Updated 3 years ago
- ☆349Updated 2 years ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆373Updated 3 years ago
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆211Updated last year
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"☆180Updated 2 years ago
- A collection of offline reinforcement learning algorithms.☆196Updated 9 months ago
- ☆114Updated 2 years ago
- Author's PyTorch implementation of TD7 for online and offline RL☆148Updated 2 years ago
- An elegant PyTorch offline reinforcement learning library for researchers.☆358Updated 2 months ago
- ☆237Updated 10 months ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆141Updated last year
- Implementation of Trajectory Transformer with attention caching and batched beam search☆116Updated 2 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆180Updated 3 years ago
- Official code repository for Prompt-DT.☆115Updated 3 years ago
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022☆331Updated last year
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆165Updated last year
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.☆358Updated 2 years ago
- Datasets with baselines for Offline MARL.☆178Updated 3 weeks ago
- ☆201Updated 2 years ago
- Multi Task RL Baselines☆250Updated 3 years ago
- PyTorch implementation of DreamerV2 model-based RL algorithm☆227Updated 2 years ago
- NeurIPS 2023: Safe Policy Optimization: A benchmark repository for safe reinforcement learning algorithms☆376Updated last year
- Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.☆130Updated 3 years ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆187Updated last year
- Paper Collection for Batch RL with brief introductions.☆84Updated 3 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆188Updated 3 years ago