lucidrains / q-transformer
Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google Deepmind
☆371Updated 2 months ago
Alternatives and similar repositories for q-transformer:
Users that are interested in q-transformer are comparing it to the libraries listed below
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆226Updated 5 months ago
- Deep reinforcement learning without experience replay, target networks, or batch updates.☆233Updated last month
- ☆271Updated 2 years ago
- ☆223Updated 4 months ago
- Implementation of Dreamer v3 in pytorch.☆536Updated 6 months ago
- Really Fast End-to-End Jax RL Implementations☆859Updated 7 months ago
- LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.☆538Updated 5 months ago
- Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"☆503Updated last week
- ☆338Updated last year
- A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities☆376Updated this week
- Official implementation of the RLC 2024 paper "Policy-Guided Diffusion"☆135Updated 8 months ago
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆85Updated last week
- Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in Open…☆267Updated 2 years ago
- Online Decision Transformer☆258Updated last year
- PyTorch implementation of DreamerV2 model-based RL algorithm☆216Updated last year
- ☆264Updated 3 years ago
- Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"☆491Updated 2 years ago
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆299Updated last month
- a simple and scalable agent for training adaptive policies with sequence-based RL☆118Updated last week
- Simple single-file baselines for Q-Learning in pure-GPU setting☆153Updated 3 weeks ago
- Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".☆365Updated 10 months ago
- SBX: Stable Baselines Jax (SB3 + Jax) RL algorithms☆429Updated 3 weeks ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆174Updated 10 months ago
- Multi-Agent Reinforcement Learning with JAX☆559Updated last week
- General multi-task deep RL Agent☆179Updated 10 months ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆86Updated 2 years ago
- Distributed Reinforcement Learning accelerated by Lightning Fabric☆361Updated this week
- An engine for high performance multi-agent environments with very large numbers of agents, along with a set of reference environments☆274Updated last month
- Datasets with baselines for offline multi-agent reinforcement learning.☆162Updated last week
- Code for "Learning to Model the World with Language." ICML 2024 Oral.☆382Updated last year