lucidrains / q-transformer
Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google Deepmind
☆362Updated this week
Alternatives and similar repositories for q-transformer:
Users that are interested in q-transformer are comparing it to the libraries listed below
- Really Fast End-to-End Jax RL Implementations☆808Updated 5 months ago
- Deep reinforcement learning without experience replay, target networks, or batch updates.☆212Updated 2 weeks ago
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆211Updated 3 months ago
- ☆259Updated 2 years ago
- Online Decision Transformer☆248Updated last year
- ☆327Updated last year
- Implementation of Dreamer v3 in pytorch.☆481Updated 4 months ago
- LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.☆504Updated 3 months ago
- ☆213Updated 2 months ago
- Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"☆478Updated 2 years ago
- Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in Open…☆256Updated 2 years ago
- Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"☆444Updated last week
- General multi-task deep RL Agent☆176Updated 8 months ago
- Multi-Agent Reinforcement Learning with JAX☆499Updated this week
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆275Updated this week
- Datasets with baselines for offline multi-agent reinforcement learning.☆160Updated last week
- PyTorch implementation of DreamerV2 model-based RL algorithm☆215Updated last year
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆154Updated 7 months ago
- a simple and scalable agent for training adaptive policies with sequence-based RL☆110Updated last week
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆243Updated 5 months ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆137Updated 2 months ago
- ☆71Updated last year
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆187Updated 5 months ago
- Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".☆355Updated 8 months ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆165Updated 7 months ago
- A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities☆342Updated 2 weeks ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆109Updated 5 months ago
- Official implementation of the RLC 2024 paper "Policy-Guided Diffusion"☆129Updated 6 months ago
- A collection of MARL benchmarks based on TorchRL☆341Updated this week
- An API conversion tool for popular external reinforcement learning environments☆151Updated last month