lucidrains / q-transformerLinks

Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google Deepmind

☆391

Alternatives and similar repositories for q-transformer

Users that are interested in q-transformer are comparing it to the libraries listed below

Sorting:

mohmdelsayed / streaming-drl
Deep reinforcement learning without experience replay, target networks, or batch updates.
☆259Updated 4 months ago
flowersteam / lamorel
Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).
☆236Updated 9 months ago
jlin816 / dynalang
Code for "Learning to Model the World with Language." ICML 2024 Oral.
☆387Updated last year
flowersteam / Grounding_LLMs_with_online_RL
We perform functional grounding of LLMs' knowledge in BabyAI-Text
☆268Updated 11 months ago
openrlbenchmark / openrlbenchmark
☆235Updated 8 months ago
mlech26l / gigastep
☆73Updated last year
facebookresearch / online-dt
Online Decision Transformer
☆263Updated last year
facebookresearch / MRQ
MR.Q is a general-purpose model-free reinforcement learning algorithm.
☆106Updated last month
pytorch-labs / LeanRL
LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.
☆608Updated 9 months ago
huggingface / jat
General multi-task deep RL Agent
☆183Updated last year
FLAIROx / Kinetix
Reinforcement learning on general 2D physics environments in JAX. ICLR 2025 Oral.
☆200Updated 2 weeks ago
Shengjiewang-Jason / EfficientZeroV2
[ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data
☆91Updated 11 months ago
UT-Austin-RPL / amago
off-policy RL on long sequences
☆133Updated this week
MichaelTMatthews / Craftax
(Crafter + NetHack) in JAX. ICML 2024 Spotlight.
☆330Updated last month
FLAIROx / JaxMARL
Multi-Agent Reinforcement Learning with JAX
☆611Updated last month
anuragajay / decision-diffuser
☆343Updated 2 years ago
facebookresearch / minimax
Efficient baselines for autocurricula in JAX.
☆191Updated 11 months ago
Eclectic-Sheep / sheeprl
Distributed Reinforcement Learning accelerated by Lightning Fabric
☆382Updated last week
DHDev0 / Muzero-unplugged
Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…
☆31Updated last month
mttga / purejaxql
Simple single-file baselines for Q-Learning in pure-GPU setting
☆174Updated 4 months ago
ikostrikov / rlpd
☆307Updated 2 years ago
weipu-zhang / STORM
☆93Updated last year
nikhilbarhate99 / min-decision-transformer
Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in Open…
☆278Updated 3 years ago
facebookresearch / motif
Intrinsic Motivation from Artificial Intelligence Feedback
☆130Updated last year
vmicheli / delta-iris
Efficient World Models with Context-Aware Tokenization. ICML 2024
☆105Updated 10 months ago
NM512 / dreamerv3-torch
Implementation of Dreamer v3 in pytorch.
☆616Updated 10 months ago
Farama-Foundation / Minari
A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities
☆414Updated 3 weeks ago
InexperiencedMe / NaturalDreamer
Simplest and Cleanest DreamerV3 implementation out there
☆80Updated 4 months ago
MarcoMeter / episodic-transformer-memory-ppo
Clean baseline implementation of PPO using an episodic TransformerXL memory
☆183Updated last year
EmptyJackson / policy-guided-diffusion
Official implementation of the RLC 2024 paper "Policy-Guided Diffusion"
☆137Updated last year