lucidrains / q-transformer
Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google DeepMind
☆374 Updated 2 months ago
Alternatives and similar repositories for q-transformer:
Users who are interested in q-transformer are comparing it to the libraries listed below.
- LeanRL is a fork of CleanRL, where selected PyTorch scripts are optimized for performance using compile and cudagraphs. ☆559 Updated 6 months ago
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs). ☆228 Updated 6 months ago
- Code for "Learning to Model the World with Language." ICML 2024 Oral. ☆384 Updated last year
- Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem" ☆495 Updated 2 years ago
- Deep reinforcement learning without experience replay, target networks, or batch updates. ☆238 Updated last month
- ☆340 Updated 2 years ago
- Online Decision Transformer ☆258 Updated last year
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight. ☆311 Updated 2 months ago
- ☆228 Updated 5 months ago
- ☆280 Updated 2 years ago
- Really Fast End-to-End Jax RL Implementations ☆867 Updated 8 months ago
- MR.Q is a general-purpose model-free reinforcement learning algorithm. ☆91 Updated last month
- General multi-task deep RL Agent ☆181 Updated 11 months ago
- a simple and scalable agent for training adaptive policies with sequence-based RL ☆119 Updated last week
- Implementation of Soft Actor Critic and some of its improvements in PyTorch ☆56 Updated 2 months ago
- We perform functional grounding of LLMs' knowledge in BabyAI-Text ☆256 Updated 8 months ago
- Implementation of Dreamer v3 in PyTorch. ☆550 Updated 7 months ago
- Official implementation of the RLC 2024 paper "Policy-Guided Diffusion" ☆135 Updated 9 months ago
- [ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data ☆81 Updated 9 months ago
- Efficient baselines for autocurricula in JAX. ☆187 Updated 8 months ago
- Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in Open… ☆269 Updated 2 years ago
- Kolmogorov-Arnold Network for Reinforcement Learning, initial experiments ☆279 Updated last month
- Implementation of Trajectory Transformer with attention caching and batched beam search ☆111 Updated 2 years ago
- Multi-Agent Reinforcement Learning with JAX ☆571 Updated last week
- Simple single-file baselines for Q-Learning in pure-GPU setting ☆161 Updated last month
- An engine for high performance multi-agent environments with very large numbers of agents, along with a set of reference environments ☆283 Updated 2 months ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted) ☆163 Updated last year
- Distributed Reinforcement Learning accelerated by Lightning Fabric ☆367 Updated this week
- Official codebase for the paper "Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping". ☆365 Updated 10 months ago
- Implementation of the new SOTA for model-based RL, from the paper "Improving Transformer World Models for Data-Efficient RL", in PyTorch ☆115 Updated last week