lucidrains / q-transformerLinks
Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google Deepmind
☆382Updated this week
Alternatives and similar repositories for q-transformer
Users that are interested in q-transformer are comparing it to the libraries listed below
Sorting:
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆234Updated 7 months ago
- Online Decision Transformer☆259Updated last year
- LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.☆597Updated 8 months ago
- ☆289Updated 2 years ago
- ☆343Updated 2 years ago
- Code for "Learning to Model the World with Language." ICML 2024 Oral.☆388Updated last year
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆263Updated 10 months ago
- Really Fast End-to-End Jax RL Implementations☆898Updated 9 months ago
- ☆232Updated 7 months ago
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆104Updated 2 months ago
- Deep reinforcement learning without experience replay, target networks, or batch updates.☆254Updated 3 months ago
- Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"☆502Updated 2 years ago
- Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in Open…☆275Updated 3 years ago
- Implementation of Dreamer v3 in pytorch.☆577Updated 8 months ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆163Updated 11 months ago
- Distributed Reinforcement Learning accelerated by Lightning Fabric☆380Updated 2 weeks ago
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆319Updated last month
- General multi-task deep RL Agent☆184Updated last year
- Implementation of the new SOTA for model based RL, from the paper "Improving Transformer World Models for Data-Efficient RL", in Pytorch☆126Updated last month
- ☆271Updated 3 years ago
- Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"☆569Updated last month
- A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities☆398Updated 2 weeks ago
- Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.☆434Updated 9 months ago
- PyTorch implementation of DreamerV2 model-based RL algorithm☆219Updated 2 years ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆180Updated last year
- ☆264Updated 2 years ago
- Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".☆368Updated last year
- Multi-Agent Reinforcement Learning with JAX☆599Updated 3 weeks ago
- This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Re…☆575Updated 6 months ago
- a simple and scalable agent for training adaptive policies with sequence-based RL☆128Updated 3 weeks ago