lucidrains / q-transformer
Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google Deepmind
☆406 · Jun 20, 2025 · Updated 7 months ago
Alternatives and similar repositories for q-transformer
Users who are interested in q-transformer are comparing it to the libraries listed below.
- Implementation of the Llama architecture with RLHF + Q-learning ☆170 · Feb 1, 2025 · Updated last year
- Implementation of Diffusion Policy, Toyota Research's supposed breakthrough in leveraging DDPMs for learning policies for real-world Robo… ☆134 · Jul 6, 2024 · Updated last year
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning ☆171 · Jul 7, 2024 · Updated last year
- High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC… ☆1,323 · Aug 3, 2023 · Updated 2 years ago
- [WIP] Transformer to embed Danbooru labelsets ☆13 · Mar 31, 2024 · Updated last year
- Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount… ☆53 · Oct 22, 2023 · Updated 2 years ago
- A curated list of Decision Transformer resources (continually updated) ☆871 · Dec 15, 2025 · Updated 2 months ago
- Challenging Memory-based Deep Reinforcement Learning Agents ☆109 · Oct 27, 2024 · Updated last year
- Simplex Random Feature attention, in PyTorch ☆76 · Oct 10, 2023 · Updated 2 years ago
- Implementation of Infini-Transformer in Pytorch ☆112 · Jan 4, 2025 · Updated last year
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations" ☆57 · Mar 26, 2024 · Updated last year
- Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling. ☆2,769 · Apr 29, 2024 · Updated last year
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS… ☆74 · Oct 18, 2022 · Updated 3 years ago
- Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch ☆91 · Dec 22, 2023 · Updated 2 years ago
- An index of algorithms for offline reinforcement learning (offline-rl) ☆1,048 · May 23, 2024 · Updated last year
- Implementation of the transformer proposed in "Building Blocks for a Complex-Valued Transformer Architecture" ☆88 · Oct 13, 2023 · Updated 2 years ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ☆31 · May 22, 2024 · Updated last year
- [ECCV2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations. ☆96 · May 21, 2023 · Updated 2 years ago
- Implementation of RT1 (Robotic Transformer) in Pytorch ☆446 · Oct 6, 2024 · Updated last year
- Implementation of a holodeck, written in Pytorch ☆18 · Nov 1, 2023 · Updated 2 years ago
- ☆48 · Jul 22, 2024 · Updated last year
- [ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model" ☆119 · Feb 11, 2025 · Updated last year
- A collection of reference environments for offline reinforcement learning ☆1,646 · Nov 18, 2024 · Updated last year
- MR.Q is a general-purpose model-free reinforcement learning algorithm. ☆143 · Jun 23, 2025 · Updated 7 months ago
- A swarm of LLM agents that will help you test, document, and productionize your code! ☆16 · Feb 7, 2026 · Updated last week
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg… ☆47 · Jul 27, 2023 · Updated 2 years ago
- Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI ☆293 · Jun 3, 2025 · Updated 8 months ago
- Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control" ☆758 · May 21, 2025 · Updated 8 months ago
- Online Decision Transformer ☆274 · Jan 22, 2024 · Updated 2 years ago
- Implementation of Soft MoE, proposed by Brain's Vision team, in Pytorch ☆344 · Apr 2, 2025 · Updated 10 months ago
- Public-facing code originally developed by the AI Institute for deploying RL development code on our robot (this is part of the effort to… ☆34 · Jun 5, 2024 · Updated last year
- Implementation of the algorithm detailed in paper "Evolutionary design of molecules based on deep learning and a genetic algorithm" ☆24 · Dec 15, 2023 · Updated 2 years ago
- ☆29 · Oct 3, 2023 · Updated 2 years ago
- This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Re… ☆651 · Nov 29, 2024 · Updated last year
- Brainwave is a state-of-the-art neural decoder that transforms electroencephalogram (EEG) and brain signals into multimodal outputs inclu… ☆14 · Oct 6, 2025 · Updated 4 months ago
- Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24) ☆17 · Feb 10, 2024 · Updated 2 years ago
- An implementation of PPO in Pytorch ☆106 · Jan 7, 2026 · Updated last month
- Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-experts ☆123 · Oct 17, 2024 · Updated last year
- Implementation of AutoRT: "AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents" ☆42 · Nov 11, 2024 · Updated last year