lucidrains / q-transformerLinks
Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google Deepmind
☆389Updated 3 weeks ago
Alternatives and similar repositories for q-transformer
Users that are interested in q-transformer are comparing it to the libraries listed below
Sorting:
- Deep reinforcement learning without experience replay, target networks, or batch updates.☆256Updated 3 months ago
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆234Updated 8 months ago
- LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.☆602Updated 8 months ago
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆104Updated 3 weeks ago
- [ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data☆89Updated 11 months ago
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆322Updated last week
- Code for "Learning to Model the World with Language." ICML 2024 Oral.☆388Updated last year
- General multi-task deep RL Agent☆183Updated last year
- Online Decision Transformer☆261Updated last year
- ☆234Updated 7 months ago
- Distributed Reinforcement Learning accelerated by Lightning Fabric☆380Updated last week
- ☆73Updated last year
- Implementation of Dreamer v3 in pytorch.☆592Updated 9 months ago
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆266Updated 10 months ago
- Reinforcement learning on general 2D physics environments in JAX. ICLR 2025 Oral.☆194Updated last week
- Efficient World Models with Context-Aware Tokenization. ICML 2024☆105Updated 9 months ago
- ☆342Updated 2 years ago
- Efficient baselines for autocurricula in JAX.☆189Updated 10 months ago
- a simple and scalable agent for training adaptive policies with sequence-based RL☆131Updated last week
- Multi-Agent Reinforcement Learning with JAX☆605Updated last week
- A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities☆408Updated last month
- Really Fast End-to-End Jax RL Implementations☆908Updated 10 months ago
- ☆298Updated 2 years ago
- Intrinsic Motivation from Artificial Intelligence Feedback☆129Updated last year
- Official implementation of the RLC 2024 paper "Policy-Guided Diffusion"☆137Updated 11 months ago
- Implementation of Soft Actor Critic and some of its improvements in Pytorch☆60Updated 5 months ago
- Unofficial Gato: A Generalist Agent☆214Updated last year
- Simple single-file baselines for Q-Learning in pure-GPU setting☆173Updated 3 months ago
- Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in Open…☆276Updated 3 years ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆183Updated last year