lucidrains / q-transformer
Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google DeepMind
☆368Updated last month
Alternatives and similar repositories for q-transformer:
Users who are interested in q-transformer are comparing it to the libraries listed below:
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆222Updated 4 months ago
- Deep reinforcement learning without experience replay, target networks, or batch updates.☆225Updated this week
- Really Fast End-to-End Jax RL Implementations☆841Updated 6 months ago
- LeanRL is a fork of CleanRL, where selected PyTorch scripts are optimized for performance using compile and cudagraphs.☆524Updated 4 months ago
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆294Updated last month
- ☆333Updated last year
- Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"☆485Updated 2 years ago
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆249Updated 7 months ago
- Multi-Agent Reinforcement Learning with JAX☆541Updated 2 weeks ago
- Implementation of Dreamer v3 in PyTorch.☆511Updated 5 months ago
- ☆268Updated 2 years ago
- Code for "Learning to Model the World with Language." ICML 2024 Oral.☆379Updated last year
- Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in Open…☆264Updated 2 years ago
- ☆217Updated 4 months ago
- Online Decision Transformer☆251Updated last year
- General multi-task deep RL Agent☆178Updated 9 months ago
- [ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data☆72Updated 7 months ago
- Reinforcement learning on general 2D physics environments in JAX. ICLR 2025 Oral.☆170Updated this week
- Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"☆475Updated 3 weeks ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆160Updated 8 months ago
- A simple and scalable agent for training adaptive policies with sequence-based RL☆114Updated last month
- Official implementation of the RLC 2024 paper "Policy-Guided Diffusion"☆130Updated 8 months ago
- PyTorch implementation of DreamerV2 model-based RL algorithm☆216Updated last year
- A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities☆362Updated this week
- Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools.☆698Updated this week
- ☆79Updated 9 months ago
- Simple single-file baselines for Q-Learning in a pure-GPU setting☆149Updated this week
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆170Updated 9 months ago
- 🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL☆300Updated this week