lucidrains / q-transformer
Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google Deepmind
☆341Updated last month
Related projects ⓘ
Alternatives and complementary repositories for q-transformer
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆195Updated last week
- Really Fast End-to-End Jax RL Implementations☆717Updated 2 months ago
- LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.☆432Updated 2 weeks ago
- Multi-Agent Reinforcement Learning with JAX☆438Updated this week
- GPU-acceleration of Nocturne via Madrona☆227Updated this week
- ☆200Updated 9 months ago
- ☆223Updated last year
- Implementation of Dreamer v3 in pytorch.☆421Updated last month
- Efficient baselines for autocurricula in JAX.☆173Updated 2 months ago
- Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"☆463Updated 2 years ago
- Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"☆391Updated this week
- Code for "Learning to Model the World with Language." ICML 2024 Oral.☆363Updated last year
- ☆301Updated last year
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆203Updated 3 weeks ago
- An engine for high performance multi-agent environments with very large numbers of agents, along with a set of reference environments☆227Updated last week
- Online Decision Transformer☆238Updated 9 months ago
- PyTorch implementation of DreamerV2 model-based RL algorithm☆209Updated last year
- Datasets with baselines for offline multi-agent reinforcement learning.☆139Updated this week
- A collection of MARL benchmarks based on TorchRL☆276Updated last week
- General multi-task deep RL Agent☆164Updated 5 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆105Updated 2 months ago
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆219Updated 2 months ago
- 🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL☆230Updated 3 weeks ago
- SBX: Stable Baselines Jax (SB3 + Jax)☆337Updated last week
- Implementation of Trajectory Transformer with attention caching and batched beam search☆107Updated last year
- ☆66Updated 10 months ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆152Updated 4 months ago
- Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in Open…☆244Updated 2 years ago
- a simple and scalable agent for training adaptive policies with sequence-based RL☆89Updated this week