lucidrains / q-transformerLinks
Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google Deepmind
☆406Updated 7 months ago
Alternatives and similar repositories for q-transformer
Users that are interested in q-transformer are comparing it to the libraries listed below
Sorting:
- Deep reinforcement learning without experience replay, target networks, or batch updates.☆273Updated 10 months ago
- LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.☆668Updated 5 months ago
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆243Updated last month
- Code for "Learning to Model the World with Language." ICML 2024 Oral.☆415Updated 2 weeks ago
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆131Updated 7 months ago
- [ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data☆100Updated last year
- Distributed Reinforcement Learning accelerated by Lightning Fabric☆418Updated last week
- Accompanying code for "Discovering State-of-the-art Reinforcement Algorithms" Nature publication☆618Updated last month
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆275Updated 2 months ago
- Online Decision Transformer☆273Updated 2 years ago
- ☆73Updated 2 years ago
- ☆249Updated last year
- General multi-task deep RL Agent☆185Updated last year
- off-policy RL on long sequences☆155Updated this week
- Efficient baselines for autocurricula in JAX.☆206Updated last year
- Really Fast End-to-End Jax RL Implementations☆1,013Updated last year
- Simple single-file baselines for Q-Learning in pure-GPU setting☆232Updated 2 months ago
- ☆122Updated 2 months ago
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆364Updated 6 months ago
- Reinforcement learning on general 2D physics environments in JAX. ICLR 2025 Oral.☆226Updated last week
- An API conversion tool for popular external reinforcement learning environments☆201Updated last month
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆204Updated last year
- ☆363Updated 2 years ago
- Implementation of Soft Actor Critic and some of its improvements in Pytorch☆64Updated 3 weeks ago
- A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities☆480Updated 2 weeks ago
- Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"☆526Updated 3 years ago
- Multi-Agent Reinforcement Learning with JAX☆726Updated last week
- Kolmogorov-Arnold Network for Reinforcement Leaning, initial experiments☆296Updated 9 months ago
- Simplest and Cleanest DreamerV3 implementation out there☆127Updated 10 months ago
- Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in Open…☆286Updated 3 years ago