google-deepmind / disco_rlLinks
Accompanying code for "Discovering State-of-the-art Reinforcement Algorithms" Nature publication
☆579Updated last month
Alternatives and similar repositories for disco_rl
Users that are interested in disco_rl are comparing it to the libraries listed below
Sorting:
- Deep reinforcement learning without experience replay, target networks, or batch updates.☆272Updated 9 months ago
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆130Updated 6 months ago
- ☆205Updated last month
- [ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data☆100Updated last year
- Implementation of the new SOTA for model based RL, from the paper "Improving Transformer World Models for Data-Efficient RL", in Pytorch☆148Updated 8 months ago
- Awesome In-Context RL: A curated list of In-Context Reinforcement Learning - - —☆265Updated 4 months ago
- Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google Deepmind☆405Updated 6 months ago
- Unified Implementations of Offline Reinforcement Learning Algorithms☆184Updated 2 weeks ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆231Updated last month
- Online Decision Transformer☆274Updated last year
- Synchronized Curriculum Learning for RL Agents☆117Updated 2 months ago
- Simplest and Cleanest DreamerV3 implementation out there☆124Updated 9 months ago
- Implementations of Multi-Task and Meta-Learning baselines for the Metaworld benchmark☆29Updated 4 months ago
- ☆363Updated 2 years ago
- ☆73Updated 2 years ago
- ☆120Updated last month
- The official implementation of flow Q-learning (FQL)☆268Updated 5 months ago
- Implementation of Soft Actor Critic and some of its improvements in Pytorch☆63Updated last week
- Lecture slides for the MARL book (www.marl-book.com)☆145Updated 7 months ago
- off-policy RL on long sequences☆155Updated last week
- Official implementation of the RLC 2024 paper "Policy-Guided Diffusion"☆149Updated last year
- On-Policy Policy Gradient Algorithms in JAX☆42Updated last year
- [ICLR 2025] Learning Transformer-based World Models with Contrastive Predictive Coding (TWISTER)☆43Updated 9 months ago
- LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.☆666Updated 4 months ago
- Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.☆535Updated last month
- Reinforcement learning on general 2D physics environments in JAX. ICLR 2025 Oral.☆226Updated 2 months ago
- ☆414Updated last year
- ☆114Updated 10 months ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆87Updated 2 years ago
- A benchmark for offline goal-conditioned RL and offline RL☆304Updated 2 months ago