google-deepmind / disco_rlLinks
Accompanying code for "Discovering State-of-the-art Reinforcement Algorithms" Nature publication
☆410Updated last week
Alternatives and similar repositories for disco_rl
Users that are interested in disco_rl are comparing it to the libraries listed below
Sorting:
- Deep reinforcement learning without experience replay, target networks, or batch updates.☆274Updated 8 months ago
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆128Updated 5 months ago
- [ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data☆99Updated last year
- Implementations of Multi-Task and Meta-Learning baselines for the Metaworld benchmark☆28Updated 3 months ago
- Implementation of the new SOTA for model based RL, from the paper "Improving Transformer World Models for Data-Efficient RL", in Pytorch☆145Updated 7 months ago
- ☆73Updated last year
- Online Decision Transformer☆274Updated last year
- Synchronized Curriculum Learning for RL Agents☆116Updated last month
- Awesome In-Context RL: A curated list of In-Context Reinforcement Learning - - —☆260Updated 3 months ago
- ☆88Updated 2 years ago
- Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.☆528Updated 3 weeks ago
- CivRealm is an interactive environment for the open-source strategy game Freeciv-web based on Freeciv, a Civilization-inspired game.☆134Updated last year
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆62Updated 2 years ago
- ☆244Updated last year
- ☆117Updated 2 weeks ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆42Updated last year
- off-policy RL on long sequences☆154Updated 4 months ago
- ☆155Updated 2 weeks ago
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆276Updated last month
- ☆363Updated 2 years ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆87Updated 2 years ago
- Simplest and Cleanest DreamerV3 implementation out there☆121Updated 8 months ago
- Unified Implementations of Offline Reinforcement Learning Algorithms☆181Updated this week
- Simple single-file baselines for Q-Learning in pure-GPU setting☆228Updated 3 weeks ago
- Datasets with baselines for Offline MARL.☆191Updated last month
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆118Updated last year
- Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google Deepmind☆405Updated 5 months ago
- Implementation of Soft Actor Critic and some of its improvements in Pytorch☆60Updated 10 months ago
- Repo for Implicit Diffusion Q-Learning☆119Updated 2 years ago
- [ICLR 2025] Learning Transformer-based World Models with Contrastive Predictive Coding (TWISTER)☆42Updated 9 months ago