google-deepmind/disco_rl

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/google-deepmind/disco_rl)

google-deepmind / disco_rl

Accompanying code for "Discovering State-of-the-art Reinforcement Algorithms" Nature publication

☆714

Alternatives and similar repositories for disco_rl

Users that are interested in disco_rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

EmptyJackson / unifloral
View on GitHub
Unified Implementations of Offline Reinforcement Learning Algorithms
☆223Dec 19, 2025Updated 6 months ago
FLAIROx / popjym
View on GitHub
POPGym Library in JAX
☆14Apr 15, 2024Updated 2 years ago
Asap7772 / OfflineRlWorkflow
View on GitHub
This repository accompanies the following paper: A Workflow for Offline Model-Free Robotic RL
☆13Nov 5, 2021Updated 4 years ago
isarlab-department-engineering / d-vat
View on GitHub
Code Repositoritory of the paper D-VAT: End-to-End Visual Active Tracking for Micro Aerial Vehicles
☆19Dec 23, 2025Updated 6 months ago
ethanluoyc / corax
View on GitHub
Corax: Core RL in JAX
☆42Feb 22, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
AQ-MedAI / MrlX
View on GitHub
MrlX: A Multi-Agent Reinforcement Learning Framework
☆211Jan 19, 2026Updated 5 months ago
instadeepai / qd-skill-discovery-benchmark
View on GitHub
Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery
☆17Apr 2, 2026Updated 3 months ago
jinpz / q_sharp
View on GitHub
The official code release for Q#: Provably Optimal Distributional RL for LLM Post-Training
☆20Mar 4, 2025Updated last year
Princeton-RL / normalising-flows-4-reinforcement-learning
View on GitHub
Code for the paper Normalizing Flows are Capable Models for RL
☆20Jun 3, 2025Updated last year
SonyResearch / simba
View on GitHub
☆128Feb 25, 2025Updated last year
roger-creus / ale-nl
View on GitHub
A framework for evaluating LLMs in Atari games
☆15Apr 21, 2025Updated last year
mttga / purejaxql
View on GitHub
Simple single-file baselines for Q-Learning in pure-GPU setting
☆243Nov 24, 2025Updated 7 months ago
mohmdelsayed / streaming-drl
View on GitHub
Deep reinforcement learning without experience replay, target networks, or batch updates.
☆290Mar 18, 2025Updated last year
DAVIAN-Robotics / SimbaV2
View on GitHub
Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"
☆106Nov 4, 2025Updated 8 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
allenai / fluid-benchmarking
View on GitHub
Fluid Language Model Benchmarking
☆29Sep 16, 2025Updated 9 months ago
MarquisDarwin / EAWM
View on GitHub
[ICLR 2026] From Observations to Events: Event-Aware World Models for Reinforcement Learning
☆48May 30, 2026Updated last month
rosieyzh / openrlhf-pretrain
View on GitHub
Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"
☆29Oct 14, 2025Updated 8 months ago
instadeepai / og-marl
View on GitHub
Datasets with baselines for Offline MARL.
☆218Nov 2, 2025Updated 8 months ago
PeideHuang / gradient
View on GitHub
Code base for NeurIPS 2022 paper Curriculum Reinforcement Learning using Optimal Transport via Gradual Domain Adaptation.
☆11Aug 21, 2023Updated 2 years ago
arnavkj1995 / SFM
View on GitHub
Official Implementation of SFM and the baselines in Jax.
☆21May 31, 2025Updated last year
tajwarfahim / maxrl
View on GitHub
Official Implementation of "Maximum Likelihood Reinforcement Learning (MaxRL)"
☆192May 28, 2026Updated last month
DramaCow / jaxued
View on GitHub
☆98Jan 21, 2026Updated 5 months ago
seohongpark / ogbench
View on GitHub
A benchmark for offline goal-conditioned RL and offline RL
☆427Jan 14, 2026Updated 5 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
zhougroup / IDAC
View on GitHub
Implicit Distributional Actor Critic
☆11Dec 8, 2021Updated 4 years ago
nissymori / JAX-CORL
View on GitHub
Clean single-file implementation of offline RL algorithms in JAX
☆182Jun 5, 2026Updated last month
enjeeneer / zero-shot-rl
View on GitHub
VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)
☆29Jan 14, 2025Updated last year
OffDynamicsRL / off-dynamics-rl
View on GitHub
☆65Jan 30, 2026Updated 5 months ago
askolik / quantum_agents
View on GitHub
Code for Q-learning with parametrized quantum circuits in OpenAI Gym environments.
☆14Nov 12, 2021Updated 4 years ago
AutonomousAgentsLab / curiousreplay
View on GitHub
Implementations of Curious Replay for model-based adaptation.
☆43Jul 5, 2023Updated 3 years ago
Farama-Foundation / Minari
View on GitHub
A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities
☆518Updated this week
polixir / NeoRL2
View on GitHub
☆20Oct 27, 2025Updated 8 months ago
Selinaee / FPGA_Gym
View on GitHub
☆21Dec 3, 2025Updated 7 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
marcelbinz / meta-learned-models
View on GitHub
☆13Mar 21, 2023Updated 3 years ago
jaehyeon-son / dicp
View on GitHub
Official implementation for ICLR 2025 paper "Distilling Reinforcement Learning Algorithms for In-Context Model-Based Planning"
☆20Mar 5, 2025Updated last year
iit-DLSLab / basic-locomotion-isaaclab
View on GitHub
An IsaacLab DirectEnv for basic quadrupedal locomotion tasks, with support for multiple quadruped robots, sim-to-sim, and sim-to-real pip…
☆96Updated this week
michaelyuancb / motiontrans-pi0
View on GitHub
Pi0-VLA Repository of "MotionTrans: Human VR Data Enable Motion-Level Learning for Robotic Manipulation Policies"
☆27Mar 9, 2026Updated 3 months ago
RLE-Foundation / Plasticine
View on GitHub
Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning.
☆43Feb 9, 2026Updated 4 months ago
seawee1 / driver-dojo
View on GitHub
A benchmark towards generalizable reinforcement learning for autonomous driving.
☆90Oct 10, 2023Updated 2 years ago
sail-sg / offbench
View on GitHub
☆16Jun 1, 2023Updated 3 years ago