DAVIAN-Robotics/SimbaV2

Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"

☆108

Alternatives and similar repositories for SimbaV2

Users who are interested in SimbaV2 compare it to the repositories listed below.

SonyResearch / simba
☆128 · Feb 25, 2025 · Updated last year
facebookresearch / MRQ
MR.Q is a general-purpose model-free reinforcement learning algorithm.
☆153 · Apr 7, 2026 · Updated 3 months ago
lilucse / SparseNetwork4DRL
[ICML 2025 oral] Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning
☆41 · Jun 5, 2025 · Updated last year
naumix / BiggerRegularizedOptimistic
Official implementation of the BRO algorithm
☆61 · Jan 29, 2025 · Updated last year
naumix / BiggerRegularizedCategorical
☆17 · Apr 23, 2026 · Updated 2 months ago
roger-creus / stable-deep-rl-at-scale
Code for the paper "Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning". Great performance in many environments…
☆39 · Oct 24, 2025 · Updated 8 months ago
RLE-Foundation / Plasticine
Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning.
☆44 · Feb 9, 2026 · Updated 5 months ago
danielpalenicek / xqc
Official code release for ICLR26 "XQC: Well-conditioned Optimization Accelerates Deep Reinforcement Learning"
☆30 · Jun 3, 2026 · Updated last month
younggyoseo / FastTD3
☆456 · May 16, 2026 · Updated 2 months ago
seohongpark / horizon-reduction
The official implementation of "Horizon Reduction Makes RL Scalable"
☆200 · Aug 2, 2025 · Updated 11 months ago
nicklashansen / newt
Official code repository for the paper "Learning Massively Multitask World Models for Continuous Control".
☆128 · Jan 9, 2026 · Updated 6 months ago
Viraj-Joshi / MTBench
☆45 · Jul 1, 2026 · Updated 3 weeks ago
isaac7778 / FIRE
Code for the paper "FIRE: Frobenius-Isometry Reinitialization for Balancing the Stability–Plasticity Tradeoff" (ICLR 2026 Oral)
☆29 · Apr 27, 2026 · Updated 2 months ago
Holiday-Robot / FlashSAC
FlashSAC: Fast and Stable Off-Policy Reinforcement Learning for High-Dimensional Robot Control
☆389 · Apr 9, 2026 · Updated 3 months ago
AlexGoldie / learn-rl-algorithms
Official implementation for "How Should We Meta-Learn Reinforcement Learning Algorithms?"
☆23 · Sep 7, 2025 · Updated 10 months ago
cvoelcker / reppo
Official Code for "Relative Entropy Pathwise Policy Optimization"
☆59 · May 6, 2026 · Updated 2 months ago
OffDynamicsRL / off-dynamics-rl
☆65 · Jan 30, 2026 · Updated 5 months ago
wertyuilife2 / bmpc
[ICLR 2025] Bootstrapped Model Predictive Control
☆39 · Jul 21, 2025 · Updated last year
nakamotoo / Cal-QL
official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)
☆123 · Jul 31, 2024 · Updated last year
pd-perry / TQL
☆28 · May 11, 2026 · Updated 2 months ago
nicklashansen / tdmpc2
Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"
☆897 · Jul 13, 2026 · Updated last week
Guozheng-Ma / Adaptive-Replay-Ratio
[ICLR 2024] Adaptive Replay Ratio implementation from 'Revisiting Plasticity in Visual RL: Data, Modules and Training Stages'.
☆13 · Oct 9, 2024 · Updated last year
mttga / purejaxql
Simple single-file baselines for Q-Learning in pure-GPU setting
☆242 · Nov 24, 2025 · Updated 7 months ago
dojeon-ai / Atari-PB
Official repository for "Investigating Pre-Training Objectives for Generalization in Visual Reinforcement Learning" (ICML 2024)
☆11 · Sep 16, 2025 · Updated 10 months ago
MichaelTMatthews / Craftax
(Crafter + NetHack) in JAX. ICML 2024 Spotlight.
☆424 · Jun 20, 2026 · Updated last month
joonleesky / train-procgen-pytorch
Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.
☆31 · Sep 10, 2020 · Updated 5 years ago
adityab / CrossQ
Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"
☆95 · Jun 4, 2024 · Updated 2 years ago
chongyi-zheng / value-flows
The official implementation of Value Flows
☆55 · Feb 27, 2026 · Updated 4 months ago
dojeon-ai / SimTPR
Code for the paper "On the Importance of Feature Decorrelation for Unsupervised Representation Learning for RL" (ICML 2023)
☆12 · Jun 13, 2023 · Updated 3 years ago
seohongpark / fql
The official implementation of flow Q-learning (FQL)
☆320 · Jul 21, 2025 · Updated last year
XuGW-Kevin / DrM
DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements …
☆78 · Feb 19, 2026 · Updated 5 months ago
EmptyJackson / unifloral
Unified Implementations of Offline Reinforcement Learning Algorithms
☆224 · Dec 19, 2025 · Updated 7 months ago
seohongpark / ogbench
A benchmark for offline goal-conditioned RL and offline RL
☆438 · Jan 14, 2026 · Updated 6 months ago
tinker495 / jax-baseline
Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin…
☆67 · Updated this week
typoverflow / flow-rl
Flow RL is a high-performance RL library with flow and diffusion models.
☆42 · Jun 16, 2026 · Updated last month
ColinQiyangLi / qc
☆392 · Feb 5, 2026 · Updated 5 months ago
mohmdelsayed / streaming-drl
Deep reinforcement learning without experience replay, target networks, or batch updates.
☆292 · Mar 18, 2025 · Updated last year
mahaitongdae / diffusion_policy_online_rl
[ICML2025] Official implementation of Efficient Online Reinforcement Learning for Diffusion Policies appearing in ICML 2025.
☆59 · Apr 25, 2026 · Updated 2 months ago
aalmuzairee / dmcgb2
Official release of the DMControl Generalization Benchmark 2 (DMC-GB2)
☆22 · Jul 21, 2025 · Updated last year