SonyResearch/simba

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/SonyResearch/simba)

SonyResearch / simba

☆128

Alternatives and similar repositories for simba

Users that are interested in simba are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

DAVIAN-Robotics / SimbaV2
View on GitHub
Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"
☆108Nov 4, 2025Updated 8 months ago
dojeon-ai / Atari-PB
View on GitHub
Official repository for "Investigating Pre-Training Objectives for Generalization in Visual Reinforcement Learning" (ICML 2024)
☆11Sep 16, 2025Updated 10 months ago
naumix / BiggerRegularizedOptimistic
View on GitHub
Official implementation of the BRO algorithm
☆61Jan 29, 2025Updated last year
dojeon-ai / SimTPR
View on GitHub
Code for the paper "On the Importance of Feature Decorrelation for Unsupervised Representation Learning for RL" (ICML 2023)
☆12Jun 13, 2023Updated 3 years ago
roger-creus / stable-deep-rl-at-scale
View on GitHub
Code for the paper "Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning". Great performance in many environments…
☆39Oct 24, 2025Updated 8 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
lilucse / SparseNetwork4DRL
View on GitHub
[ICML 2025 oral] Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning
☆41Jun 5, 2025Updated last year
AlexGoldie / learn-rl-algorithms
View on GitHub
Official implementation for "How Should We Meta-Learn Reinforcement Learning Algorithms?"
☆23Sep 7, 2025Updated 10 months ago
mttga / purejaxql
View on GitHub
Simple single-file baselines for Q-Learning in pure-GPU setting
☆242Nov 24, 2025Updated 7 months ago
facebookresearch / MRQ
View on GitHub
MR.Q is a general-purpose model-free reinforcement learning algorithm.
☆153Apr 7, 2026Updated 3 months ago
RLE-Foundation / Plasticine
View on GitHub
Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning.
☆44Feb 9, 2026Updated 5 months ago
nakamotoo / Cal-QL
View on GitHub
official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)
☆123Jul 31, 2024Updated last year
dojeon-ai / PLASTIC
View on GitHub
Code for the paper "PLASTIC: Improving Input and Label Plasticity for Sample Efficient Reinforcement Learning" (NeurIPS 2023)
☆23Dec 8, 2023Updated 2 years ago
adityab / CrossQ
View on GitHub
Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"
☆95Jun 4, 2024Updated 2 years ago
OffDynamicsRL / off-dynamics-rl
View on GitHub
☆65Jan 30, 2026Updated 5 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
MichaelTMatthews / Craftax
View on GitHub
(Crafter + NetHack) in JAX. ICML 2024 Spotlight.
☆424Jun 20, 2026Updated last month
dojeon-ai / DraftRec
View on GitHub
Code for the paper "DraftRec: Personalized Draft Recommendation for Winning in Multi-Player Online Battle Arena Games" (WWW 2022)
☆18Aug 11, 2023Updated 2 years ago
seohongpark / horizon-reduction
View on GitHub
The official implementation of "Horizon Reduction Makes RL Scalable"
☆199Aug 2, 2025Updated 11 months ago
d5rlbenchmark / d5rl
View on GitHub
☆31Oct 3, 2023Updated 2 years ago
Howuhh / streaming-drl-jax
View on GitHub
streaming deep reinforcement learning but 4x faster with jax!
☆19Jan 4, 2026Updated 6 months ago
gauthamvasan / avg
View on GitHub
Action Value Gradient Algorithm
☆28May 18, 2025Updated last year
XuGW-Kevin / DrM
View on GitHub
DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements …
☆78Feb 19, 2026Updated 5 months ago
seohongpark / fql
View on GitHub
The official implementation of flow Q-learning (FQL)
☆320Jul 21, 2025Updated 11 months ago
roger-creus / Wave-Defense-Learning-Environment
View on GitHub
A videogame made with PyGame turned into an Open AI Gym Learning Environment for Reinforcement Learning agents.
☆14Jan 3, 2023Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
wertyuilife2 / bmpc
View on GitHub
[ICLR 2025] Bootstrapped Model Predictive Control
☆39Jul 21, 2025Updated 11 months ago
younggyoseo / FastTD3
View on GitHub
☆456May 16, 2026Updated 2 months ago
aalmuzairee / dmcgb2
View on GitHub
Official release of the DMControl Generalization Benchmark 2 (DMC-GB2)
☆22Jul 21, 2025Updated last year
RajGhugare19 / builderbench
View on GitHub
☆35Mar 26, 2026Updated 3 months ago
AlexGoldie / rl-learned-optimization
View on GitHub
Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"
☆31Dec 15, 2025Updated 7 months ago
isaac7778 / FIRE
View on GitHub
Code for the paper "FIRE: Frobenius-Isometry Reinitialization for Balancing the Stability–Plasticity Tradeoff" (ICLR 2026 Oral)
☆29Apr 27, 2026Updated 2 months ago
cvoelcker / reppo
View on GitHub
Official Code for "Relative Entropy Pathwise Policy Optimization"
☆59May 6, 2026Updated 2 months ago
nico-bohlinger / RL-X
View on GitHub
A framework for Reinforcement Learning research.
☆267Updated this week
aidanscannell / iqrl
View on GitHub
iQRL: implicitly Quantized Representations for Sample-efficient Reinforcement Learning
☆12Jan 8, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
evgenii-nikishin / rl_with_resets
View on GitHub
JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"
☆107May 17, 2022Updated 4 years ago
joonleesky / train-procgen-pytorch
View on GitHub
Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.
☆31Sep 10, 2020Updated 5 years ago
pd-perry / TQL
View on GitHub
☆28May 11, 2026Updated 2 months ago
seohongpark / METRA
View on GitHub
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)
☆92Oct 15, 2023Updated 2 years ago
StoneT2000 / rl-robotics-speedrun
View on GitHub
speed-running solving robot manipulation tasks
☆24Oct 31, 2024Updated last year
MichaelTMatthews / Craftax_Baselines
View on GitHub
☆28Jun 16, 2026Updated last month
Viraj-Joshi / MTBench
View on GitHub
☆45Jul 1, 2026Updated 2 weeks ago