dojeon-ai / SimbaV2

Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"

☆72

Alternatives and similar repositories for SimbaV2

Users who are interested in SimbaV2 are comparing it to the libraries listed below:

SonyResearch / simba
☆109 Updated 8 months ago
seohongpark / HILP
Foundation Policies with Hilbert Representations (ICML 2024)
☆98 Updated last month
EmptyJackson / unifloral
Unified Implementations of Offline Reinforcement Learning Algorithms
☆115 Updated 2 weeks ago
seohongpark / METRA
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)
☆78 Updated 2 years ago
facebookresearch / MRQ
MR.Q is a general-purpose model-free reinforcement learning algorithm.
☆117 Updated 4 months ago
kvfrans / fre
Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"
☆57 Updated last year
adityab / CrossQ
Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"
☆79 Updated last year
seohongpark / HIQL
HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)
☆91 Updated 10 months ago
dibyaghosh / jaxrl_m
Skeleton for scalable and flexible Jax RL implementations
☆88 Updated 2 years ago
kvfrans / rlbase_stable
☆43 Updated last year
mttga / purejaxql
Simple single-file baselines for Q-Learning in pure-GPU setting
☆187 Updated 7 months ago
tinker495 / jax-baseline
Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin…
☆60 Updated last month
ikostrikov / jaxrl2
☆48 Updated 2 years ago
DramaCow / jaxued
☆85 Updated last month
luchris429 / popjaxrl
Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]
☆110 Updated last year
philippe-eecs / IDQL
Repo for Implicit Diffusion Q-Learning
☆116 Updated last year
RajGhugare19 / alm
Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective
☆81 Updated 2 years ago
nissymori / JAX-CORL
Clean single-file implementation of offline RL algorithms in JAX
☆158 Updated 10 months ago
seohongpark / horizon-reduction
The official implementation of "Horizon Reduction Makes RL Scalable"
☆149 Updated 2 months ago
Div-Infinity / XQL
Extreme Q-Learning: Max Entropy RL without Entropy
☆87 Updated 2 years ago
conglu1997 / v-d4rl
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations
☆111 Updated last year
ml-jku / L2M
Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)
☆59 Updated last year
roger-creus / stable-deep-rl-at-scale
Code for the paper "Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning". Great performance in many environments…
☆31 Updated this week
young-geng / JaxCQL
Conservative Q learning in Jax
☆55 Updated 2 years ago
RajGhugare19 / stitching-is-combinatorial-generalisation
[ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.
☆23 Updated last year
AlexGoldie / rl-learned-optimization
Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"
☆28 Updated 6 months ago
MarcoMeter / endless-memory-gym
Challenging Memory-based Deep Reinforcement Learning Agents
☆104 Updated last year
UT-Austin-RPL / amago
off-policy RL on long sequences
☆146 Updated 2 months ago
MichaelTMatthews / Craftax_Baselines
☆18 Updated 5 months ago
chandar-lab / Recall2Imagine
Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024
☆74 Updated last year