lucidrains / SAC-pytorch

Implementation of Soft Actor Critic and some of its improvements in Pytorch

☆59

Alternatives and similar repositories for SAC-pytorch

Users who are interested in SAC-pytorch are comparing it to the repositories listed below.

lucidrains / improving-transformers-world-model-for-rl
Implementation of the new SOTA for model based RL, from the paper "Improving Transformer World Models for Data-Efficient RL", in Pytorch
☆128 Updated 3 months ago
lucidrains / ppo
An implementation of PPO in Pytorch
☆93 Updated last month
vmicheli / delta-iris
Efficient World Models with Context-Aware Tokenization. ICML 2024
☆105 Updated 10 months ago
DHDev0 / Muzero-unplugged
Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…
☆31 Updated last month
vwxyzjn / cleanba
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
☆114 Updated 11 months ago
chandar-lab / Recall2Imagine
Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024
☆70 Updated last year
ml-jku / LRAM
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks
☆33 Updated 9 months ago
Reytuag / transformerXL_PPO_JAX
☆81 Updated 9 months ago
lucidrains / evolutionary-policy-optimization
Pytorch implementation of Evolutionary Policy Optimization, from Wang et al. of the Robotics Institute at Carnegie Mellon University
☆97 Updated this week
facebookresearch / MRQ
MR.Q is a general-purpose model-free reinforcement learning algorithm.
☆106 Updated last month
luchris429 / popjaxrl
Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]
☆109 Updated last year
ollebompa / PGA-MAP-Elites
Repository for the PGA-MAP-Elites algorithm. PGA-MAP-Elites was developed to efficiently scale MAP-Elites to large genotypes and noisy d…
☆58 Updated 3 years ago
keraJLi / synthetic-gymnax
Drop-in environment replacements that make your RL algorithm train faster.
☆21 Updated last year
tinkoff-ai / ReBRAC
Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC
☆55 Updated 2 years ago
dunnolab / xland-minigrid-datasets
XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning (ICLR 2025)
☆78 Updated 5 months ago
lucidrains / scaling-vin-pytorch
Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group
☆36 Updated 10 months ago
adityabingi / Dreamer
Reproduction of Dreamer v1 and v2 in PyTorch for the DeepMind Control Suite
☆43 Updated 2 years ago
facebookresearch / minimax
Efficient baselines for autocurricula in JAX.
☆191 Updated 11 months ago
facebookresearch / mtm
MTM: Masked Trajectory Models for Prediction, Representation, and Control.
☆157 Updated 2 years ago
DramaCow / jaxued
☆82 Updated 4 months ago
rail-berkeley / SUPE
This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."
☆31 Updated 3 weeks ago
dojeon-ai / SimbaV2
Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"
☆58 Updated 2 months ago
MarcoMeter / endless-memory-gym
Challenging Memory-based Deep Reinforcement Learning Agents
☆102 Updated 9 months ago
radarFudan / mamba-minimal-jax
☆31 Updated 8 months ago
zach-lawless / gym-wordle
Gym environment for playing Wordle with RL agents
☆39 Updated 3 years ago
seohongpark / HILP
Foundation Policies with Hilbert Representations (ICML 2024)
☆90 Updated last year
facebookresearch / bc-irl
Implementation of BC-IRL and other IRL baselines
☆28 Updated 2 years ago
vivekmyers / contrastive_metrics
Code for the paper "Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making"
☆27 Updated last year
facebookresearch / how-to-autorl
Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…
☆83 Updated last year
SonyResearch / simba
☆102 Updated 5 months ago