NVlabs / gbrl_sb3

GBRL-based Actor-Critic algorithms implemented in stable-baselines3

☆39

Alternatives and similar repositories for gbrl_sb3

Users who are interested in gbrl_sb3 are comparing it to the libraries listed below.

ml-jku / LRAM
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks
☆36 Updated last year
NVlabs / gbrl
Gradient Boosting Reinforcement Learning (GBRL)
☆124 Updated last week
btnorman / First-Explore
Repo to reproduce the First-Explore paper results
☆38 Updated 10 months ago
facebookresearch / taskmet
TaskMet: Task-driven Metric Learning for Model Learning
☆19 Updated last year
lucidrains / SAC-pytorch
Implementation of Soft Actor Critic and some of its improvements in Pytorch
☆60 Updated 9 months ago
facebookresearch / how-to-autorl
Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…
☆84 Updated last year
keraJLi / synthetic-gymnax
Drop-in environment replacements that make your RL algorithm train faster.
☆21 Updated last year
CEC-Agent / CEC
Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"
☆31 Updated 2 years ago
Farama-Foundation / CrowdPlay
A web based platform for collecting human actions in reinforcement learning environments
☆30 Updated 2 months ago
google-deepmind / agent_debugger
Causal Analysis of Agent Behavior for AI Safety
☆19 Updated 2 years ago
Farama-Foundation / Jumpy
On-the-fly conversions between Jax and NumPy tensors
☆55 Updated 2 years ago
facebookresearch / ssorl
Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories
☆43 Updated 2 years ago
andrew-silva / clean-rl-mlx
Clean RL implementation using MLX
☆33 Updated last year
luchris429 / discovered-policy-optimisation
Code for Discovered Policy Optimisation (NeurIPS 2022)
☆12 Updated 2 years ago
smearle / autoverse
Generative cellular automaton-like learning environments for RL.
☆19 Updated 9 months ago
Kaixhin / GUDRL
Generalised UDRL
☆37 Updated 3 years ago
vwxyzjn / a2c_is_a_special_case_of_ppo
A2C is a special case of PPO!
☆22 Updated 3 years ago
google / deluca
Performant, differentiable reinforcement learning
☆124 Updated 3 months ago
gkswamy98 / fast_irl
Contains implementation of the FILTER algorithm for exponentially faster inverse reinforcement learning.
☆51 Updated 2 years ago
marc-rigter / waker
Official code for "Reward-Free Curricula for Training Robust World Models", ICLR 2024.
☆28 Updated last year
Rose-STL-Lab / AutoSTPP
Automatic Integration for Neural Spatio-Temporal Point Process models (AI-STPP) is a new paradigm for exact, efficient, non-parametric inf…
☆25 Updated last year
brentyi / minGPT-flax
GPT implementation in Flax
☆18 Updated 3 years ago
ivy-llc / gym
Fully differentiable RL environments, written in Ivy.
☆65 Updated 2 years ago
quasimetric-learning / torch-quasimetric
PyTorch Package For Quasimetric Learning
☆44 Updated last year
FLAIROx / cultural-accumulation
☆15 Updated last year
DHDev0 / Muzero-unplugged
Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…
☆34 Updated 4 months ago
FLAIROx / jaxirl
Contains JAX implementation of algorithms for inverse reinforcement learning
☆73 Updated last year
google-deepmind / tell_me_why_explanations_rl
☆37 Updated 2 years ago
araffin / rlss23-dqn-tutorial
Deep Q-Network (DQN) and Fitted Q-Iteration (FQI) tutorial for RL Summer School 2023
☆85 Updated last month
sai-prasanna / dreaming_of_many_worlds
☆23 Updated last year