danielpalen / value_expansion

☆15

Alternatives and similar repositories for value_expansion:

Users who are interested in value_expansion are comparing it to the libraries listed below.

ikostrikov / jaxrl2
☆47 Updated 2 years ago
kngwyu / mujoco-maze
Simple maze environments using mujoco-py
☆54 Updated last year
young-geng / JaxCQL
Conservative Q learning in Jax
☆53 Updated 2 years ago
jxu43 / replication-mbpo
NeurIPS Reproducibility Challenge 2019
☆20 Updated 5 years ago
MichalBortkiewicz / JaxGCRL
Goal-Conditioned Reinforcement Learning with JAX
☆142 Updated 3 weeks ago
adityab / CrossQ
Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"
☆70 Updated 10 months ago
sebascuri / hucrl
☆30 Updated last year
RajGhugare19 / alm
Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective
☆80 Updated 2 years ago
nissymori / JAX-CORL
Clean single-file implementation of offline RL algorithms in JAX
☆143 Updated 4 months ago
cross32768 / PlaNet_PyTorch
Unofficial re-implementation of "Learning Latent Dynamics for Planning from Pixels" (https://arxiv.org/abs/1811.04551) with PyTorch
☆47 Updated 4 years ago
illidanlab / opolo-code
☆31 Updated 4 years ago
facebookresearch / svg
On the model-based stochastic value gradient for continuous reinforcement learning
☆55 Updated last year
dibyaghosh / jaxrl_m
Skeleton for scalable and flexible Jax RL implementations
☆80 Updated last year
EmptyJackson / unifloral
Unified Implementations of Offline Reinforcement Learning Algorithms
☆56 Updated this week
pairlab / vagram
[ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning.
☆24 Updated 2 years ago
Wenxuan-Zhou / PLAS
Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020]
☆52 Updated 3 years ago
automl / CARL
Benchmarking RL generalization in an interpretable way.
☆153 Updated last month
OffDynamicsRL / off-dynamics-rl
☆43 Updated 5 months ago
ikostrikov / dmcgym
☆23 Updated 2 years ago
perrin-isir / xpag
a modular reinforcement learning library with JAX agents
☆26 Updated last month
samlobel / CFN
Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023
☆19 Updated last year
Aladoro / Stabilizing-Off-Policy-RL
☆15 Updated 2 years ago
martius-lab / pink-noise-rl
☆42 Updated 2 years ago
timoklein / redo
ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (pytorch)
☆25 Updated 6 months ago
theogruner / rl_pro_telu
☆22 Updated 3 years ago
tedmoskovitz / TOP
Implementation of Tactical Optimistic and Pessimistic value estimation
☆25 Updated last year
MushroomRL / mushroom-rl-benchmark
Benchmarking suite for MushroomRL Deep RL algorithms
☆15 Updated last year
facebookresearch / hsd3
Code for "Hierarchical Skills for Efficient Exploration" HSD-3 Algorithm and Baselines
☆49 Updated 2 years ago
conglu1997 / v-d4rl
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations
☆100 Updated 10 months ago
ahmed-touati / controllable_agent
☆44 Updated last year