danielpalenicek / value_expansion
☆15 · Updated 2 years ago
Alternatives and similar repositories for value_expansion
Users who are interested in value_expansion are comparing it to the repositories listed below.
- Skeleton for scalable and flexible Jax RL implementations ☆88 · Updated 2 years ago
- Clean single-file implementation of offline RL algorithms in JAX ☆158 · Updated 10 months ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity" ☆80 · Updated last year
- (no description) ☆49 · Updated 2 years ago
- Online Goal-Conditioned Reinforcement Learning in JAX. ICLR 2025 Spotlight. ☆191 · Updated 3 weeks ago
- Simple maze environments using mujoco-py ☆56 · Updated last year
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation ☆76 · Updated 2 years ago
- Unofficial re-implementation of "Learning Latent Dynamics for Planning from Pixels" (https://arxiv.org/abs/1811.04551) with PyTorch ☆46 · Updated 5 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective ☆81 · Updated 2 years ago
- (no description) ☆48 · Updated 2 years ago
- Benchmarking RL generalization in an interpretable way. ☆166 · Updated 2 weeks ago
- ExORL: Exploratory Data for Offline Reinforcement Learning ☆116 · Updated 3 years ago
- Conservative Q learning in Jax ☆55 · Updated 2 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ☆102 · Updated 3 years ago
- OpenAI Gym wrapper for the DeepMind Control Suite ☆223 · Updated last year
- (no description) ☆45 · Updated 2 years ago
- Deep Hierarchical Planning from Pixels ☆109 · Updated 2 years ago
- On the model-based stochastic value gradient for continuous reinforcement learning ☆57 · Updated 2 years ago
- Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020] ☆53 · Updated 4 years ago
- Source files to replicate experiments in my ICLR 2022 paper. ☆70 · Updated 3 months ago
- A framework for Reinforcement Learning research. ☆163 · Updated this week
- Jax/Flax Implementation of TD-MPC2 ☆66 · Updated last week
- A collection of RL algorithms written in JAX. ☆104 · Updated 3 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation ☆25 · Updated 2 years ago
- (no description) ☆114 · Updated 2 years ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier" ☆27 · Updated 2 years ago
- DMControl Generalization Benchmark ☆178 · Updated last year
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations ☆111 · Updated last year
- (no description) ☆109 · Updated 8 months ago
- NeurIPS Reproducibility Challenge 2019 ☆20 · Updated 5 years ago