danielpalenicek / value_expansion
☆15Updated 2 years ago
Alternatives and similar repositories for value_expansion
Users who are interested in value_expansion are comparing it to the libraries listed below.
- Online Goal-Conditioned Reinforcement Learning in JAX. ICLR 2025 Spotlight.☆186Updated 4 months ago
- ☆46Updated 2 years ago
- Clean single-file implementation of offline RL algorithms in JAX☆154Updated 9 months ago
- Simple maze environments using mujoco-py☆56Updated last year
- ☆44Updated 2 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆81Updated 2 years ago
- Benchmarking RL generalization in an interpretable way.☆162Updated 3 months ago
- Skeleton for scalable and flexible Jax RL implementations☆86Updated 2 years ago
- Conservative Q learning in Jax☆55Updated 2 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆100Updated 3 years ago
- Unofficial re-implementation of "Learning Latent Dynamics for Planning from Pixels" (https://arxiv.org/abs/1811.04551) with PyTorch☆47Updated 5 years ago
- Unified Implementations of Offline Reinforcement Learning Algorithms☆95Updated 4 months ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆79Updated last year
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation☆76Updated 2 years ago
- Evaluating long-term memory of reinforcement learning algorithms☆148Updated 2 years ago
- A framework for Reinforcement Learning research.☆159Updated 3 weeks ago
- Deep Hierarchical Planning from Pixels☆107Updated 2 years ago
- Jax/Flax Implementation of TD-MPC2☆66Updated last month
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆108Updated last year
- ExORL: Exploratory Data for Offline Reinforcement Learning☆115Updated 3 years ago
- PyTorch version of Dreamer, which follows the original TF v2 code.☆131Updated 3 years ago
- ☆105Updated 6 months ago
- ☆23Updated 3 years ago
- On the model-based stochastic value gradient for continuous reinforcement learning☆56Updated 2 years ago
- OpenAI Gym wrapper for the DeepMind Control Suite☆221Updated last year
- Partially Observable Process Gym☆199Updated 3 months ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆89Updated 9 months ago
- ☆114Updated 2 years ago
- DMControl Generalization Benchmark☆175Updated last year
- ☆22Updated 4 years ago