danielpalen / value_expansion
☆15 Updated last year
Related projects:
- Simple maze environments using mujoco-py ☆52 Updated 8 months ago
- Conservative Q learning in Jax ☆49 Updated last year
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations ☆87 Updated 3 months ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity" ☆53 Updated 3 months ago
- ☆41 Updated last year
- ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (pytorch) ☆22 Updated 2 months ago
- Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020] ☆47 Updated 2 years ago
- Benchmarking RL generalization in an interpretable way. ☆128 Updated 7 months ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective ☆76 Updated last year
- Skeleton for scalable and flexible Jax RL implementations ☆58 Updated last year
- Source files to replicate experiments in my ICLR 2022 paper. ☆59 Updated 2 months ago
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization ☆31 Updated last year
- ☆29 Updated 3 years ago
- ☆37 Updated last year
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023 ☆16 Updated 8 months ago
- Inverse Reinforcement Learning via State Marginal Matching, CoRL 2020 ☆37 Updated last year
- ☆30 Updated 10 months ago
- Author's PyTorch implementation of TD7 for online and offline RL ☆108 Updated last year
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation ☆70 Updated last year
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning ☆68 Updated last month
- ☆20 Updated 3 years ago
- Implementation of Truncated Quantile Critics method for continuous reinforcement learning. https://bayesgroup.github.io/tqc/ ☆90 Updated 3 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning ☆100 Updated 2 years ago
- Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan … ☆70 Updated last year
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ☆96 Updated 2 years ago
- NeurIPS Reproducibility Challenge 2019 ☆21 Updated 4 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation ☆24 Updated last year
- improved Cross Entropy Method for trajectory optimization ☆65 Updated 3 years ago
- ☆102 Updated 4 years ago
- A collection of RL algorithms written in JAX. ☆92 Updated 2 years ago