adityab / CrossQ
Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"
☆53Updated 3 months ago
Related projects: ⓘ
- ☆41Updated last year
- Source files to replicate experiments in my ICLR 2022 paper.☆59Updated 2 months ago
- ☆37Updated last year
- Skeleton for scalable and flexible Jax RL implementations☆58Updated last year
- ☆23Updated 2 years ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆87Updated 3 months ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆49Updated 11 months ago
- ☆33Updated last year
- Simple maze environments using mujoco-py☆52Updated 8 months ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆71Updated 9 months ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023☆16Updated 8 months ago
- [ICML 2023] Pre-train world model-based agents with different unsupervised strategies, fine-tune the agent's components selectively, and …☆31Updated 6 months ago
- Jax/Flax Implementation of TD-MPC2☆42Updated last month
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆76Updated last year
- Scripts to recreate the D4RL datasets with Minari☆16Updated last week
- Implementation of Tactical Optimistic and Pessimistic value estimation☆24Updated last year
- Synthetic Experience Replay☆62Updated 3 months ago
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆19Updated 5 months ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆22Updated last year
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning☆68Updated last month
- [ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning.☆24Updated last year
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin…☆33Updated last week
- EARL: Environment for Autonomous Reinforcement Learning☆33Updated last year
- Implementation of Jump-Start Reinforcement Learning (JSRL) with Stable Baselines3☆24Updated 8 months ago
- A version of the DeepMind Control Suite with randomly generated graphics, for measuring visual generalization in continuous control.☆16Updated 3 years ago
- Conservative Q learning in Jax☆49Updated last year
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization☆14Updated 5 months ago
- Foundation Policies with Hilbert Representations (ICML 2024)☆65Updated 5 months ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆100Updated 2 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆49Updated 8 months ago