Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL
☆29Oct 29, 2023Updated 2 years ago
Alternatives and similar repositories for RewardShifting
Users that are interested in RewardShifting are comparing it to the libraries listed below
Sorting:
- PyTorch implementation for "Discovery of Incremental Skills" (DISk) algorithm from ICLR 2022 paper "One After Another: Learning Increment…☆20Mar 22, 2022Updated 3 years ago
- Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…☆12Jan 31, 2023Updated 3 years ago
- Official implementation of A Mixture of Surprises for Unsupervised Reinforcement Learning☆23Nov 16, 2022Updated 3 years ago
- ☆13Apr 25, 2024Updated last year
- Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023☆56Feb 3, 2023Updated 3 years ago
- Simple single file implementations of Reinforcement Learning algorithms in Julia☆23Feb 15, 2025Updated last year
- A PyTorch Implementation of PlaNet: A Deep Planning Network for Reinforcement Learning☆12Aug 31, 2020Updated 5 years ago
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…☆28Mar 24, 2023Updated 2 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Aug 8, 2022Updated 3 years ago
- Implementation of CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement Learning☆27May 15, 2020Updated 5 years ago
- Implementation of Proximal Policy Optimization in Jax+Flax☆21May 18, 2023Updated 2 years ago
- Advantage weighted Actor Critic for Offline RL☆52Aug 27, 2022Updated 3 years ago
- Change-Based Exploration Transfer☆35Apr 24, 2022Updated 3 years ago
- On the model-based stochastic value gradient for continuous reinforcement learning☆57Jan 7, 2026Updated last month
- Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization☆83Apr 13, 2023Updated 2 years ago
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"☆19Mar 17, 2022Updated 3 years ago
- ☆19Jun 25, 2023Updated 2 years ago
- ☆24Jan 26, 2024Updated 2 years ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆87Feb 14, 2023Updated 3 years ago
- Official release of the DMControl Generalization Benchmark 2 (DMC-GB2)☆22Jul 21, 2025Updated 7 months ago
- Synthetic Experience Replay☆109May 27, 2024Updated last year
- Repository for the paper: "Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation" @ NeurIPS 2022☆21Jul 10, 2023Updated 2 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆70Jan 18, 2024Updated 2 years ago
- ☆25Apr 16, 2024Updated last year
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Oct 12, 2023Updated 2 years ago
- Source code for Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach (NeurIPS 2023)☆10Dec 12, 2023Updated 2 years ago
- PyTorch implementation for all methods and environments in the paper "MIMEx: Intrinsic Rewards from Masked Input Modeling"☆16May 17, 2023Updated 2 years ago
- ☆10Jun 27, 2024Updated last year
- An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.☆18Jan 16, 2023Updated 3 years ago
- Source files to replicate experiments in my ICLR 2022 paper.☆71Jul 17, 2025Updated 7 months ago
- The official implementation of "Horizon Reduction Makes RL Scalable"☆181Aug 2, 2025Updated 7 months ago
- Continual Reinforcement Learning in 3D Non-stationary Environments☆39Jun 16, 2019Updated 6 years ago
- Code for the paper: Causal Action Influence Aware Counterfactual Data Augmentation @ICML2024☆11Jul 19, 2024Updated last year
- ☆11Oct 19, 2020Updated 5 years ago
- Gym env for Slay the Spire☆16Dec 31, 2024Updated last year
- Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025☆17Nov 24, 2025Updated 3 months ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery☆83Jul 27, 2022Updated 3 years ago
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆57May 21, 2023Updated 2 years ago
- Jaxpr Visualisation Tool☆36Dec 22, 2024Updated last year