holarissun/RewardShifting

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/holarissun/RewardShifting)

holarissun / RewardShifting

Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL

☆29

Alternatives and similar repositories for RewardShifting

Users that are interested in RewardShifting are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

tinkoff-ai / cnf
View on GitHub
Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…
☆12Jan 31, 2023Updated 3 years ago
LeapLabTHU / MOSS
View on GitHub
Official implementation of A Mixture of Surprises for Unsupervised Reinforcement Learning
☆23Nov 16, 2022Updated 3 years ago
notmahi / disk
View on GitHub
PyTorch implementation for "Discovery of Incremental Skills" (DISk) algorithm from ICLR 2022 paper "One After Another: Learning Increment…
☆21Mar 22, 2022Updated 4 years ago
kenjyoung / dreamerv2_JAX
View on GitHub
An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.
☆18Jan 16, 2023Updated 3 years ago
tinkoff-ai / sac-rnd
View on GitHub
Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023
☆58Feb 3, 2023Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
uoe-agents / CMID
View on GitHub
☆13Apr 25, 2024Updated 2 years ago
Stilwell-Git / Randomized-Return-Decomposition
View on GitHub
TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"
☆19Mar 17, 2022Updated 4 years ago
dunnolab / NinA
View on GitHub
Official implementation of "NinA: Normalizing Flows in Action. Training VLA Models with Normalizing Flows"
☆17Sep 22, 2025Updated 10 months ago
flowersteam / curious
View on GitHub
Implementation of CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement Learning
☆27May 15, 2020Updated 6 years ago
sash-a / CleanRL.jl
View on GitHub
Simple single file implementations of Reinforcement Learning algorithms in Julia
☆24Feb 15, 2025Updated last year
CEC-Agent / CEC
View on GitHub
Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"
☆32Oct 12, 2023Updated 2 years ago
ReedZyd / GenerativeReturnDecomposition
View on GitHub
Source code for Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach (NeurIPS 2023)
☆10Dec 12, 2023Updated 2 years ago
frt03 / jax_dt
View on GitHub
Minimal Decision Transformer Implementation written in Jax (Flax).
☆18Aug 8, 2022Updated 3 years ago
yuchen-x / MacroMARL
View on GitHub
☆26Apr 16, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
abhayraw1 / planet-torch
View on GitHub
A PyTorch Implementation of PlaNet: A Deep Planning Network for Reinforcement Learning
☆13Aug 31, 2020Updated 5 years ago
tedmoskovitz / TOP
View on GitHub
Implementation of Tactical Optimistic and Pessimistic value estimation
☆25Jul 18, 2023Updated 3 years ago
sparisi / cbet
View on GitHub
Change-Based Exploration Transfer
☆35Apr 24, 2022Updated 4 years ago
Improbable-AI / eipo
View on GitHub
Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization
☆83Apr 13, 2023Updated 3 years ago
max7born / decision-lstm
View on GitHub
Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…
☆28Mar 24, 2023Updated 3 years ago
hwang-ua / inac_pytorch
View on GitHub
☆20Jun 25, 2023Updated 3 years ago
Div-Infinity / XQL
View on GitHub
Extreme Q-Learning: Max Entropy RL without Entropy
☆88Feb 14, 2023Updated 3 years ago
twni2016 / Memory-RL
View on GitHub
When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)
☆73Apr 26, 2026Updated 2 months ago
zhaoyi11 / tcrl
View on GitHub
☆26Jan 26, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
facebookresearch / svg
View on GitHub
On the model-based stochastic value gradient for continuous reinforcement learning
☆58Mar 6, 2026Updated 4 months ago
martius-lab / cee-us
View on GitHub
Repository for the paper: "Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation" @ NeurIPS 2022
☆21Jul 10, 2023Updated 3 years ago
subho406 / Recurrent-PPO-Jax
View on GitHub
Implementation of Proximal Policy Optimization in Jax+Flax
☆21May 18, 2023Updated 3 years ago
conglu1997 / SynthER
View on GitHub
Synthetic Experience Replay
☆114Apr 16, 2026Updated 3 months ago
hari-sikchi / AWAC
View on GitHub
Advantage weighted Actor Critic for Offline RL
☆53Aug 27, 2022Updated 3 years ago
baturaysaglam / actor-prioritized-exp-replay
View on GitHub
Actor Prioritized Experience Replay
☆19Nov 20, 2023Updated 2 years ago
lizhuo-1994 / NECSA
View on GitHub
Official implementation of Neural Episodic Control with State Abstraction
☆13Aug 3, 2023Updated 2 years ago
holarissun / embedding-based-llm-alignment
View on GitHub
Codebase for Paper Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs
☆22Apr 24, 2025Updated last year
Pervasive-AI-Lab / crlmaze
View on GitHub
Continual Reinforcement Learning in 3D Non-stationary Environments
☆39Jun 16, 2019Updated 7 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
instadeepai / qd-skill-discovery-benchmark
View on GitHub
Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery
☆17Apr 2, 2026Updated 3 months ago
rll-research / cic
View on GitHub
CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery
☆88Jul 27, 2022Updated 3 years ago
ToruOwO / mimex
View on GitHub
MIMEx: Intrinsic Rewards from Masked Input Modeling [NeurIPS 2023]
☆16May 17, 2023Updated 3 years ago
webstorms / Blocks
View on GitHub
A new model for quickly training and simulating adaptive leaky integrate-and-fire spiking neural networks.
☆14Apr 9, 2024Updated 2 years ago
aalmuzairee / dmcgb2
View on GitHub
Official release of the DMControl Generalization Benchmark 2 (DMC-GB2)
☆22Jul 21, 2025Updated last year
milarobotlearningcourse / mini_crossformer
View on GitHub
☆16Aug 15, 2025Updated 11 months ago
FLAIROx / cultural-accumulation
View on GitHub
☆16Jul 16, 2024Updated 2 years ago