Stilwell-Git/Randomized-Return-Decomposition

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Stilwell-Git/Randomized-Return-Decomposition)

Stilwell-Git / Randomized-Return-Decomposition

TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"

☆19

Alternatives and similar repositories for Randomized-Return-Decomposition

Users that are interested in Randomized-Return-Decomposition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

tgangwani / GuidanceRewards
View on GitHub
Pytorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020)
☆12Jul 7, 2021Updated 5 years ago
DesikRengarajan / LOGO
View on GitHub
[ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration
☆28Feb 10, 2022Updated 4 years ago
sfujim / LAP-PAL
View on GitHub
Author's PyTorch implementation of LAP and PAL with TD3 and DDQN
☆41Dec 7, 2021Updated 4 years ago
RoozbehRazavi / BIMRL
View on GitHub
Implementation of BIMRL: Brain Inspired Meta Reinforcement Learning - Roozbeh Razavi et al. (IROS 2022)
☆10Dec 1, 2022Updated 3 years ago
ReedZyd / GenerativeReturnDecomposition
View on GitHub
Source code for Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach (NeurIPS 2023)
☆10Dec 12, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
holarissun / RewardShifting
View on GitHub
Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL
☆29Oct 29, 2023Updated 2 years ago
xingruiyu / GIRIL
View on GitHub
ICML'20: Intrinsic Reward Driven Imitation Learning via Generative Model
☆15Nov 5, 2021Updated 4 years ago
uber-research / D3G
View on GitHub
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32Feb 21, 2020Updated 6 years ago
zwfightzw / Meta-Critic
View on GitHub
☆11Oct 19, 2020Updated 5 years ago
lizhuo-1994 / NECSA
View on GitHub
Official implementation of Neural Episodic Control with State Abstraction
☆13Aug 3, 2023Updated 2 years ago
illidanlab / opolo-code
View on GitHub
☆32Mar 4, 2021Updated 5 years ago
nicklashansen / svea-vit
View on GitHub
Code for the paper "Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation"
☆19Jul 11, 2023Updated 3 years ago
ml-jku / OfflineRL
View on GitHub
☆31Jan 16, 2023Updated 3 years ago
akakzia / decstr
View on GitHub
☆15Aug 9, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
mansicer / Q-Adapter
View on GitHub
Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"
☆18Oct 5, 2024Updated last year
twni2016 / Memory-RL
View on GitHub
When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)
☆73Apr 26, 2026Updated 2 months ago
facebookresearch / interaction-exploration
View on GitHub
Code for "Learning Affordance Landscapes for Interaction Exploration in 3D Environments" (NeurIPS 20)
☆38Jul 6, 2023Updated 3 years ago
jidiai / olympics_engine
View on GitHub
A simple 2D ball collision engine.
☆12Jun 15, 2023Updated 3 years ago
huxiao09 / QPA
View on GitHub
☆13Sep 24, 2024Updated last year
rll-research / rune
View on GitHub
Code for paper: Reward Uncertainty for Exploration in Preference-based Reinforcement Learning
☆15May 26, 2022Updated 4 years ago
tigerneil / reinforcementlearning.today
View on GitHub
Made for a reading group at the Center for Safe AGI.
☆12Feb 23, 2026Updated 4 months ago
philipjball / TD3_PyTorch
View on GitHub
♊ Minimal PyTorch Twin Delayed DDPG (TD3) implementation
☆10Jun 20, 2021Updated 5 years ago
dyne-submission / dynamics-aware-embeddings
View on GitHub
☆16Sep 25, 2019Updated 6 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
holarissun / RewardModelingBeyondBradleyTerry
View on GitHub
official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and…
☆73Apr 2, 2025Updated last year
ml-jku / rudder-demonstration-code
View on GitHub
Code for demonstration example-task in RUDDER blog
☆24May 19, 2020Updated 6 years ago
NagisaZj / MetaCURE-Public
View on GitHub
☆15Apr 5, 2023Updated 3 years ago
facebookresearch / go-fresh
View on GitHub
Original code for the paper "Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping" by Mezghani et al.
☆18Jun 8, 2023Updated 3 years ago
Stanford-ILIAD / TREX-pytorch
View on GitHub
A PyTorch implementation for the paper 'Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observatio…
☆14Sep 22, 2021Updated 4 years ago
sfujim / SR-DICE
View on GitHub
Author's PyTorch implementation of SR-DICE for marginalized importance sampling
☆28Dec 7, 2021Updated 4 years ago
dmksjfl / DARC
View on GitHub
Code for Efficient Continuous Control with Double Actors and Regularized Critics, AAAI 2022.
☆22Mar 11, 2022Updated 4 years ago
tesslerc / GAC
View on GitHub
Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control"
☆22Dec 17, 2019Updated 6 years ago
YangRui2015 / Modular_HER
View on GitHub
Modular-HER is revised from OpenAI baselines and supports many improvements for Hindsight Experience Replay as modules.
☆17Jun 23, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
yfletberliac / adversarially-guided-actor-critic
View on GitHub
AGAC: Adversarially Guided Actor-Critic
☆47Sep 16, 2021Updated 4 years ago
taodav / nsrs
View on GitHub
Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.
☆14Jul 16, 2024Updated 2 years ago
wensun / Imitation-Learning-from-Observation
View on GitHub
☆24Jul 6, 2023Updated 3 years ago
nigelyaoj / Quality-Similar-Diversity
View on GitHub
Official Implementation for Quality-Similar Diversity via Population Based Reinforcement Learning
☆19Dec 26, 2025Updated 6 months ago
kingdy2002 / VCSE
View on GitHub
☆18Jun 8, 2023Updated 3 years ago
facebookresearch / RLCD
View on GitHub
Reproduction of "RLCD Reinforcement Learning from Contrast Distillation for Language Model Alignment
☆70Aug 18, 2023Updated 2 years ago
keynans / HypeRL
View on GitHub
Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)
☆26Jun 9, 2021Updated 5 years ago