seijin-kobayashi / cocoaLinks

Code accompanying the paper "Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysis"

☆11

Alternatives and similar repositories for cocoa

Users that are interested in cocoa are comparing it to the libraries listed below

Sorting:

frt03 / jax_dt
Minimal Decision Transformer Implementation written in Jax (Flax).
☆17Updated 2 years ago
philipjball / ReadyPolicyOne
🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)
☆18Updated 2 years ago
notmahi / disk
PyTorch implementation for "Discovery of Incremental Skills" (DISk) algorithm from ICLR 2022 paper "One After Another: Learning Increment…
☆19Updated 3 years ago
TrentBrick / RewardConditionedUDRL
Open source code combining implementations of Upside Down Reinforcement Learning and Reward Conditioned Policies
☆18Updated 4 years ago
rll-research / teachable
☆17Updated last year
uncharted-technologies / risk-and-uncertainty
Code that can be used to reproduce the experiments in our paper "Estimating Risk and Uncertainty in Deep Reinforcement Learning"
☆30Updated 2 years ago
subho406 / Recurrent-PPO-Jax
Implementation of Proximal Policy Optimization in Jax+Flax
☆20Updated 2 years ago
xingchenwan / bgpbt
[AutoML'22] Bayesian Generational Population-based Training (BG-PBT)
☆28Updated 2 years ago
kenjyoung / dreamerv2_JAX
An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.
☆15Updated 2 years ago
kchua / mbrl-jax
MBRL library in JAX
☆10Updated 2 years ago
zchuning / latco
Model-Based Reinforcement Learning via Latent-Space Collocation.
☆33Updated 2 years ago
google-deepmind / active_ops
☆32Updated 11 months ago
Cranial-XIX / metric-residual-network
Official PyTorch Implementation for Metric Residual Networks for Sample Efficient Goal-Conditioned Reinforcement Learning
☆17Updated 2 years ago
holarissun / RewardShifting
Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL
☆30Updated last year
qgallouedec / lge
☆31Updated last year
nathanwispinski / meta-rl
A short conceptual replication of "Prefrontal cortex as a meta-reinforcement learning system" in Jax.
☆17Updated 2 years ago
MaxSobolMark / OOO
Official repo for Offline RL for Online RL
☆17Updated last year
Mehooz / BIRD_code
Code for paper "Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning".
☆14Updated 4 years ago
avillaflor / SPLT-transformer
☆18Updated 3 years ago
adaptive-intelligent-robotics / QDAC
Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" …
☆16Updated last year
younggyoseo / trajectory_mcl
Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)
☆39Updated 4 years ago
seohongpark / CSD-locomotion
Controllability-Aware Unsupervised Skill Discovery (ICML 2023)
☆27Updated 2 years ago
Kaixhin / GUDRL
Generalised UDRL
☆37Updated 3 years ago
marc-rigter / waker
Official code for "Reward-Free Curricula for Training Robust World Models", ICLR 2024.
☆27Updated last year
alinlab / oreo
☆23Updated 3 years ago
Vision-CAIR / AF-Guide
Official repository of Action-Free Guide
☆11Updated 2 years ago
si0wang / COPlanner
☆23Updated last year
montrealrobotics / iv_rl
IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation
☆39Updated 8 months ago
SAIC-MONTREAL / hyperzero
Code for AAAI 2023 paper "Hypernetworks for Zero-shot Transfer in Reinforcement Learning"
☆20Updated 2 years ago
albertwilcox / mcac
Author implementation of Monte Carlo Augmented Actor Critic in PyTorch
☆17Updated 2 years ago