seijin-kobayashi / cocoa
Code accompanying the paper "Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysis"
☆11Updated 7 months ago
Related projects ⓘ
Alternatives and complementary repositories for cocoa
- PyTorch implementation for "Discovery of Incremental Skills" (DISk) algorithm from ICLR 2022 paper "One After Another: Learning Increment…☆18Updated 2 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 2 years ago
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023)☆23Updated last year
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆12Updated 2 years ago
- Official repo for Offline RL for Online RL☆14Updated last year
- A short conceptual replication of "Prefrontal cortex as a meta-reinforcement learning system" in Jax.☆14Updated last year
- Modular Single-file Reinfocement Learning Algorithms Library☆37Updated last year
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆18Updated last year
- Official repo for our AAAI'21 paper, https://arxiv.org/abs/2007.12354☆25Updated 3 years ago
- ☆30Updated 3 months ago
- ☆29Updated 7 months ago
- Model-based reinforcement learning (generative simulator models and planning agents)☆15Updated 3 years ago
- An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.☆14Updated last year
- Implementation of Proximal Policy Optimization in Jax+Flax☆16Updated last year
- OpenAI gym environments for goal-conditioned and language-conditioned reinforcement learning☆13Updated last year
- Model-Based Reinforcement Learning via Latent-Space Collocation.☆31Updated last year
- Scalable Opponent Shaping Experiments in JAX☆21Updated 7 months ago
- ☆16Updated 2 years ago
- Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML…☆25Updated last year
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆15Updated 2 years ago
- ☆20Updated 6 months ago
- Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting☆16Updated 9 months ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆35Updated 2 weeks ago
- Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" …☆12Updated 4 months ago
- Image-based gridworld experiment for learning Markov state abstractions☆19Updated last month
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆22Updated 7 months ago
- Simplistic Pytorch Implementation of the Dreamer-RL☆20Updated 2 years ago
- Generalised UDRL☆37Updated 2 years ago
- ☆37Updated 2 years ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆14Updated 2 weeks ago