sfujim / SR-DICEView external linksLinks
Author's PyTorch implementation of SR-DICE for marginalized importance sampling
☆28Dec 7, 2021Updated 4 years ago
Alternatives and similar repositories for SR-DICE
Users that are interested in SR-DICE are comparing it to the libraries listed below
Sorting:
- Code for our NeurIPS 2020 paper Improving Generalization in Reinforcement Learning with Mixture Regularization☆35Oct 22, 2020Updated 5 years ago
- Author's PyTorch implementation of LAP and PAL with TD3 and DDQN☆39Dec 7, 2021Updated 4 years ago
- Code associated with our paper "Estimating Risk and Uncertainty in Reinforcement Learning"☆11Oct 3, 2023Updated 2 years ago
- The official code release for "Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning", ICLR 2025☆13May 28, 2025Updated 8 months ago
- ☆10Aug 17, 2022Updated 3 years ago
- Pytorch Implementation of AAMAS 2021 paper <Energy-Based Imitation Learning>☆11Oct 8, 2021Updated 4 years ago
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆27Feb 3, 2022Updated 4 years ago
- Dynamic Simulation Environments for Reinforcement Learning☆13Apr 17, 2021Updated 4 years ago
- JAX implementation of Graph Attention Networks☆13Jan 29, 2022Updated 4 years ago
- Made for a reading group at the Center for Safe AGI.☆12Oct 27, 2022Updated 3 years ago
- Train, evaluate, and optimize implicit feedback-based recommender systems.☆31Jul 10, 2025Updated 7 months ago
- ☆56Jun 6, 2023Updated 2 years ago
- Tutorials on learning and using successor representations.☆54Oct 31, 2019Updated 6 years ago
- Code for ICML2023 Paper: Continuation Path Learning for Homotopy Optimization☆13Dec 31, 2025Updated last month
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)☆55Jul 27, 2021Updated 4 years ago
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning☆34Feb 1, 2020Updated 6 years ago
- ☆18Apr 22, 2024Updated last year
- Library to compare and evaluate reward functions☆67Oct 23, 2023Updated 2 years ago
- Representation Learning in RL☆13Jun 1, 2022Updated 3 years ago
- A lightweight reimplementation of Adversarially Trained Actor Critic☆19Sep 11, 2023Updated 2 years ago
- Implementation of the Prioritized Option-Critic on the Four-Rooms Environment☆17Dec 24, 2017Updated 8 years ago
- ☆18Jul 25, 2024Updated last year
- Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.☆14Jul 16, 2024Updated last year
- ☆40Nov 23, 2021Updated 4 years ago
- Mirror Descent Policy Optimization☆42Oct 31, 2020Updated 5 years ago
- Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space…☆24Nov 29, 2018Updated 7 years ago
- ☆15Apr 5, 2023Updated 2 years ago
- Code for the paper "Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation"☆19Jul 11, 2023Updated 2 years ago
- Official implementation of "Know Your Action Set: Learning Action Relations for Reinforcement Learning", Jain et al., ICLR 2022.☆18Mar 16, 2022Updated 3 years ago
- [AAAI-25] Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning.☆27May 29, 2025Updated 8 months ago
- Learning Laplacian Representations in Reinforcement Learning☆18Jan 2, 2021Updated 5 years ago
- Reinforcement learning algorithms☆41Feb 27, 2019Updated 6 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆40Jul 18, 2025Updated 6 months ago
- N-Layered FeUdal Networks based on FeUdal Networks adapted to suit PySC2 observations☆18Sep 17, 2019Updated 6 years ago
- Safe Option-Critic: Learning Safety in the Option-Critic Architecture☆20Dec 16, 2018Updated 7 years ago
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"☆19Mar 17, 2022Updated 3 years ago
- Self-implemented code for Model-Based Meta-Reinforcement Learning☆17Apr 28, 2019Updated 6 years ago
- Author's PyTorch implementation of TD7 for online and offline RL☆161Sep 12, 2023Updated 2 years ago
- Experiments with Message Passing GNNs in C++ and PyTorch.☆26Jul 25, 2024Updated last year