kaixin96 / mixregView external linksLinks
Code for our NeurIPS 2020 paper Improving Generalization in Reinforcement Learning with Mixture Regularization
☆35Oct 22, 2020Updated 5 years ago
Alternatives and similar repositories for mixreg
Users that are interested in mixreg are comparing it to the libraries listed below
Sorting:
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆28Dec 7, 2021Updated 4 years ago
- ☆54Feb 28, 2024Updated last year
- Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.☆14Jul 16, 2024Updated last year
- ☆20May 1, 2024Updated last year
- Code to reproduce the NeurIPS 2019 paper "Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottlen…☆52Jun 28, 2020Updated 5 years ago
- Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning / ICLR 2020☆56Apr 27, 2020Updated 5 years ago
- PyTorch implementation for all methods and environments in the paper "MIMEx: Intrinsic Rewards from Masked Input Modeling"☆16May 17, 2023Updated 2 years ago
- Source code for NeurIPS 2020 paper "Node Classification on Graphs with Few-Shot Novel Labels via Meta Transformed Network Embedding"☆10Nov 17, 2020Updated 5 years ago
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.☆21Mar 9, 2021Updated 4 years ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆70Aug 8, 2022Updated 3 years ago
- This repository contains the code of the paper Equivariant Q Learning in Spatial Action Spaces☆11Nov 4, 2021Updated 4 years ago
- Automatic Recall Machines: Internal Replay, Continual Learning and the Brain☆11Jul 14, 2020Updated 5 years ago
- Pytorch Implementation of AAMAS 2021 paper <Energy-Based Imitation Learning>☆11Oct 8, 2021Updated 4 years ago
- Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning☆11Jun 16, 2022Updated 3 years ago
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"☆31Jul 27, 2021Updated 4 years ago
- Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors"☆46Nov 22, 2022Updated 3 years ago
- Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior"☆84Dec 13, 2019Updated 6 years ago
- ☆11Aug 2, 2022Updated 3 years ago
- SynPick dataset generator☆13Jul 8, 2021Updated 4 years ago
- ☆13Jul 9, 2018Updated 7 years ago
- [ICASSP 2020] Code release of paper 'Heterogeneous Domain Generalization via Domain Mixup'☆26Aug 3, 2020Updated 5 years ago
- Pytorch implementation for "Dynamic Instance Domain Adaptation" (DIDA-Net, accepted to IEEE T-IP).☆12May 6, 2024Updated last year
- Train, evaluate, and optimize implicit feedback-based recommender systems.☆31Jul 10, 2025Updated 7 months ago
- ☆30Feb 20, 2021Updated 4 years ago
- ☆88Jul 30, 2024Updated last year
- Disagreement-Regularized Imitation Learning☆30May 25, 2021Updated 4 years ago
- Official PyTorch implementation of "Rethinking Value Function Learning for Generalization in Reinforcement Learning" (NeurIPS 2022)☆14Feb 20, 2023Updated 2 years ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆33Dec 14, 2023Updated 2 years ago
- [ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration☆28Feb 10, 2022Updated 4 years ago
- [CoRL 2020] COG: Connecting New Skills to Past Experience with Offline Reinforcement Learning☆34Oct 28, 2020Updated 5 years ago
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)☆55Jul 27, 2021Updated 4 years ago
- Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.☆31Sep 10, 2020Updated 5 years ago
- DMControl Generalization Benchmark☆188Jan 3, 2024Updated 2 years ago
- ☆14Jun 6, 2020Updated 5 years ago
- The Intermediate Goal of the project is to train a GPT like architecture to learn to summarise reddit posts from human preferences, as th…☆12Jul 14, 2021Updated 4 years ago
- Code for paper "Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning".☆14May 23, 2021Updated 4 years ago
- Code accompanying the paper "Information Directed Reward Learning for Reinforcement Learning" (NeurIPS 2021).☆13Nov 16, 2021Updated 4 years ago
- ☆15Oct 11, 2022Updated 3 years ago
- ☆32Feb 21, 2021Updated 4 years ago