kaixin96/mixreg

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kaixin96/mixreg)

kaixin96 / mixreg

Code for our NeurIPS 2020 paper Improving Generalization in Reinforcement Learning with Mixture Regularization

☆34

Alternatives and similar repositories for mixreg

Users that are interested in mixreg are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

rraileanu / idaac
View on GitHub
☆55Feb 28, 2024Updated 2 years ago
TalhaRehmanMTRKT / microgrid-optimization-examples
View on GitHub
☆20May 1, 2024Updated 2 years ago
rraileanu / policy-dynamics-value-functions
View on GitHub
☆33Aug 30, 2024Updated last year
Stanford-ILIAD / ELLA
View on GitHub
Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.
☆21Mar 9, 2021Updated 5 years ago
microsoft / IBAC-SNI
View on GitHub
Code to reproduce the NeurIPS 2019 paper "Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottlen…
☆52Jun 28, 2020Updated 6 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
aypan17 / reward-misspecification
View on GitHub
☆10Mar 13, 2023Updated 3 years ago
snu-mllab / DCPG
View on GitHub
Official PyTorch implementation of "Rethinking Value Function Learning for Generalization in Reinforcement Learning" (NeurIPS 2022)
☆15Feb 20, 2023Updated 3 years ago
taodav / nsrs
View on GitHub
Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.
☆14Jul 16, 2024Updated 2 years ago
pokaxpoka / netrand
View on GitHub
Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning / ICLR 2020
☆57Apr 27, 2020Updated 6 years ago
joonleesky / train-procgen-pytorch
View on GitHub
Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.
☆31Sep 10, 2020Updated 5 years ago
kaixin96 / rl-generalization-paper
View on GitHub
A list of papers regarding generalization in (deep) reinforcement learning
☆156Aug 12, 2023Updated 2 years ago
sfujim / SR-DICE
View on GitHub
Author's PyTorch implementation of SR-DICE for marginalized importance sampling
☆28Dec 7, 2021Updated 4 years ago
frt03 / generalized_dt
View on GitHub
Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)
☆70Aug 8, 2022Updated 3 years ago
rraileanu / auto-drac
View on GitHub
Automatic Data-Regularized Actor-Critic (Auto-DrAC)
☆104Mar 24, 2023Updated 3 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
ToruOwO / mimex
View on GitHub
MIMEx: Intrinsic Rewards from Masked Input Modeling [NeurIPS 2023]
☆16May 17, 2023Updated 3 years ago
nicklashansen / dmcontrol-generalization-benchmark
View on GitHub
DMControl Generalization Benchmark
☆189Jan 3, 2024Updated 2 years ago
DesikRengarajan / LOGO
View on GitHub
[ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration
☆28Feb 10, 2022Updated 4 years ago
zihangJiang / Adaptive-Attention
View on GitHub
☆30Apr 6, 2021Updated 5 years ago
tgangwani / SelfImitationDiverse
View on GitHub
Tensorflow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019)
☆20Nov 26, 2020Updated 5 years ago
QData / dmc_remastered
View on GitHub
A version of the DeepMind Control Suite with randomly generated graphics, for measuring visual generalization in continuous control.
☆20Oct 19, 2020Updated 5 years ago
kavosh8 / Lip
View on GitHub
☆13Jul 9, 2018Updated 8 years ago
frt03 / jax_dt
View on GitHub
Minimal Decision Transformer Implementation written in Jax (Flax).
☆18Aug 8, 2022Updated 3 years ago
xuanlinli17 / iclr2021_rlreg
View on GitHub
Regularization Matters in Policy Optimization
☆21Nov 1, 2021Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
machine-teaching-group / neurips2022_exploration-guided-reward-shaping
View on GitHub
☆17Oct 11, 2022Updated 3 years ago
FLAG250 / hoshino-plugin-pjsk
View on GitHub
适用于hoshinobot的pjsk信息查询插件
☆14Mar 15, 2023Updated 3 years ago
mklissa / phi_gcn
View on GitHub
Reward Propagation using Graph Convolutional Networks
☆13Jun 19, 2021Updated 5 years ago
banma12956 / HIPI-RL
View on GitHub
☆10Jun 22, 2020Updated 6 years ago
apexrl / EBIL-torch
View on GitHub
Pytorch Implementation of AAMAS 2021 paper <Energy-Based Imitation Learning>
☆12Oct 8, 2021Updated 4 years ago
Zhongying-Deng / DIDA
View on GitHub
Pytorch implementation for "Dynamic Instance Domain Adaptation" (DIDA-Net, accepted to IEEE T-IP).
☆12May 6, 2024Updated 2 years ago
illidanlab / opolo-code
View on GitHub
☆32Mar 4, 2021Updated 5 years ago
tohtsky / irspack
View on GitHub
Train, evaluate, and optimize implicit feedback-based recommender systems.
☆31Updated this week
eleurent / social-attention
View on GitHub
Social Attention for Autonomous Decision-Making in Dense Traffic
☆23Oct 30, 2021Updated 4 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
wyf0912 / MIXALL
View on GitHub
[ICASSP 2020] Code release of paper 'Heterogeneous Domain Generalization via Domain Mixup'
☆26Aug 3, 2020Updated 5 years ago
chrhenning / posterior_replay_cl
View on GitHub
Continual learning of task-specific approximations of the parameter posterior distribution via a shared hypernetwork.
☆16Nov 1, 2024Updated last year
montaserFath / BCO
View on GitHub
behavior cloning from observation
☆38Dec 14, 2020Updated 5 years ago
llan-ml / MetaTNE
View on GitHub
Source code for NeurIPS 2020 paper "Node Classification on Graphs with Few-Shot Novel Labels via Meta Transformed Network Embedding"
☆10Nov 17, 2020Updated 5 years ago
clvrai / agile
View on GitHub
Official implementation of "Know Your Action Set: Learning Action Relations for Reinforcement Learning", Jain et al., ICLR 2022.
☆18Mar 16, 2022Updated 4 years ago
quanvuong / Supervised_Policy_Update
View on GitHub
Code to reproduce Supervised Policy Update (ICLR 2019)
☆17Dec 8, 2022Updated 3 years ago
xkianteb / dril
View on GitHub
Disagreement-Regularized Imitation Learning
☆30May 25, 2021Updated 5 years ago