jdchang1 / miloLinks

☆16

Alternatives and similar repositories for milo

Users that are interested in milo are comparing it to the libraries listed below

Sorting:

sfujim / SR-DICE
Author's PyTorch implementation of SR-DICE for marginalized importance sampling
☆17Updated 3 years ago
rmrafailov / LOMPO
Official Codebase for Offline Reinforcement Learning from Images with Latent Space Models
☆30Updated 4 years ago
ben-eysenbach / info_geometry
Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"
☆20Updated 3 years ago
google-deepmind / active_ops
☆32Updated last year
micahcarroll / uniMASK
Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"
☆56Updated last year
scottemmons / rvs
Reinforcement Learning via Supervised Learning
☆71Updated 3 years ago
Stanford-ILIAD / ELLA
Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.
☆21Updated 4 years ago
keynans / HypeRL
Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)
☆24Updated 4 years ago
mila-iqia / SGI
Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)
☆54Updated 4 years ago
montrealrobotics / iv_rl
IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation
☆40Updated 2 weeks ago
RockySJ / ampo
☆15Updated 4 years ago
frt03 / generalized_dt
Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)
☆67Updated 2 years ago
uber-research / D3G
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32Updated 5 years ago
younggyoseo / CaDM
CaDM: Context-aware Dynamics Model for Generalization in Model-based Reinforcement Learning
☆63Updated 5 years ago
yudasong / HyQ
Official code repo for paper: Hybrid RL: Using both offline and online data can make RL efficient.
☆24Updated 2 years ago
rraileanu / idaac
☆54Updated last year
ben-eysenbach / mnm
Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"
☆20Updated 3 years ago
dido1998 / CausalMBRL
Official data and code for our paper Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning
☆48Updated 4 years ago
taodav / nsrs
Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.
☆14Updated last year
JasonMa2016 / CODAC
Official repository for paper "Conservative Offline Distributional Reinforcement Learning" (NeurIPS 2021)
☆21Updated 4 years ago
suyoung-lee / LDM
Latent Dynamics Mixture, NeurIPS 2021
☆17Updated 2 years ago
tonyzhaozh / meld
MELD: Meta-Reinforcement Learning from Images via Latent State Models https://arxiv.org/abs/2010.13957
☆63Updated 4 years ago
apexrl / COIL
Code for NeurIPS 2021 paper "Curriculum Offline Imitation Learning"
☆18Updated 2 years ago
tgangwani / BMIL
Pytorch code for "Learning Belief Representations for Imitation Learning in POMDPs" (UAI 2019)
☆21Updated 3 years ago
JasonMa2016 / SMODICE
Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML…
☆26Updated 2 years ago
younggyoseo / trajectory_mcl
Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)
☆39Updated 4 years ago
sahandrez / homomorphic_policy_gradient
Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024
☆23Updated last year
ahmed-touati / controllable_agent
☆47Updated 2 years ago
nnaisense / MAGE
Learning Action-Value Gradients in Model-based Policy Optimization
☆31Updated 3 years ago
pairlab / vagram
[ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning.
☆24Updated 2 years ago