jdchang1 / milo
☆15Updated 3 years ago
Alternatives and similar repositories for milo:
Users that are interested in milo are comparing it to the libraries listed below
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆15Updated 3 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 4 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆31Updated 3 years ago
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)☆52Updated 3 years ago
- Official Codebase for Offline Reinforcement Learning from Images with Latent Space Models☆31Updated 3 years ago
- Representation Learning in RL☆16Updated 2 years ago
- Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)☆39Updated 4 years ago
- ☆29Updated 2 years ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆54Updated 6 months ago
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.☆19Updated 3 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Updated 3 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆38Updated 2 months ago
- Generalised UDRL☆37Updated 2 years ago
- ☆15Updated 4 years ago
- ☆41Updated 3 years ago
- ☆32Updated 5 months ago
- Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML…☆25Updated 2 years ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆22Updated 9 months ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆44Updated last year
- Implementation of Tactical Optimistic and Pessimistic value estimation☆24Updated last year
- Change-Based Exploration Transfer☆36Updated 2 years ago
- Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)☆25Updated 3 years ago
- Official code repo for paper: Hybrid RL: Using both offline and online data can make RL efficient.☆24Updated last year
- Implementation of VALOR (Variational Option Discovery Algorithms)☆10Updated 5 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆60Updated last year
- Model-Based Reinforcement Learning via Latent-Space Collocation.☆32Updated last year
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆36Updated 2 years ago
- ☆16Updated 2 years ago
- ☆18Updated last year
- ☆34Updated 2 years ago