maxreciprocate / offline
Offline RL experiments
☆15 · Updated 3 years ago
Alternatives and similar repositories for offline
Users who are interested in offline are comparing it to the libraries listed below.
- Open source code for paper "Denoised MDPs: Learning World Models Better Than the World Itself" ☆136 · Updated 2 years ago
- PyTorch Package For Quasimetric Learning ☆44 · Updated last year
- GPT implementation in Flax ☆18 · Updated 3 years ago
- Building blocks for productive research ☆64 · Updated 4 months ago
- AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning (Published in TMLR) ☆22 · Updated last year
- Official PyTorch Implementation of the Longhorn Deep State Space Model ☆56 · Updated last year
- ☆19 · Updated 2 years ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems" ☆57 · Updated last year
- ICML 2022: Learning Iterative Reasoning through Energy Minimization ☆48 · Updated 2 years ago
- ☆15 · Updated 2 years ago
- MTM Masked Trajectory Models for Prediction, Representation, and Control. ☆161 · Updated 2 years ago
- Implementation of BC-IRL and other IRL baselines ☆28 · Updated 2 years ago
- Sandbox environment for generalizable agent research ☆25 · Updated 3 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆118 · Updated last year
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods. ☆21 · Updated 2 months ago
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024 ☆28 · Updated last year
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted) ☆33 · Updated 2 years ago
- Standardized Minecraft Diamond Environment for Reinforcement Learning ☆31 · Updated 2 years ago
- Docker containers of baseline agents for the Crafter environment ☆30 · Updated 3 years ago
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu ☆106 · Updated 3 years ago
- Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation) ☆68 · Updated 4 years ago
- Source code for the paper "Policy Architectures for Compositional Generalization in Control" ☆30 · Updated 3 years ago
- Open source code for paper "On the Learning and Learnability of Quasimetrics". ☆32 · Updated 3 years ago
- Learning Robust Dynamics Through Variational Sparse Gating ☆20 · Updated 3 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax). ☆17 · Updated 3 years ago
- CaDM: Context-aware Dynamics Model for Generalization in Model-based Reinforcement Learning ☆63 · Updated 5 years ago
- Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023 ☆30 · Updated 2 years ago
- ☆19 · Updated 3 years ago
- Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024 ☆77 · Updated last year
- Code for "Masked Autoencoding for Scalable and Generalizable Decision Making". NeurIPS 2022 ☆47 · Updated last year