maxreciprocate / offline
Offline RL experiments
☆15 Updated 2 years ago
Alternatives and similar repositories for offline
Users who are interested in offline are comparing it to the libraries listed below.
- Open source code for paper "Denoised MDPs: Learning World Models Better Than the World Itself" ☆134 Updated last year
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems" ☆56 Updated last year
- Implements the Messenger environment and EMMA model. ☆25 Updated 2 years ago
- PyTorch Package For Quasimetric Learning ☆42 Updated 9 months ago
- ICML 2022: Learning Iterative Reasoning through Energy Minimization ☆46 Updated 2 years ago
- Open source code for paper "On the Learning and Learnability of Quasimetrics". ☆31 Updated 2 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT) ☆28 Updated 2 years ago
- ☆54 Updated 9 months ago
- Code repository complementing the ICLR 2021 paper "Unsupervised Object Keypoint Learning using Local Spatial Predictability" (https://arx… ☆9 Updated 7 months ago
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted) ☆33 Updated last year
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024 ☆28 Updated last year
- Adaptable Agent Populations via a Generative Model of Policies ☆13 Updated 3 years ago
- Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions ☆68 Updated 11 months ago
- ☆19 Updated 2 years ago
- GPT implementation in Flax ☆18 Updated 3 years ago
- ☆55 Updated 2 years ago
- AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning (Published in TMLR) ☆19 Updated 9 months ago
- An implementation of MuZero in JAX. ☆56 Updated 2 years ago
- Latent World Models For Intrinsically Motivated Exploration | Official repository ☆22 Updated 4 years ago
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods. ☆20 Updated last year
- Pre-Trained Language Models for Interactive Decision-Making [NeurIPS 2022] ☆128 Updated 3 years ago
- Docker containers of baseline agents for the Crafter environment ☆28 Updated 3 years ago
- MTM Masked Trajectory Models for Prediction, Representation, and Control. ☆157 Updated 2 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax). ☆17 Updated 3 years ago
- Official PyTorch Implementation of the Longhorn Deep State Space Model ☆54 Updated 8 months ago
- ☆31 Updated 4 years ago
- ☆20 Updated 2 years ago
- Original tensorflow implementation of SILOT (Spatially Invariant, Label-free Object Tracking). ☆13 Updated 2 years ago
- Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020) ☆39 Updated 4 years ago
- Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation) ☆66 Updated 4 years ago