maxreciprocate / offlineLinks
Offline RL experiments
☆15Updated 3 years ago
Alternatives and similar repositories for offline
Users that are interested in offline are comparing it to the libraries listed below
Sorting:
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆57Updated last year
- Open source code for paper "Denoised MDPs: Learning World Models Better Than the World Itself"☆136Updated 2 years ago
- ☆19Updated 2 years ago
- Open source code for paper "On the Learning and Learnability of Quasimetrics".☆32Updated 3 years ago
- Building blocks for productive research☆67Updated 3 weeks ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆29Updated 3 years ago
- Code for our NeurIPS 2020 paper Improving Generalization in Reinforcement Learning with Mixture Regularization☆35Updated 5 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆122Updated last year
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 3 years ago
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆28Updated last year
- ☆15Updated 2 years ago
- GPT implementation in Flax☆18Updated 4 years ago
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu☆106Updated 3 years ago
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆33Updated 2 years ago
- Standardized Minecraft Diamond Environment for Reinforcement Learning☆33Updated 2 years ago
- Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023☆30Updated 2 years ago
- MTM Masked Trajectory Models for Prediction, Representation, and Control.☆162Updated last month
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.☆21Updated 4 months ago
- Reinforcement Learning via Supervised Learning☆72Updated 3 years ago
- ICML 2022: Learning Iterative Reasoning through Energy Minimization☆48Updated 2 years ago
- Docker containers of baseline agents for the Crafter environment☆30Updated 4 years ago
- JAX implementation of VQVAE/VQGAN autoencoders (+FSQ)☆41Updated last year
- AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning (Published in TMLR)☆23Updated last year
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Updated 4 years ago
- PyTorch Package For Quasimetric Learning☆45Updated last year
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆57Updated last year
- Code for "Masked Autoencoding for Scalable and Generalizable Decision Making". NeurIPS 2022☆47Updated last year
- Sandbox environment for generalizable agent research☆27Updated 3 years ago
- ☆17Updated 3 years ago
- Implements the Messenger environment and EMMA model.☆25Updated 2 years ago