maxreciprocate / offlineLinks
Offline RL experiments
☆15Updated 2 years ago
Alternatives and similar repositories for offline
Users that are interested in offline are comparing it to the libraries listed below
Sorting:
- Open source code for paper "Denoised MDPs: Learning World Models Better Than the World Itself"☆135Updated 2 years ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆56Updated last year
- ☆15Updated 2 years ago
- Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023☆30Updated 2 years ago
- ☆19Updated 2 years ago
- Official PyTorch Implementation of the Longhorn Deep State Space Model☆53Updated 9 months ago
- PyTorch Package For Quasimetric Learning☆43Updated 10 months ago
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆28Updated last year
- Building blocks for productive research☆61Updated last month
- Pre-Trained Language Models for Interactive Decision-Making [NeurIPS 2022]☆130Updated 3 years ago
- ICML 2022: Learning Iterative Reasoning through Energy Minimization☆48Updated 2 years ago
- Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation)☆67Updated 4 years ago
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆33Updated 2 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆114Updated last year
- Implementation of BC-IRL and other IRL baselines☆28Updated 2 years ago
- Implements the Messenger environment and EMMA model.☆25Updated 2 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 3 years ago
- Standardized Minecraft Diamond Environment for Reinforcement Learning☆31Updated 2 years ago
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.☆21Updated this week
- Official PyTorch implementation of "Discovering Hierarchical Achievements in Reinforcement Learning via Contrastive Learning" (NeurIPS 20…☆35Updated 7 months ago
- Adaptable Agent Populations via a Generative Model of Policies☆13Updated 3 years ago
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- MTM Masked Trajectory Models for Prediction, Representation, and Control.☆158Updated 2 years ago
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu☆105Updated 3 years ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆59Updated 11 months ago
- Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions☆68Updated last year
- Sandbox environment for generalizable agent research☆26Updated 3 years ago
- AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning (Published in TMLR)☆21Updated 11 months ago
- Reinforcement Learning via Supervised Learning☆71Updated 3 years ago
- Code for "Masked Autoencoding for Scalable and Generalizable Decision Making". NeurIPS 2022☆44Updated last year