maxreciprocate / offlineLinks
Offline RL experiments
☆15Updated 3 years ago
Alternatives and similar repositories for offline
Users that are interested in offline are comparing it to the libraries listed below
Sorting:
- Open source code for paper "Denoised MDPs: Learning World Models Better Than the World Itself"☆136Updated 2 years ago
- PyTorch Package For Quasimetric Learning☆44Updated last year
- Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023☆30Updated 2 years ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆57Updated last year
- ☆19Updated 2 years ago
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"☆26Updated 2 months ago
- AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning (Published in TMLR)☆22Updated last year
- Implementation of BC-IRL and other IRL baselines☆28Updated 2 years ago
- Open source code for paper "On the Learning and Learnability of Quasimetrics".☆32Updated 3 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆120Updated last year
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆29Updated 3 years ago
- Implements the Messenger environment and EMMA model.☆25Updated 2 years ago
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆28Updated last year
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆33Updated 2 years ago
- MTM Masked Trajectory Models for Prediction, Representation, and Control.☆162Updated 2 weeks ago
- Learning Robust Dynamics Through Variational Sparse Gating☆20Updated 3 years ago
- JAX implementation of VQVAE/VQGAN autoencoders (+FSQ)☆40Updated last year
- Rewarded soups official implementation☆62Updated 2 years ago
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu☆106Updated 3 years ago
- ☆32Updated 4 years ago
- Code for Contrastive Preference Learning (CPL)☆177Updated last year
- GPT implementation in Flax☆18Updated 3 years ago
- Pre-Trained Language Models for Interactive Decision-Making [NeurIPS 2022]☆130Updated 3 years ago
- This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity☆47Updated last year
- Sandbox environment for generalizable agent research☆25Updated 3 years ago
- Adaptable Agent Populations via a Generative Model of Policies☆12Updated 4 years ago
- Standardized Minecraft Diamond Environment for Reinforcement Learning☆32Updated 2 years ago
- Building blocks for productive research☆66Updated 5 months ago
- Generalised UDRL☆37Updated 3 years ago
- Reinforcement Learning via Supervised Learning☆72Updated 3 years ago