maxreciprocate / offlineLinks
Offline RL experiments
☆15Updated 2 years ago
Alternatives and similar repositories for offline
Users that are interested in offline are comparing it to the libraries listed below
Sorting:
- Open source code for paper "Denoised MDPs: Learning World Models Better Than the World Itself"☆134Updated 2 years ago
- Open source code for paper "On the Learning and Learnability of Quasimetrics".☆31Updated 2 years ago
- ICML 2022: Learning Iterative Reasoning through Energy Minimization☆47Updated 2 years ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆56Updated last year
- PyTorch Package For Quasimetric Learning☆42Updated 10 months ago
- ☆55Updated 9 months ago
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- ☆19Updated 2 years ago
- AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning (Published in TMLR)☆21Updated 10 months ago
- GPT implementation in Flax☆18Updated 3 years ago
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆33Updated last year
- Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction (ICML 2023)☆48Updated 2 years ago
- Implements the Messenger environment and EMMA model.☆25Updated 2 years ago
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.☆20Updated last year
- Building blocks for productive research☆59Updated last month
- Official PyTorch Implementation of the Longhorn Deep State Space Model☆54Updated 8 months ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆28Updated 2 years ago
- ☆10Updated 2 years ago
- MTM Masked Trajectory Models for Prediction, Representation, and Control.☆158Updated 2 years ago
- Original tensorflow implementation of SILOT (Spatially Invariant, Label-free Object Tracking).☆13Updated 2 years ago
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆28Updated last year
- Standardized Minecraft Diamond Environment for Reinforcement Learning☆31Updated 2 years ago
- Adaptable Agent Populations via a Generative Model of Policies☆13Updated 3 years ago
- Learning Robust Dynamics Through Variational Sparse Gating☆20Updated 2 years ago
- Accelerated replay buffers in JAX☆43Updated 2 years ago
- Sandbox environment for generalizable agent research☆26Updated 3 years ago
- Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation)☆66Updated 4 years ago
- Generalised UDRL☆37Updated 3 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆114Updated last year
- ☆15Updated 2 years ago