maxreciprocate / offlineLinks
Offline RL experiments
☆15Updated 3 years ago
Alternatives and similar repositories for offline
Users that are interested in offline are comparing it to the libraries listed below
Sorting:
- PyTorch Package For Quasimetric Learning☆45Updated last year
- Open source code for paper "Denoised MDPs: Learning World Models Better Than the World Itself"☆136Updated 2 years ago
- Building blocks for productive research☆67Updated 2 weeks ago
- Open source code for paper "On the Learning and Learnability of Quasimetrics".☆32Updated 3 years ago
- AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning (Published in TMLR)☆22Updated last year
- ICML 2022: Learning Iterative Reasoning through Energy Minimization☆48Updated 2 years ago
- Official PyTorch Implementation of the Longhorn Deep State Space Model☆56Updated last year
- GPT implementation in Flax☆18Updated 4 years ago
- Adaptable Agent Populations via a Generative Model of Policies☆12Updated 4 years ago
- ☆19Updated 2 years ago
- Standardized Minecraft Diamond Environment for Reinforcement Learning☆32Updated 2 years ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆57Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆122Updated last year
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆28Updated last year
- Atari-style POMDPs☆23Updated 3 weeks ago
- Docker containers of baseline agents for the Crafter environment☆30Updated 4 years ago
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.☆21Updated 4 months ago
- ☆35Updated last year
- JAX implementation of VQVAE/VQGAN autoencoders (+FSQ)☆40Updated last year
- A collection of meta-learning algorithms in Jax☆24Updated 3 years ago
- Official code for "Accelerating Feedforward Computation via Parallel Nonlinear Equation Solving", ICML 2021☆29Updated 4 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 3 years ago
- General Modules for JAX☆72Updated 4 months ago
- Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023☆30Updated 2 years ago
- Code for the paper "Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference"☆43Updated last year
- MTM Masked Trajectory Models for Prediction, Representation, and Control.☆162Updated last month
- ☆17Updated 3 years ago
- Accelerated replay buffers in JAX☆46Updated 3 years ago
- Original tensorflow implementation of SILOT (Spatially Invariant, Label-free Object Tracking).☆13Updated 2 years ago
- ☆44Updated 2 years ago