danijar / granular
Fast dataset format and loader
☆22 · Updated 7 months ago
Alternatives and similar repositories for granular
Users who are interested in granular are comparing it to the libraries listed below.
- Building blocks for productive research ☆59 · Updated 3 weeks ago
- Scalable Opponent Shaping Experiments in JAX ☆24 · Updated last year
- GPT implementation in Flax ☆18 · Updated 3 years ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents" ☆31 · Updated last year
- ☆31 · Updated last year
- Fast and reliable distributed systems in Python ☆28 · Updated 4 months ago
- PyTorch Package For Quasimetric Learning ☆42 · Updated 10 months ago
- Reinforcement Learning inside a 3D soccer simulation ☆29 · Updated 11 months ago
- Standardized Minecraft Diamond Environment for Reinforcement Learning ☆31 · Updated 2 years ago
- Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024 ☆71 · Updated last year
- Accelerated replay buffers in JAX ☆43 · Updated 2 years ago
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024 ☆28 · Updated last year
- Tools and Utils for Experiments (TUX) ☆15 · Updated 7 months ago
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration." ☆33 · Updated last month
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆114 · Updated last year
- Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions ☆68 · Updated last year
- Corax: Core RL in JAX ☆38 · Updated last year
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations" ☆58 · Updated last year
- ☆13 · Updated last year
- Drop-in environment replacements that make your RL algorithm train faster. ☆21 · Updated last year
- Code for Discovered Policy Optimisation (NeurIPS 2022) ☆11 · Updated 2 years ago
- General Modules for JAX ☆67 · Updated 4 months ago
- ☆31 · Updated 9 months ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy ☆18 · Updated 10 months ago
- Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC ☆55 · Updated 2 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax). ☆17 · Updated 3 years ago
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories ☆42 · Updated 2 years ago
- Learning Robust Dynamics Through Variational Sparse Gating ☆20 · Updated 2 years ago
- Code for the paper "Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making" ☆27 · Updated last year
- Official code for "Reward-Free Curricula for Training Robust World Models", ICLR 2024. ☆27 · Updated last year