p-doom / jasmine
A simple, performant and scalable JAX-based world modeling codebase.
☆119Updated 2 months ago
Alternatives and similar repositories for jasmine
Users that are interested in jasmine are comparing it to the libraries listed below
- ☆122Updated 7 months ago
- Efficient World Models with Context-Aware Tokenization. ICML 2024☆116Updated last year
- Synchronized Curriculum Learning for RL Agents☆117Updated 2 months ago
- Minimal but scalable implementation of large language models in JAX☆35Updated last month
- ☆133Updated last month
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆132Updated last year
- JAX implementation of VQVAE/VQGAN autoencoders (+FSQ)☆40Updated last year
- A simple library for scaling up JAX programs☆144Updated 2 months ago
- XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning (ICLR 2025)☆81Updated 10 months ago
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code (ICLR 2025).☆72Updated last year
- Supporting code for the blog post on modular manifolds.☆111Updated 3 months ago
- Efficient baselines for autocurricula in JAX.☆206Updated last year
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."☆35Updated 6 months ago
- Implementation of the new SOTA for model based RL, from the paper "Improving Transformer World Models for Data-Efficient RL", in Pytorch☆148Updated 8 months ago
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆36Updated last year
- Implementation of Soft Actor Critic and some of its improvements in Pytorch☆63Updated 2 weeks ago
- ☆35Updated last year
- ☆51Updated 2 months ago
- Implementation of PSGD optimizer in JAX☆35Updated last year
- JAX reimplementation of the DeepMind paper "Genie: Generative Interactive Environments"☆97Updated 11 months ago
- Drop-in environment replacements that make your RL algorithm train faster.☆21Updated last year
- GPT implementation in Flax☆18Updated 4 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆120Updated last year
- Cost aware hyperparameter tuning algorithm☆177Updated last year
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆107Updated last month
- Code for the paper "Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making"☆29Updated last year
- ☆91Updated 4 months ago
- Accelerated replay buffers in JAX☆46Updated 3 years ago
- 📄Small Batch Size Training for Language Models☆79Updated 3 months ago
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.☆21Updated 3 months ago