alvarobartt / safejax
Serialize JAX, Flax, Haiku, or Objax model params with ๐ค`safetensors`
โ44Updated 10 months ago
Alternatives and similar repositories for safejax:
Users that are interested in safejax are comparing it to the libraries listed below
- Automatically take good care of your preemptible TPUsโ36Updated last year
- HomebrewNLP in JAX flavour for maintable TPU-Trainingโ49Updated last year
- LoRA for arbitrary JAX models and functionsโ135Updated last year
- โ60Updated 3 years ago
- โ53Updated last year
- Experiment of using Tangent to autodiff tritonโ78Updated last year
- Train vision models using JAX and ๐ค transformersโ97Updated 2 months ago
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.โ30Updated 3 months ago
- โ87Updated 2 weeks ago
- Named Tensors for Legible Deep Learning in JAXโ168Updated this week
- Running Jax in PyTorch Lightningโ90Updated 3 months ago
- If it quacks like a tensor...โ57Updated 4 months ago
- This is a port of Mistral-7B model in JAXโ32Updated 9 months ago
- PyTorch interface for TrueGrad Optimizersโ42Updated last year
- Engineering the state of RNN language models (Mamba, RWKV, etc.)โ32Updated 10 months ago
- โ76Updated 8 months ago
- Image augmentation library for Jaxโ39Updated 11 months ago
- JAX Implementation of Black Forest Labs' Flux.1 family of modelsโ30Updated 5 months ago
- โ17Updated 7 months ago
- โ19Updated this week
- Machine Learning eXperiment Utilitiesโ46Updated 9 months ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.โ17Updated 2 weeks ago
- JAX Synergistic Memory Inspectorโ171Updated 8 months ago
- some common Huggingface transformers in maximal update parametrization (ยตP)โ80Updated 3 years ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAXโ83Updated last year
- โ67Updated 2 years ago
- A metrics library for the JAX ecosystemโ40Updated 2 years ago
- โ20Updated last year
- A stateful pytree library for training neural networks.โ21Updated 2 years ago
- โ79Updated 11 months ago