alvarobartt / safejax
Serialize JAX, Flax, Haiku, or Objax model params with 🤗 `safetensors`
⭐42 · Updated 5 months ago
Related projects
Alternatives and complementary repositories for safejax
- Automatically take good care of your preemptible TPUs ⭐31 · Updated last year
- Fast, Modern, Memory Efficient, and Low Precision PyTorch Optimizers ⭐58 · Updated 3 months ago
- HomebrewNLP in JAX flavour for maintainable TPU-Training ⭐46 · Updated 9 months ago
- Train vision models using JAX and 🤗 transformers ⭐95 · Updated 2 weeks ago
- ⭐53 · Updated 9 months ago
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods. ⭐29 · Updated last week
- ⭐56 · Updated 2 years ago
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8. ⭐35 · Updated 3 months ago
- ⭐64 · Updated 2 years ago
- some common Huggingface transformers in maximal update parametrization (µP) ⭐76 · Updated 2 years ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.) ⭐32 · Updated 5 months ago
- ⭐20 · Updated last year
- Experiment of using Tangent to autodiff triton ⭐71 · Updated 9 months ago
- ⭐72 · Updated 4 months ago
- A case study of efficient training of large language models using commodity hardware. ⭐68 · Updated 2 years ago
- Mobile Viewer for W&B, built on top of Flutter. ⭐30 · Updated 8 months ago
- Efficient optimizers ⭐42 · Updated this week
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX ⭐78 · Updated 9 months ago
- ⭐76 · Updated 6 months ago
- Multidimensional indexing for tensors ⭐112 · Updated last year
- ⭐76 · Updated 5 months ago
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given… ⭐14 · Updated last year
- ⭐58 · Updated 2 years ago
- The 2D discrete wavelet transform for JAX ⭐38 · Updated last year
- ⭐18 · Updated 6 months ago
- PyTorch interface for TrueGrad Optimizers ⭐39 · Updated last year
- My explorations into editing the knowledge and memories of an attention network ⭐34 · Updated last year
- ⭐31 · Updated 2 months ago
- A place to store reusable transformer components of my own creation or found on the interwebs ⭐43 · Updated this week