alvarobartt / safejaxLinks
Serialize JAX, Flax, Haiku, or Objax model params with ๐ค`safetensors`
โ44Updated last year
Alternatives and similar repositories for safejax
Users that are interested in safejax are comparing it to the libraries listed below
Sorting:
- Automatically take good care of your preemptible TPUsโ36Updated 2 years ago
- HomebrewNLP in JAX flavour for maintable TPU-Trainingโ50Updated last year
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.โ30Updated last week
- Train vision models using JAX and ๐ค transformersโ97Updated last month
- Experiment of using Tangent to autodiff tritonโ79Updated last year
- โ53Updated last year
- โ19Updated 2 weeks ago
- LoRA for arbitrary JAX models and functionsโ136Updated last year
- โ33Updated 8 months ago
- โ78Updated 11 months ago
- Fast, Modern, Memory Efficient, and Low Precision PyTorch Optimizersโ93Updated 10 months ago
- AdamW optimizer for bfloat16 models in pytorch ๐ฅ.โ32Updated 11 months ago
- Inference code for LLaMA models in JAXโ117Updated last year
- LayerNorm(SmallInit(Embedding)) in a Transformer to improve convergenceโ60Updated 3 years ago
- PyTorch interface for TrueGrad Optimizersโ42Updated last year
- JAX Implementation of Black Forest Labs' Flux.1 family of modelsโ33Updated 7 months ago
- โ20Updated last year
- โ60Updated 3 years ago
- some common Huggingface transformers in maximal update parametrization (ยตP)โ80Updated 3 years ago
- Mobile Viewer for W&B, built on top of Flutter.โ34Updated last year
- DiCE: The Infinitely Differentiable Monte-Carlo Estimatorโ31Updated last year
- This is a port of Mistral-7B model in JAXโ32Updated 11 months ago
- โ17Updated 9 months ago
- โ18Updated last year
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.โ17Updated 2 months ago
- An implementation of the Llama architecture, to instruct and delightโ21Updated this week
- minGPT in JAXโ48Updated 3 years ago
- โ23Updated 5 months ago
- Proof-of-concept of global switching between numpy/jax/pytorch in a library.โ18Updated 11 months ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAXโ81Updated last year