alvarobartt / safejaxLinks
Serialize JAX, Flax, Haiku, or Objax model params with π€`safetensors`
β46Updated last year
Alternatives and similar repositories for safejax
Users that are interested in safejax are comparing it to the libraries listed below
Sorting:
- HomebrewNLP in JAX flavour for maintable TPU-Trainingβ50Updated last year
- Automatically take good care of your preemptible TPUsβ36Updated 2 years ago
- β60Updated 3 years ago
- LoRA for arbitrary JAX models and functionsβ142Updated last year
- β62Updated 3 years ago
- Image augmentation library for Jaxβ40Updated last year
- Train vision models using JAX and π€ transformersβ100Updated 2 weeks ago
- A case study of efficient training of large language models using commodity hardware.β68Updated 3 years ago
- If it quacks like a tensor...β59Updated 10 months ago
- minGPT in JAXβ48Updated 3 years ago
- A place to store reusable transformer components of my own creation or found on the interwebsβ60Updated last week
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAXβ89Updated last year
- Experiment of using Tangent to autodiff tritonβ81Updated last year
- β31Updated 3 months ago
- A functional training loops library for JAXβ88Updated last year
- JAX Implementation of Black Forest Labs' Flux.1 family of modelsβ38Updated last month
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.β46Updated last year
- β115Updated 3 weeks ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)β189Updated 3 years ago
- Fast, Modern, and Low Precision PyTorch Optimizersβ112Updated last month
- Amos optimizer with JEstimator lib.β82Updated last year
- My explorations into editing the knowledge and memories of an attention networkβ35Updated 2 years ago
- Code implementing "Efficient Parallelization of a Ubiquitious Sequential Computation" (Heinsen, 2023)β93Updated 10 months ago
- β20Updated 2 years ago
- DiCE: The Infinitely Differentiable Monte-Carlo Estimatorβ31Updated 2 years ago
- JAX Synergistic Memory Inspectorβ180Updated last year
- An implementation of the Llama architecture, to instruct and delightβ21Updated 4 months ago
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.β123Updated 2 weeks ago
- Neural Networks for JAXβ84Updated last year
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation preconditionβ¦β183Updated last week