jax-ml / australis
☆29Updated last year
Related projects:
- ☆27Updated 4 months ago
- Automatically take good care of your preemptible TPUs☆28Updated last year
- ☆56Updated 2 years ago
- ☆27Updated this week
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`☆41Updated 3 months ago
- ☆17Updated 4 months ago
- A JAX implementation of stochastic addition.☆12Updated 2 years ago
- Experiment of using Tangent to autodiff triton☆66Updated 7 months ago
- ☆56Updated 2 years ago
- This is a port of Mistral-7B model in JAX☆29Updated 2 months ago
- PyTorch interface for TrueGrad Optimizers☆39Updated last year
- HomebrewNLP in JAX flavour for maintainable TPU-Training☆46Updated 8 months ago
- ☆28Updated this week
- ☆38Updated last year
- Neural Networks for JAX☆82Updated 2 months ago
- A collection of optimizers, some arcane, others well known, for Flax.☆29Updated 3 years ago
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator☆29Updated last year
- A case study of efficient training of large language models using commodity hardware.☆68Updated 2 years ago
- Multi-framework implementation of Deep Kernel Shaping and Tailored Activation Transformations, which are methods that modify neural netwo…☆57Updated last month
- A metrics library for the JAX ecosystem☆36Updated last year
- This repository contains example code to build models on TPUs☆30Updated last year
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper☆57Updated 10 months ago
- ☆24Updated last year
- Various transformers for FSDP research☆31Updated last year
- A place to store reusable transformer components of my own creation or found on the interwebs☆43Updated 3 weeks ago
- Latent Diffusion Language Models☆66Updated last year
- Utilities for PyTorch distributed☆23Updated 11 months ago
- minGPT in JAX☆45Updated 2 years ago
- A port of muP to JAX/Haiku☆25Updated last year
- RWKV model implementation☆38Updated last year