jax-ml / jax-tpu-embedding
☆13Updated this week
Alternatives and similar repositories for jax-tpu-embedding:
Users who are interested in jax-tpu-embedding are comparing it to the libraries listed below
- ☆50Updated 5 months ago
- Two implementations of ZeRO-1 optimizer sharding in JAX☆13Updated last year
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.☆104Updated last week
- Experimenting with how best to do multi-host dataloading☆10Updated 2 years ago
- Machine Learning eXperiment Utilities☆45Updated 7 months ago
- ☆181Updated 3 weeks ago
- ☆58Updated 2 years ago
- ☆126Updated this week
- xpk (Accelerated Processing Kit, pronounced x-p-k) is a software tool to help Cloud developers orchestrate training jobs on accelerat…☆96Updated this week
- Experiment of using Tangent to autodiff triton☆74Updated 11 months ago
- A simple library for scaling up JAX programs☆129Updated 2 months ago
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`☆42Updated 7 months ago
- Explore training for quantized models☆12Updated last week
- JAX/Flax rewrite of Karpathy's nanoGPT☆54Updated last year
- A set of Python scripts that makes your experience on TPU better☆44Updated 6 months ago
- Named Tensors for Legible Deep Learning in JAX☆159Updated last week
- ☆73Updated this week
- Proof-of-concept of global switching between numpy/jax/pytorch in a library.☆18Updated 7 months ago
- Inference code for LLaMA models in JAX☆114Updated 7 months ago
- ☆21Updated 2 months ago
- Testing framework for Deep Learning models (TensorFlow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)☆64Updated last month
- Code associated with papers on superposition (in ML interpretability)☆26Updated 2 years ago
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference☆46Updated this week
- ☆106Updated this week
- Some common Hugging Face transformers in maximal update parametrization (µP)☆78Updated 2 years ago
- Blazing fast training of 🤗 Transformers on Graphcore IPUs☆84Updated 10 months ago
- Train very large language models in JAX.☆198Updated last year
- Causal Analysis of Agent Behavior for AI Safety☆17Updated last year
- Train a SmolLM-style LLM on fineweb-edu in JAX/Flax with an assortment of optimizers.☆14Updated 2 weeks ago