google-deepmind / nanodoLinks
☆273Updated 11 months ago
Alternatives and similar repositories for nanodo
Users that are interested in nanodo are comparing it to the libraries listed below
Sorting:
- seqax = sequence modeling + JAX☆163Updated 3 weeks ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆603Updated this week
- 🧱 Modula software package☆202Updated 3 months ago
- ☆132Updated last week
- A simple library for scaling up JAX programs☆139Updated 8 months ago
- Efficient optimizers☆232Updated this week
- Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs☆424Updated this week
- JAX Synergistic Memory Inspector☆175Updated 11 months ago
- Named Tensors for Legible Deep Learning in JAX☆185Updated this week
- LoRA for arbitrary JAX models and functions☆140Updated last year
- Cost aware hyperparameter tuning algorithm☆162Updated last year
- Implementation of Diffusion Transformer (DiT) in JAX☆278Updated last year
- ☆440Updated 8 months ago
- Minimal but scalable implementation of large language models in JAX☆35Updated last week
- ☆511Updated last year
- Puzzles for exploring transformers☆354Updated 2 years ago
- ☆229Updated 5 months ago
- Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox.☆24Updated 9 months ago
- For optimization algorithm research and development.☆521Updated this week
- A MAD laboratory to improve AI architecture designs 🧪☆123Updated 6 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆141Updated 2 weeks ago
- jax-triton contains integrations between JAX and OpenAI Triton☆405Updated 2 weeks ago
- Jax/Flax rewrite of Karpathy's nanoGPT☆58Updated 2 years ago
- Library for reading and processing ML training data.☆474Updated this week
- ☆259Updated this week
- Accelerated First Order Parallel Associative Scan☆182Updated 10 months ago
- MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvement…☆386Updated last week
- A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more.☆288Updated 10 months ago
- JAX-Toolbox☆320Updated this week
- Accelerate, Optimize performance with streamlined training and serving options with JAX.☆288Updated this week