stanford-crfm / levanter
Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax
☆492Updated this week
Related projects: ⓘ
- ☆176Updated 2 months ago
- ☆201Updated 2 months ago
- Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimenta…☆445Updated last week
- ☆325Updated 11 months ago
- ☆253Updated this week
- seqax = sequence modeling + JAX☆129Updated 2 months ago
- Puzzles for exploring transformers☆293Updated last year
- JAX-Toolbox☆231Updated this week
- MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvement…☆321Updated 2 weeks ago
- Annotated version of the Mamba paper☆445Updated 6 months ago
- A Jax-based library for designing and training transformer models from scratch.☆271Updated 3 weeks ago
- Building blocks for foundation models.☆345Updated 8 months ago
- JAX Synergistic Memory Inspector☆161Updated 2 months ago
- Named Tensors for Legible Deep Learning in JAX☆146Updated this week
- jax-triton contains integrations between JAX and OpenAI Triton☆328Updated this week
- Orbax provides common checkpointing and persistence utilities for JAX users☆283Updated this week
- ☆172Updated last week
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.☆452Updated last week
- ☆322Updated 5 months ago
- ☆202Updated 4 months ago
- JAX implementation of the Llama 2 model☆205Updated 7 months ago
- What would you do with 1000 H100s...☆816Updated 8 months ago
- CLU lets you write beautiful training loops in JAX.☆318Updated 3 weeks ago
- ☆247Updated this week
- Inference code for LLaMA models in JAX☆108Updated 3 months ago
- An interactive exploration of Transformer programming.☆243Updated 10 months ago
- TensorDict is a pytorch dedicated tensor container.☆807Updated this week
- For optimization algorithm research and development.☆240Updated last week
- Tools for understanding how transformer predictions are built layer-by-layer☆408Updated 3 months ago
- A simple library for scaling up JAX programs☆116Updated last month