google-deepmind / penzai
A JAX research toolkit for building, editing, and visualizing neural networks.
☆1,769Updated last week
Alternatives and similar repositories for penzai:
Users that are interested in penzai are comparing it to the libraries listed below
- Open weights language model from Google DeepMind, based on Griffin.☆636Updated 2 months ago
- An interactive HTML pretty-printer for machine learning research in IPython notebooks.☆411Updated last week
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆569Updated this week
- Elegant easy-to-use neural networks + scientific computing in JAX. https://docs.kidger.site/equinox/☆2,336Updated last week
- Schedule-Free Optimization in PyTorch☆2,150Updated 3 weeks ago
- Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)☆1,252Updated 4 months ago
- Optax is a gradient processing and optimization library for JAX.☆1,875Updated this week
- Thunder gives you PyTorch models superpowers for training and inference. Unlock out-of-the-box optimizations for performance, memory and …☆1,335Updated this week
- ☆430Updated 6 months ago
- A simple, performant and scalable Jax LLM!☆1,708Updated this week
- TensorDict is a pytorch dedicated tensor container.☆920Updated this week
- Type annotations and runtime checking for shape and dtype of JAX/NumPy/PyTorch/etc. arrays. https://docs.kidger.site/jaxtyping/☆1,390Updated this week
- What would you do with 1000 H100s...☆1,043Updated last year
- JAX - A curated list of resources https://github.com/google/jax☆1,796Updated 2 months ago
- For optimization algorithm research and development.☆509Updated this week
- A modern model graph visualizer and debugger☆1,175Updated this week
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…☆607Updated last month
- Minimalistic 4D-parallelism distributed training framework for education purpose☆1,346Updated last month
- Library for reading and processing ML training data.☆434Updated this week
- ☆848Updated this week
- Puzzles for exploring transformers☆344Updated 2 years ago
- ☆446Updated 9 months ago
- NanoGPT (124M) in 3 minutes☆2,520Updated last week
- JAX-based neural network library☆3,021Updated this week
- Monte Carlo tree search in JAX☆2,473Updated 3 weeks ago
- ☆217Updated 9 months ago
- A Jax-based library for designing and training small transformers.☆286Updated 8 months ago
- A Graph Neural Network Library in Jax☆1,427Updated last year
- Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimenta…☆491Updated last week
- Implementation of Diffusion Transformer (DiT) in JAX☆271Updated 10 months ago