google-deepmind / penzai
A JAX research toolkit for building, editing, and visualizing neural networks.
☆1,817 · Updated 2 months ago
Alternatives and similar repositories for penzai
Users who are interested in penzai are comparing it to the libraries listed below.
- An interactive HTML pretty-printer for machine learning research in IPython notebooks. ☆445 · Updated last month
- Schedule-Free Optimization in PyTorch ☆2,206 · Updated 3 months ago
- Open weights language model from Google DeepMind, based on Griffin. ☆651 · Updated 3 months ago
- A modern model graph visualizer and debugger ☆1,311 · Updated last week
- Elegant easy-to-use neural networks + scientific computing in JAX. https://docs.kidger.site/equinox/ ☆2,516 · Updated 3 weeks ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax ☆658 · Updated this week
- PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily wri… ☆1,410 · Updated last week
- JAX - A curated list of resources https://github.com/google/jax ☆1,918 · Updated 2 weeks ago
- TensorDict is a pytorch dedicated tensor container. ☆967 · Updated last week
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full… ☆622 · Updated 5 months ago
- Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!) ☆1,282 · Updated 9 months ago
- Optax is a gradient processing and optimization library for JAX. ☆2,012 · Updated last week
- Library for reading and processing ML training data. ☆535 · Updated this week
- Type annotations and runtime checking for shape and dtype of JAX/NumPy/PyTorch/etc. arrays. https://docs.kidger.site/jaxtyping/ ☆1,523 · Updated 4 months ago
- A simple, performant and scalable Jax LLM! ☆1,899 · Updated this week
- ☆453 · Updated 10 months ago
- ☆534 · Updated last year
- What would you do with 1000 H100s... ☆1,100 · Updated last year
- Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs ☆605 · Updated this week
- For optimization algorithm research and development. ☆536 · Updated this week
- Puzzles for exploring transformers ☆370 · Updated 2 years ago
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models. ☆813 · Updated last month
- NanoGPT (124M) in 3 minutes ☆3,117 · Updated 2 months ago
- Package for extracting and mapping the results of every single tensor operation in a PyTorch model in one line of code. ☆613 · Updated 6 months ago
- ☆279 · Updated last year
- UNet diffusion model in pure CUDA ☆648 · Updated last year
- Annotated version of the Mamba paper ☆489 · Updated last year
- Tile primitives for speedy kernels ☆2,688 · Updated this week
- Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimenta… ☆535 · Updated 2 weeks ago
- A platform for managing machine learning experiments ☆870 · Updated 3 weeks ago