google-deepmind / penzai
A JAX research toolkit for building, editing, and visualizing neural networks.
☆1,714 Updated last month
Alternatives and similar repositories for penzai:
Users who are interested in penzai are comparing it to the libraries listed below:
- Open weights language model from Google DeepMind, based on Griffin. ☆614 Updated 6 months ago
- An interactive HTML pretty-printer for machine learning research in IPython notebooks. ☆373 Updated last month
- Puzzles for learning Triton ☆1,300 Updated last month
- What would you do with 1000 H100s... ☆948 Updated last year
- Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors a… ☆1,257 Updated this week
- A modern model graph visualizer and debugger ☆1,098 Updated this week
- Schedule-Free Optimization in PyTorch ☆2,061 Updated last month
- JAX - A curated list of resources https://github.com/google/jax ☆1,660 Updated 6 months ago
- Type annotations and runtime checking for shape and dtype of JAX/NumPy/PyTorch/etc. arrays. https://docs.kidger.site/jaxtyping/ ☆1,300 Updated last week
- NanoGPT (124M) in 3.4 minutes ☆2,068 Updated last week
- Optax is a gradient processing and optimization library for JAX. ☆1,754 Updated this week
- Minimalistic 4D-parallelism distributed training framework for educational purposes ☆644 Updated this week
- Tile primitives for speedy kernels ☆1,923 Updated this week
- ☆413 Updated 2 months ago
- LeanRL is a fork of CleanRL, where selected PyTorch scripts are optimized for performance using compile and cudagraphs. ☆486 Updated 2 months ago
- TensorDict is a PyTorch-dedicated tensor container. ☆862 Updated this week
- For optimization algorithm research and development. ☆484 Updated this week
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models. ☆752 Updated this week
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full… ☆605 Updated last month
- Dermatology ddx dataset, JAX implementations of Monte Carlo conformal prediction, plausibility regions and statistical annotation aggrega… ☆642 Updated 9 months ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and JAX ☆534 Updated this week
- A simple, performant and scalable JAX LLM! ☆1,587 Updated this week
- JAX-based neural network library ☆2,939 Updated last month
- Tensors, for human consumption ☆1,178 Updated last month
- Puzzles for exploring transformers ☆331 Updated last year
- Elegant easy-to-use neural networks + scientific computing in JAX. https://docs.kidger.site/equinox/ ☆2,193 Updated last week
- UNet diffusion model in pure CUDA ☆596 Updated 6 months ago
- Annotated version of the Mamba paper ☆469 Updated 10 months ago
- Library for reading and processing ML training data. ☆355 Updated this week
- Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!) ☆1,239 Updated last month