google-deepmind / penzaiLinks
A JAX research toolkit for building, editing, and visualizing neural networks.
☆1,811Updated 2 months ago
Alternatives and similar repositories for penzai
Users that are interested in penzai are comparing it to the libraries listed below
Sorting:
- An interactive HTML pretty-printer for machine learning research in IPython notebooks.☆439Updated 2 weeks ago
- Open weights language model from Google DeepMind, based on Griffin.☆647Updated 2 months ago
- Schedule-Free Optimization in PyTorch☆2,202Updated 3 months ago
- Type annotations and runtime checking for shape and dtype of JAX/NumPy/PyTorch/etc. arrays. https://docs.kidger.site/jaxtyping/☆1,505Updated 3 months ago
- Elegant easy-to-use neural networks + scientific computing in JAX. https://docs.kidger.site/equinox/☆2,472Updated last week
- A modern model graph visualizer and debugger☆1,300Updated last week
- PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily wri…☆1,388Updated last week
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆643Updated this week
- Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)☆1,277Updated 8 months ago
- TensorDict is a pytorch dedicated tensor container.☆955Updated this week
- NanoGPT (124M) in 3 minutes☆3,037Updated last month
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…☆622Updated 5 months ago
- ☆444Updated 10 months ago
- JAX - A curated list of resources https://github.com/google/jax☆1,904Updated 6 months ago
- What would you do with 1000 H100s...☆1,087Updated last year
- Optax is a gradient processing and optimization library for JAX.☆1,980Updated last week
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.☆812Updated 3 weeks ago
- A simple, performant and scalable Jax LLM!☆1,867Updated this week
- ☆526Updated last year
- Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs☆523Updated this week
- Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.☆1,589Updated last week
- Library for reading and processing ML training data.☆505Updated this week
- UNet diffusion model in pure CUDA☆615Updated last year
- For optimization algorithm research and development.☆530Updated this week
- ☆275Updated last year
- Puzzles for learning Triton☆1,925Updated 9 months ago
- Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimenta…☆526Updated last week
- Puzzles for exploring transformers☆366Updated 2 years ago
- maximal update parametrization (µP)☆1,584Updated last year
- Minimalistic 4D-parallelism distributed training framework for education purpose☆1,673Updated last month