google-deepmind / penzai
A JAX research toolkit for building, editing, and visualizing neural networks.
☆1,746Updated 3 months ago
Alternatives and similar repositories for penzai:
Users that are interested in penzai are comparing it to the libraries listed below
- Schedule-Free Optimization in PyTorch☆2,116Updated 3 weeks ago
- Tile primitives for speedy kernels☆2,153Updated this week
- Open weights language model from Google DeepMind, based on Griffin.☆627Updated last month
- An interactive HTML pretty-printer for machine learning research in IPython notebooks.☆398Updated this week
- TensorDict is a pytorch dedicated tensor container.☆898Updated this week
- What would you do with 1000 H100s...☆1,016Updated last year
- Puzzles for learning Triton☆1,508Updated 4 months ago
- Elegant easy-to-use neural networks + scientific computing in JAX. https://docs.kidger.site/equinox/☆2,263Updated this week
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…☆601Updated 3 months ago
- Optax is a gradient processing and optimization library for JAX.☆1,828Updated last week
- JAX - A curated list of resources https://github.com/google/jax☆1,731Updated 3 weeks ago
- For optimization algorithm research and development.☆498Updated this week
- A modern model graph visualizer and debugger☆1,141Updated last week
- Type annotations and runtime checking for shape and dtype of JAX/NumPy/PyTorch/etc. arrays. https://docs.kidger.site/jaxtyping/☆1,348Updated this week
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆557Updated this week
- A simple, performant and scalable Jax LLM!☆1,653Updated this week
- Minimalistic 4D-parallelism distributed training framework for education purpose☆935Updated 2 weeks ago
- NanoGPT (124M) in 3 minutes☆2,403Updated this week
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.☆783Updated 2 weeks ago
- ☆420Updated 5 months ago
- Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors a…☆1,310Updated this week
- A Jax-based library for designing and training transformer models from scratch.☆282Updated 6 months ago
- Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)☆1,252Updated 3 months ago
- LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.☆524Updated 4 months ago
- A PyTorch native library for large model training☆3,470Updated this week
- Textbook on reinforcement learning from human feedback☆488Updated this week
- A library for mechanistic interpretability of GPT-style language models☆1,960Updated last week
- ☆407Updated 8 months ago
- Implementation of Diffusion Transformer (DiT) in JAX☆269Updated 9 months ago
- ☆832Updated this week