google-deepmind / treescope
An interactive HTML pretty-printer for machine learning research in IPython notebooks.
☆411Updated last week
Alternatives and similar repositories for treescope:
Users that are interested in treescope are comparing it to the libraries listed below
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…☆607Updated last month
- ☆217Updated 9 months ago
- A tool to analyze and debug neural networks in pytorch. Use a GUI to traverse the computation graph and view the data from many different…☆287Updated 4 months ago
- Universal Tensor Operations in Einstein-Inspired Notation for Python.☆367Updated 3 weeks ago
- A simple & elegant experiment tracking framework that integrates persistence logic & best practices directly into Python☆524Updated 3 months ago
- A JAX research toolkit for building, editing, and visualizing neural networks.☆1,769Updated last week
- ☆430Updated 6 months ago
- A pure NumPy implementation of Mamba.☆222Updated 9 months ago
- A Jax-based library for designing and training small transformers.☆286Updated 8 months ago
- ☆246Updated 7 months ago
- 🧱 Modula software package☆188Updated last month
- Open weights language model from Google DeepMind, based on Griffin.☆636Updated 2 months ago
- ☆241Updated last year
- TensorDict is a pytorch dedicated tensor container.☆920Updated this week
- For optimization algorithm research and development.☆509Updated this week
- Uncertainty quantification with PyTorch☆354Updated 3 weeks ago
- Compositional Linear Algebra☆475Updated last month
- ☆150Updated 8 months ago
- Puzzles for exploring transformers☆344Updated 2 years ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆569Updated this week
- Scalable and Performant Data Loading☆247Updated this week
- ☆236Updated 3 months ago
- Machine Learning with Symbolic Tensors☆267Updated 2 months ago
- Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs☆258Updated last week
- Named Tensors for Legible Deep Learning in JAX☆172Updated this week
- Cost aware hyperparameter tuning algorithm☆150Updated 10 months ago
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆123Updated 2 weeks ago
- An implementation of PSGD Kron second-order optimizer for PyTorch☆89Updated last month
- Library for reading and processing ML training data.☆434Updated this week
- Best practices & guides on how to write distributed pytorch training code☆406Updated 2 months ago