FlorianDietz / comgraLinks
A tool to analyze and debug neural networks in pytorch. Use a GUI to traverse the computation graph and view the data from many different angles at the click of a button.
☆289Updated 9 months ago
Alternatives and similar repositories for comgra
Users that are interested in comgra are comparing it to the libraries listed below
Sorting:
- An interactive HTML pretty-printer for machine learning research in IPython notebooks.☆445Updated last month
- A pure NumPy implementation of Mamba.☆224Updated last year
- Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi…☆349Updated last year
- Automatic gradient descent☆210Updated 2 years ago
- TensorHue is a Python library that allows you to visualize tensors right in your console, making understanding and debugging tensor conte…☆120Updated 6 months ago
- A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more.☆291Updated last year
- ☆144Updated 2 years ago
- 🤖 A PyTorch library of curated Transformer models and their composable components☆895Updated last year
- Official implementation of MetaTree: Learning a Decision Tree Algorithm with Transformers☆113Updated last year
- A repository for log-time feedforward networks☆223Updated last year
- git extension for {collaborative, communal, continual} model development☆216Updated 10 months ago
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …☆212Updated 9 months ago
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…☆622Updated 5 months ago
- Documented and Unit Tested educational Deep Learning framework with Autograd from scratch.☆120Updated last year
- a small code base for training large models☆310Updated 4 months ago
- MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvement…☆396Updated this week
- Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".☆373Updated last year
- Python library for designing and training your own Diffusion Models with PyTorch☆288Updated 3 months ago
- run paligemma in real time☆132Updated last year
- Visualize the intermediate output of Mistral 7B☆368Updated 7 months ago
- Open weights language model from Google DeepMind, based on Griffin.☆650Updated 3 months ago
- Puzzles for exploring transformers☆368Updated 2 years ago
- An interactive exploration of Transformer programming.☆269Updated last year
- Memory mapped numpy arrays of varying shapes☆301Updated last year
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆125Updated 4 months ago
- The AdEMAMix Optimizer: Better, Faster, Older.☆186Updated last year
- An implementation of PSGD Kron second-order optimizer for PyTorch☆96Updated last month
- Universal Tensor Operations in Einstein-Inspired Notation for Python.☆408Updated 5 months ago
- ☆307Updated last year
- ☆150Updated last year