FlorianDietz / comgraLinks
A tool to analyze and debug neural networks in pytorch. Use a GUI to traverse the computation graph and view the data from many different angles at the click of a button.
☆293Updated 11 months ago
Alternatives and similar repositories for comgra
Users that are interested in comgra are comparing it to the libraries listed below
Sorting:
- An interactive HTML pretty-printer for machine learning research in IPython notebooks.☆451Updated 3 months ago
- Automatic gradient descent☆215Updated 2 years ago
- A pure NumPy implementation of Mamba.☆223Updated last year
- Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi…☆352Updated last year
- ☆144Updated 2 years ago
- A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more.☆297Updated last year
- Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l…☆286Updated 2 months ago
- run paligemma in real time☆133Updated last year
- Visualize the intermediate output of Mistral 7B☆381Updated 10 months ago
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…☆626Updated 8 months ago
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆125Updated 7 months ago
- TensorHue is a Python library that allows you to visualize tensors right in your console, making understanding and debugging tensor conte…☆122Updated 9 months ago
- ☆248Updated last year
- Open weights language model from Google DeepMind, based on Griffin.☆654Updated 5 months ago
- Teaching transformers to play chess☆142Updated 10 months ago
- a small code base for training large models☆315Updated 7 months ago
- Package for extracting and mapping the results of every single tensor operation in a PyTorch model in one line of code.☆632Updated 2 months ago
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …☆219Updated last year
- Documented and Unit Tested educational Deep Learning framework with Autograd from scratch.☆122Updated last year
- MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvement…☆401Updated this week
- ☆210Updated last year
- Code implementing "Efficient Parallelization of a Ubiquitious Sequential Computation" (Heinsen, 2023)☆97Updated 11 months ago
- 🤖 A PyTorch library of curated Transformer models and their composable components☆893Updated last year
- Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".☆375Updated last year
- A repository for log-time feedforward networks☆223Updated last year
- Puzzles for exploring transformers☆378Updated 2 years ago
- Memory mapped numpy arrays of varying shapes☆305Updated last year
- ☆248Updated 5 months ago
- For optimization algorithm research and development.☆547Updated 2 weeks ago
- Next Generation Experimental Tracking for Machine Learning Operations☆354Updated 6 months ago