FlorianDietz / comgra
A tool to analyze and debug neural networks in PyTorch. Use a GUI to traverse the computation graph and view the data from many different angles at the click of a button.
☆291Updated 11 months ago
Alternatives and similar repositories for comgra
Users who are interested in comgra are comparing it to the libraries listed below:
- An interactive HTML pretty-printer for machine learning research in IPython notebooks.☆449Updated 3 months ago
- A pure NumPy implementation of Mamba.☆223Updated last year
- A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more.☆296Updated last year
- Automatic gradient descent☆215Updated 2 years ago
- Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi…☆351Updated last year
- ☆150Updated last year
- ☆144Updated 2 years ago
- TensorHue is a Python library that allows you to visualize tensors right in your console, making understanding and debugging tensor conte…☆121Updated 8 months ago
- An interactive exploration of Transformer programming.☆270Updated last year
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…☆625Updated 7 months ago
- An implementation of PSGD Kron second-order optimizer for PyTorch☆96Updated 3 months ago
- ☆310Updated last year
- a small code base for training large models☆313Updated 6 months ago
- Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l…☆285Updated last month
- Documented and Unit Tested educational Deep Learning framework with Autograd from scratch.☆122Updated last year
- 🤖 A PyTorch library of curated Transformer models and their composable components☆893Updated last year
- Package for extracting and mapping the results of every single tensor operation in a PyTorch model in one line of code.☆624Updated last month
- PyTorch script hot swap: Change code without unloading your LLM from VRAM☆124Updated 6 months ago
- Puzzles for exploring transformers☆376Updated 2 years ago
- run paligemma in real time☆133Updated last year
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …☆216Updated 11 months ago
- Highly commented implementations of Transformers in PyTorch☆136Updated 2 years ago
- Open weights language model from Google DeepMind, based on Griffin.☆653Updated 5 months ago
- Teaching transformers to play chess☆141Updated 9 months ago
- MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvement…☆400Updated this week
- A repository for log-time feedforward networks☆222Updated last year
- ☆248Updated 4 months ago
- git extension for {collaborative, communal, continual} model development☆215Updated 11 months ago
- Next Generation Experimental Tracking for Machine Learning Operations☆349Updated 5 months ago
- Memory mapped numpy arrays of varying shapes☆304Updated last year