FlorianDietz / comgra
A tool to analyze and debug neural networks in pytorch. Use a GUI to traverse the computation graph and view the data from many different angles at the click of a button.
☆282Updated 3 months ago
Alternatives and similar repositories for comgra:
Users that are interested in comgra are comparing it to the libraries listed below
- Puzzles for exploring transformers☆333Updated last year
- Automatic gradient descent☆207Updated last year
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…☆601Updated 3 months ago
- A Jax-based library for designing and training transformer models from scratch.☆282Updated 6 months ago
- An interactive HTML pretty-printer for machine learning research in IPython notebooks.☆398Updated this week
- TensorHue is a Python library that allows you to visualize tensors right in your console, making understanding and debugging tensor conte…☆114Updated last month
- ☆214Updated 8 months ago
- A pure NumPy implementation of Mamba.☆219Updated 8 months ago
- Annotated version of the Mamba paper☆475Updated last year
- An interactive exploration of Transformer programming.☆261Updated last year
- Universal Tensor Operations in Einstein-Inspired Notation for Python.☆361Updated last month
- For optimization algorithm research and development.☆498Updated this week
- Language Modeling with the H3 State Space Model☆516Updated last year
- run paligemma in real time☆131Updated 10 months ago
- ☆420Updated 5 months ago
- Package for extracting and mapping the results of every single tensor operation in a PyTorch model in one line of code.☆563Updated 2 weeks ago
- ☆149Updated 7 months ago
- ☆301Updated 9 months ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆557Updated this week
- TensorDict is a pytorch dedicated tensor container.☆898Updated this week
- Efficient optimizers☆183Updated last week
- ☆143Updated last year
- 🤖 A PyTorch library of curated Transformer models and their composable components☆884Updated 11 months ago
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds☆219Updated 2 weeks ago
- Memory mapped numpy arrays of varying shapes☆295Updated 9 months ago
- Website for hosting the Open Foundation Models Cheat Sheet.☆264Updated last week
- Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi…☆343Updated 7 months ago
- Effortless plugin and play Optimizer to cut model training costs by 50%. New optimizer that is 2x faster than Adam on LLMs.☆381Updated 9 months ago
- Helpers and such for working with Lambda Cloud☆51Updated last year
- Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l…☆279Updated last month