tenstorrent / ttnn-visualizer
A comprehensive tool for visualizing and analyzing model execution, offering interactive graphs, memory plots, tensor details, buffer overviews, operation flow graphs, and multi-instance support with file or SSH-based report loading.
☆26 · Updated this week
Alternatives and similar repositories for ttnn-visualizer:
Users interested in ttnn-visualizer are comparing it to the repositories listed below.
- Tenstorrent MLIR compiler ☆100 · Updated this week
- Tenstorrent TT-BUDA Repository ☆296 · Updated last week
- Attention in SRAM on Tenstorrent Grayskull ☆32 · Updated 7 months ago
- TT-NN operator library, and TT-Metalium low level kernel programming model. ☆662 · Updated this week
- torchtrail: trace the graph of torch functions and modules for visualization, reports, etc. ☆25 · Updated 9 months ago
- High-Performance SGEMM on CUDA devices ☆86 · Updated last month
- ⭐️ TTNN Compiler for PyTorch 2.0 ⭐️ It enables running PyTorch 2.0 models on Tenstorrent hardware ☆30 · Updated this week
- An experimental CPU backend for Triton (https://github.com/openai/triton) ☆39 · Updated 10 months ago
- KernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA problems ☆227 · Updated last week
- Repository of model demos using TT-Buda ☆63 · Updated last week
- Fast Matrix Multiplications for Lookup Table-Quantized LLMs ☆231 · Updated 2 weeks ago
- Fastest kernels written from scratch ☆188 · Updated last week
- ☆12 · Updated last year
- ☆34 · Updated this week
- Simple experiments on Tenstorrent GraySkull e75 chip ☆10 · Updated 6 months ago
- The TT-Forge FE is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their per… ☆28 · Updated this week
- Learning about CUDA by writing PTX code. ☆123 · Updated last year
- Nvidia Instruction Set Specification Generator ☆254 · Updated 8 months ago
- Machine-Learning Accelerator System Exploration Tools ☆149 · Updated this week
- Tenstorrent console based hardware information program ☆35 · Updated this week
- An experimental CPU backend for Triton ☆99 · Updated this week
- Fast Hadamard transform in CUDA, with a PyTorch interface ☆151 · Updated 9 months ago
- ☆188 · Updated 3 weeks ago
- Write a fast kernel and run it on Discord. See how you compare against the best! ☆31 · Updated this week
- High-speed GEMV kernels, up to 2.7x speedup compared to the PyTorch baseline. ☆100 · Updated 8 months ago
- OpenAI Triton backend for Intel® GPUs ☆168 · Updated this week
- ☆15 · Updated 5 months ago
- Cataloging released Triton kernels. ☆185 · Updated 2 months ago