tenstorrent / tt-metal
TT-NN operator library, and TT-Metalium low level kernel programming model.
☆662Updated this week
Alternatives and similar repositories for tt-metal:
Users that are interested in tt-metal are comparing it to the libraries listed below
- Tenstorrent TT-BUDA Repository☆296Updated last week
- ⭐️ TTNN Compiler for PyTorch 2.0 ⭐️ It enables running PyTorch2.0 models on Tenstorrent hardware☆30Updated this week
- Tenstorrent MLIR compiler☆100Updated this week
- An MLIR-based toolchain for AMD AI Engine-enabled devices.☆346Updated this week
- torchtrail: trace the graph of torch functions and modules for visualization, reports, etc☆25Updated 9 months ago
- Repository of model demos using TT-Buda☆63Updated last week
- The TT-Forge FE is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their per…☆28Updated this week
- An unofficial cuda assembler, for all generations of SASS, hopefully :)☆461Updated last year
- Tenstorrent Firmware Update Utility☆13Updated last week
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators☆362Updated this week
- Tenstorrent console based hardware information program☆35Updated this week
- This is the top-level repository for the Accel-Sim framework.☆366Updated this week
- Backward compatible ML compute opset inspired by HLO/MHLO☆453Updated this week
- A comprehensive tool for visualizing and analyzing model execution, offering interactive graphs, memory plots, tensor details, buffer ove…☆26Updated this week
- BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.☆541Updated 3 weeks ago
- OpenAI Triton backend for Intel® GPUs☆168Updated this week
- PyTorch emulation library for Microscaling (MX)-compatible data formats☆207Updated 5 months ago
- Tenstorrent Kernel Module☆39Updated this week
- Shared Middle-Layer for Triton Compilation☆230Updated this week
- Awesome resources for GPUs☆551Updated last year
- An experimental CPU backend for Triton☆99Updated this week
- Berkeley's Spatial Array Generator☆889Updated 3 weeks ago
- Buda Compiler Backend for Tenstorrent devices☆26Updated last week
- GPUOcelot: A dynamic compilation framework for PTX☆178Updated last month
- ☆137Updated this week
- MLIR For Beginners tutorial☆922Updated last month
- Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure☆826Updated this week
- Stretching GPU performance for GEMMs and tensor contractions.☆232Updated this week
- hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditiona…☆79Updated this week
- IREE plugin repository for the AMD AIE accelerator☆83Updated this week