tenstorrent / tt-metalLinks
TT-NN operator library, and TT-Metalium low level kernel programming model.
☆898Updated this week
Alternatives and similar repositories for tt-metal
Users that are interested in tt-metal are comparing it to the libraries listed below
Sorting:
- Tenstorrent TT-BUDA Repository☆312Updated 2 months ago
- ⭐️ TTNN Compiler for PyTorch 2 ⭐️ It enables running PyTorch models on Tenstorrent hardware using torch.compile path☆42Updated this week
- Tenstorrent MLIR compiler☆132Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆16Updated this week
- Tenstorrent's MLIR Based Compiler. We aim to enable developers to run AI on all configurations of Tenstorrent hardware, through an open-s…☆60Updated this week
- Frontend integration for PyTorch with tt-mlir☆21Updated this week
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators☆401Updated this week
- The TT-Forge FE is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their per…☆44Updated this week
- An MLIR-based toolchain for AMD AI Engine-enabled devices.☆408Updated this week
- torchtrail: trace the graph of torch functions and modules for visualization, reports, etc☆25Updated 2 weeks ago
- BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.☆619Updated last month
- Backward compatible ML compute opset inspired by HLO/MHLO☆485Updated this week
- AI Tensor Engine for ROCm☆201Updated this week
- Repository of model demos using TT-Buda☆62Updated 2 months ago
- The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.☆1,552Updated this week
- Fork of LLVM to support AMD AIEngine processors☆143Updated this week
- An unofficial cuda assembler, for all generations of SASS, hopefully :)☆506Updated 2 years ago
- Tenstorrent Kernel Module☆44Updated this week
- Awesome resources for GPUs☆572Updated last year
- Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels☆1,250Updated this week
- OpenAI Triton backend for Intel® GPUs☆189Updated this week
- Tutorials on tinygrad☆381Updated 3 weeks ago
- Tenstorrent console based hardware information program☆37Updated last week
- Tile primitives for speedy kernels☆2,420Updated this week
- Tenstorrent Firmware repository☆13Updated this week
- Exocompilation for productive programming of hardware accelerators☆607Updated this week
- PyTorch emulation library for Microscaling (MX)-compatible data formats☆241Updated last week
- Mirage: Automatically Generating Fast GPU Kernels without Programming in Triton/CUDA☆858Updated this week
- CUDA Kernel Benchmarking Library☆656Updated last week
- PyTorch native quantization and sparsity for training and inference☆2,088Updated this week