tenstorrent / tt-buda
Tenstorrent TT-BUDA Repository
☆307 Updated 2 weeks ago
Alternatives and similar repositories for tt-buda:
Users who are interested in tt-buda compare it to the libraries listed below.
- TT-NN operator library, and TT-Metalium low level kernel programming model. ☆681 Updated this week
- Repository of model demos using TT-Buda ☆63 Updated this week
- Tenstorrent MLIR compiler ☆109 Updated this week
- ⭐️ TTNN Compiler for PyTorch 2.0 ⭐️ It enables running PyTorch 2.0 models on Tenstorrent hardware ☆33 Updated this week
- The TT-Forge FE is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their per… ☆30 Updated this week
- An MLIR-based toolchain for AMD AI Engine-enabled devices. ☆354 Updated this week
- Tenstorrent console based hardware information program ☆35 Updated last week
- Buda Compiler Backend for Tenstorrent devices ☆28 Updated last month
- Tenstorrent Kernel Module ☆39 Updated this week
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators ☆373 Updated this week
- Tenstorrent Firmware Update Utility ☆13 Updated 2 weeks ago
- TVM for Tenstorrent ASICs ☆21 Updated this week
- AI Tensor Engine for ROCm ☆142 Updated this week
- Open source machine learning accelerators ☆375 Updated last year
- Backward compatible ML compute opset inspired by HLO/MHLO ☆457 Updated last week
- PyTorch emulation library for Microscaling (MX)-compatible data formats ☆212 Updated 6 months ago
- An experimental CPU backend for Triton ☆103 Updated this week
- GPUOcelot: A dynamic compilation framework for PTX ☆182 Updated last month
- IREE plugin repository for the AMD AIE accelerator ☆87 Updated this week
- Ocelot: The Berkeley Out-of-Order Machine With V-EXT support ☆158 Updated 2 months ago
- OpenAI Triton backend for Intel® GPUs ☆172 Updated this week
- CSV spreadsheets and other material for AI accelerator survey papers ☆164 Updated last year
- A comprehensive tool for visualizing and analyzing model execution, offering interactive graphs, memory plots, tensor details, buffer ove… ☆27 Updated this week
- BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment. ☆572 Updated last month
- hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditiona… ☆84 Updated this week
- Allo: A Programming Model for Composable Accelerator Design ☆220 Updated last week
- Development repository for the Triton language and compiler ☆114 Updated this week
- ☆91 Updated last week
- IREE's PyTorch Frontend, based on Torch Dynamo. ☆74 Updated this week
- HeteroCL: A Multi-Paradigm Programming Infrastructure for Software-Defined Heterogeneous Computing ☆335 Updated 11 months ago