tenstorrent / tt-buda
Tenstorrent TT-BUDA Repository
☆291Updated this week
Alternatives and similar repositories for tt-buda:
Users that are interested in tt-buda are comparing it to the libraries listed below
- TT-NN operator library, and TT-Metalium low level kernel programming model.☆639Updated this week
- Tenstorrent MLIR compiler☆93Updated this week
- Repository of model demos using TT-Buda☆63Updated this week
- ⭐️ TTNN Compiler for PyTorch 2.0 ⭐️ It enables running PyTorch2.0 models on Tenstorrent hardware☆30Updated this week
- An MLIR-based toolchain for AMD AI Engine-enabled devices.☆341Updated this week
- The TT-Forge FE is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their per…☆28Updated this week
- Buda Compiler Backend for Tenstorrent devices☆26Updated this week
- Tenstorrent console based hardware information program☆33Updated this week
- A open source reimplementation of Google's Tensor Processing Unit (TPU).☆408Updated 7 years ago
- Tenstorrent Kernel Module☆37Updated this week
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators☆355Updated this week
- This is the top-level repository for the Accel-Sim framework.☆354Updated last week
- TVM for Tenstorrent ASICs☆21Updated this week
- Tenstorrent Firmware Update Utility☆13Updated this week
- GPUOcelot: A dynamic compilation framework for PTX☆174Updated 3 weeks ago
- PyTorch emulation library for Microscaling (MX)-compatible data formats☆204Updated 5 months ago
- CSV spreadsheets and other material for AI accelerator survey papers☆163Updated last year
- Ocelot: The Berkeley Out-of-Order Machine With V-EXT support☆158Updated last month
- Backward compatible ML compute opset inspired by HLO/MHLO☆449Updated this week
- BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.☆530Updated 2 weeks ago
- Allo: A Programming Model for Composable Accelerator Design☆196Updated this week
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆130Updated this week
- ☆90Updated this week
- A scalable High-Level Synthesis framework on MLIR☆246Updated 9 months ago
- Berkeley's Spatial Array Generator☆879Updated last week
- IREE plugin repository for the AMD AIE accelerator☆81Updated this week
- An experimental CPU backend for Triton☆94Updated this week
- Timeloop performs modeling, mapping and code-generation for tensor algebra workloads on various accelerator architectures.☆365Updated 3 weeks ago
- An unofficial cuda assembler, for all generations of SASS, hopefully :)☆449Updated last year
- HeteroCL: A Multi-Paradigm Programming Infrastructure for Software-Defined Heterogeneous Computing☆332Updated 10 months ago