tenstorrent / tt-buda
Tenstorrent TT-BUDA Repository
☆225 · Updated last month
Related projects
Alternatives and complementary repositories for tt-buda
- TT-NN operator library, and TT-Metalium low level kernel programming model. ☆475 · Updated this week
- Tenstorrent MLIR compiler ☆75 · Updated this week
- Repository of model demos using TT-Buda ☆55 · Updated 2 weeks ago
- ⭐️ TTNN Compiler for PyTorch 2.0 ⭐️ It enables running PyTorch2.0 models on Tenstorrent hardware ☆25 · Updated this week
- Buda Compiler Backend for Tenstorrent devices ☆26 · Updated last month
- TVM for Tenstorrent ASICs ☆20 · Updated this week
- Tenstorrent Kernel Module ☆32 · Updated last week
- An MLIR-based toolchain for AMD AI Engine-enabled devices. ☆307 · Updated this week
- IREE's PyTorch Frontend, based on Torch Dynamo. ☆55 · Updated this week
- Tenstorrent console based hardware information program ☆23 · Updated 2 weeks ago
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators ☆313 · Updated this week
- Development repository for the Triton language and compiler ☆93 · Updated this week
- OpenAI Triton backend for Intel® GPUs ☆143 · Updated this week
- IREE plugin repository for the AMD AIE accelerator ☆69 · Updated this week
- PyTorch emulation library for Microscaling (MX)-compatible data formats ☆163 · Updated last month
- An experimental CPU backend for Triton ☆56 · Updated last week
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain. ☆124 · Updated this week
- ☆128 · Updated this week
- Attention in SRAM on Tenstorrent Grayskull ☆29 · Updated 4 months ago
- Backward compatible ML compute opset inspired by HLO/MHLO ☆412 · Updated this week
- Unified compiler/runtime for interfacing with PyTorch Dynamo. ☆95 · Updated last week
- Shared Middle-Layer for Triton Compilation ☆191 · Updated this week
- hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditiona… ☆63 · Updated this week
- The TT-Forge FE is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their per… ☆20 · Updated this week
- The Riallto Open Source Project from AMD ☆68 · Updated last week
- BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment. ☆420 · Updated this week
- ☆80 · Updated this week
- GPUOcelot: A dynamic compilation framework for PTX ☆147 · Updated last month
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX ☆127 · Updated 3 weeks ago
- HeteroCL: A Multi-Paradigm Programming Infrastructure for Software-Defined Heterogeneous Computing ☆326 · Updated 7 months ago