tenstorrent / tt-isa-documentation
☆87 Updated 2 weeks ago
Alternatives and similar repositories for tt-isa-documentation
Users who are interested in tt-isa-documentation are comparing it to the repositories listed below.
- Tenstorrent MLIR compiler ☆233 Updated this week
- The TT-Forge FE is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their per… ☆53 Updated this week
- Repo for AI Compiler team. The intended purpose of this repo is for implementation of a PJRT device. ☆50 Updated this week
- Tenstorrent's MLIR Based Compiler. We aim to enable developers to run AI on all configurations of Tenstorrent hardware, through an open-s… ☆166 Updated this week
- IREE's PyTorch Frontend, based on Torch Dynamo. ☆103 Updated last week
- GPUOcelot: A dynamic compilation framework for PTX ☆219 Updated 11 months ago
- Super fast FP32 matrix multiplication on RDNA3 ☆82 Updated 9 months ago
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain. ☆148 Updated this week
- Nvidia Instruction Set Specification Generator ☆310 Updated last year
- A custom AI chip to be taped out soon! ☆37 Updated 3 weeks ago
- Unofficial description of the CUDA assembly (SASS) instruction sets. ☆194 Updated 5 months ago
- [Deprecated] ⭐️ TT-NN Compiler for PyTorch 2 ⭐️ Enables running PyTorch models on Tenstorrent hardware using eager or compile path ☆61 Updated 2 weeks ago
- Buda Compiler Backend for Tenstorrent devices ☆30 Updated 9 months ago
- Tenstorrent Kernel Module ☆57 Updated this week
- MLIR-based partitioning system ☆160 Updated this week
- The missing pieces (as far as boilerplate reduction goes) of the upstream MLIR python bindings. ☆117 Updated 2 months ago
- A framework that supports executing unmodified CUDA source code on non-NVIDIA devices. ☆141 Updated last year
- ☆162 Updated this week
- Experiments and prototypes associated with IREE or MLIR ☆56 Updated last year
- Attention in SRAM on Tenstorrent Grayskull ☆40 Updated last year
- An experimental CPU backend for Triton (https://github.com/openai/triton) ☆48 Updated 4 months ago
- ☆31 Updated this week
- High-Performance SGEMM on CUDA devices ☆115 Updated 11 months ago
- Tenstorrent console based hardware information program ☆58 Updated this week
- Simple experiments on Tenstorrent GraySkull e75 chip ☆13 Updated last year
- Tenstorrent TT-BUDA Repository ☆314 Updated 9 months ago
- RDNA3 emulator ☆55 Updated 9 months ago
- Custom PTX Instruction Benchmark ☆137 Updated 10 months ago
- Tutorial on building a gpu compiler backend in LLVM ☆52 Updated last year
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆138 Updated this week