tenstorrent / tt-forgeLinks
Tenstorrent's MLIR Based Compiler. We aim to enable developers to run AI on all configurations of Tenstorrent hardware, through an open-source, general, and performant compiler.
☆162Updated this week
Alternatives and similar repositories for tt-forge
Users that are interested in tt-forge are comparing it to the libraries listed below
Sorting:
- Tenstorrent MLIR compiler☆231Updated this week
- [Deprecated] ⭐️ TT-NN Compiler for PyTorch 2 ⭐️ Enables running PyTorch models on Tenstorrent hardware using eager or compile path☆61Updated last week
- Tenstorrent TT-BUDA Repository☆314Updated 9 months ago
- The TT-Forge FE is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their per…☆53Updated this week
- Tenstorrent console based hardware information program☆58Updated this week
- Tenstorrent Kernel Module☆57Updated last week
- ☆86Updated last week
- TT-NN operator library, and TT-Metalium low level kernel programming model.☆1,303Updated this week
- Official Problem Sets / Reference Kernels for the GPU MODE Leaderboard!☆182Updated 2 weeks ago
- Attention in SRAM on Tenstorrent Grayskull☆40Updated last year
- CUDA Tile IR is an MLIR-based intermediate representation and compiler infrastructure for CUDA kernel optimization, focusing on tile-base…☆763Updated 3 weeks ago
- Repo for AI Compiler team. The intended purpose of this repo is for implementation of a PJRT device.☆50Updated this week
- AMD RAD's multi-GPU Triton-based framework for seamless multi-GPU programming☆143Updated last week
- AI Tensor Engine for ROCm☆330Updated this week
- TT-Studio : An all-in-one platform to deploy and manage AI models optimized for Tenstorrent hardware with dedicated front-end demo applic…☆39Updated 3 weeks ago
- Fast and Furious AMD Kernels☆331Updated last week
- ☆83Updated last month
- Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.☆437Updated 3 weeks ago
- Custom PTX Instruction Benchmark☆137Updated 10 months ago
- GPUOcelot: A dynamic compilation framework for PTX☆219Updated 11 months ago
- An experimental CPU backend for Triton☆168Updated 2 months ago
- MLIR-based partitioning system☆157Updated this week
- TVM for Tenstorrent ASICs☆28Updated 4 months ago
- An MLIR-based toolchain for AMD AI Engine-enabled devices.☆556Updated this week
- High-Performance SGEMM on CUDA devices☆115Updated 11 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆113Updated last week
- Unofficial description of the CUDA assembly (SASS) instruction sets.☆193Updated 5 months ago
- OpenAI Triton backend for Intel® GPUs☆223Updated this week
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆48Updated 4 months ago
- Nvidia Instruction Set Specification Generator☆309Updated last year