tenstorrent / tt-xlaLinks
Repo for AI Compiler team. The intended purpose of this repo is for implementation of a PJRT device.
☆40Updated this week
Alternatives and similar repositories for tt-xla
Users that are interested in tt-xla are comparing it to the libraries listed below
Sorting:
- Tenstorrent MLIR compiler☆206Updated this week
- IREE's PyTorch Frontend, based on Torch Dynamo.☆99Updated this week
- MLIR-based partitioning system☆144Updated this week
- ☆157Updated this week
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆47Updated 2 months ago
- The TT-Forge FE is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their per…☆51Updated this week
- Tenstorrent Kernel Module☆55Updated this week
- Buda Compiler Backend for Tenstorrent devices☆30Updated 7 months ago
- The missing pieces (as far as boilerplate reduction goes) of the upstream MLIR python bindings.☆111Updated 3 weeks ago
- IREE plugin repository for the AMD AIE accelerator☆112Updated this week
- An experimental CPU backend for Triton☆155Updated last week
- A lightweight, Pythonic, frontend for MLIR☆80Updated 2 years ago
- Experiments and prototypes associated with IREE or MLIR☆55Updated last year
- Tenstorrent's MLIR Based Compiler. We aim to enable developers to run AI on all configurations of Tenstorrent hardware, through an open-s…☆135Updated this week
- AMD RAD's multi-GPU Triton-based framework for seamless multi-GPU programming☆101Updated this week
- Attention in SRAM on Tenstorrent Grayskull☆38Updated last year
- The Riallto Open Source Project from AMD☆84Updated 6 months ago
- a simple end to end example of taking a ML graph (TF2 / PyTorch) and running it on a device [cpu, gpu]☆35Updated 4 years ago
- ☆76Updated this week
- Unified compiler/runtime for interfacing with PyTorch Dynamo.☆102Updated 2 months ago
- TPP experimentation on MLIR for linear algebra☆137Updated last month
- OpenAI Triton backend for Intel® GPUs☆219Updated this week
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆145Updated this week
- ⭐️ TTNN Compiler for PyTorch 2 ⭐️ Enables running PyTorch models on Tenstorrent hardware using eager or compile path☆60Updated this week
- High-Performance SGEMM on CUDA devices☆107Updated 9 months ago
- TVM for Tenstorrent ASICs☆27Updated last month
- Unofficial description of the CUDA assembly (SASS) instruction sets.☆155Updated 3 months ago
- Tenstorrent Firmware repository☆24Updated this week
- This repository contains companion software for the Colfax Research paper "Categorical Foundations for CuTe Layouts".☆74Updated last month
- Super fast FP32 matrix multiplication on RDNA3☆78Updated 7 months ago