tenstorrent / tt-inference-server
☆43 Updated this week
Alternatives and similar repositories for tt-inference-server
Users who are interested in tt-inference-server are comparing it to the libraries listed below.
- The TT-Forge FE is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their per… ☆53 Updated this week
- TT-Studio: An all-in-one platform to deploy and manage AI models optimized for Tenstorrent hardware with dedicated front-end demo applic… ☆39 Updated 3 weeks ago
- Tenstorrent's MLIR Based Compiler. We aim to enable developers to run AI on all configurations of Tenstorrent hardware, through an open-s… ☆162 Updated this week
- Tenstorrent MLIR compiler ☆231 Updated this week
- Tenstorrent TT-BUDA Repository ☆314 Updated 9 months ago
- Tenstorrent console based hardware information program ☆58 Updated this week
- ☆27 Updated 9 months ago
- Tenstorrent Kernel Module ☆57 Updated this week
- TVM for Tenstorrent ASICs ☆28 Updated 4 months ago
- AMD-SHARK Inference Modeling and Serving ☆59 Updated this week
- Repository for AI model benchmarking on TT-Buda ☆15 Updated 10 months ago
- Attention in SRAM on Tenstorrent Grayskull ☆40 Updated last year
- An experimental CPU backend for Triton ☆168 Updated 2 months ago
- QuickReduce is a performant all-reduce library designed for AMD ROCm that supports inline compression. ☆36 Updated 4 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆26 Updated this week
- [Deprecated] ⭐️ TT-NN Compiler for PyTorch 2 ⭐️ Enables running PyTorch models on Tenstorrent hardware using eager or compile path ☆61 Updated last week
- GPUOcelot: A dynamic compilation framework for PTX ☆219 Updated 11 months ago
- ☆86 Updated last week
- Official Problem Sets / Reference Kernels for the GPU MODE Leaderboard! ☆182 Updated 2 weeks ago
- Repo for AI Compiler team. The intended purpose of this repo is for implementation of a PJRT device. ☆50 Updated this week
- AI Tensor Engine for ROCm ☆330 Updated this week
- MLIR-based partitioning system ☆157 Updated this week
- Repository of model demos using TT-Buda ☆63 Updated 9 months ago
- CUDA Tile IR is an MLIR-based intermediate representation and compiler infrastructure for CUDA kernel optimization, focusing on tile-base… ☆763 Updated 3 weeks ago
- Helpful kernel tutorials and examples for tile-based GPU programming ☆554 Updated this week
- Buda Compiler Backend for Tenstorrent devices ☆30 Updated 9 months ago
- A framework that supports executing unmodified CUDA source code on non-NVIDIA devices. ☆139 Updated last year
- Tilus is a tile-level kernel programming language with explicit control over shared memory and registers. ☆437 Updated 3 weeks ago
- Fast and Furious AMD Kernels ☆331 Updated 2 weeks ago
- A fast full-system simulator of Tenstorrent hardware ☆38 Updated 3 weeks ago