tenstorrent / tt-inference-server
☆21 · Updated this week
Alternatives and similar repositories for tt-inference-server
Users who are interested in tt-inference-server compare it to the libraries listed below.
- The TT-Forge FE is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their per… ☆48 · Updated this week
- Tenstorrent's MLIR Based Compiler. We aim to enable developers to run AI on all configurations of Tenstorrent hardware, through an open-s… ☆99 · Updated this week
- Tenstorrent MLIR compiler ☆169 · Updated this week
- Tenstorrent Kernel Module ☆50 · Updated this week
- TVM for Tenstorrent ASICs ☆25 · Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆20 · Updated this week
- Tenstorrent console based hardware information program ☆51 · Updated last week
- Tenstorrent TT-BUDA Repository ☆315 · Updated 4 months ago
- Buda Compiler Backend for Tenstorrent devices ☆30 · Updated 4 months ago
- QuickReduce is a performant all-reduce library designed for AMD ROCm that supports inline compression. ☆31 · Updated 4 months ago
- Tenstorrent Firmware repository ☆18 · Updated 2 weeks ago
- Repo for AI Compiler team. The intended purpose of this repo is for implementation of a PJRT device. ☆19 · Updated this week
- Attention in SRAM on Tenstorrent Grayskull ☆38 · Updated last year
- Unofficial description of the CUDA assembly (SASS) instruction sets. ☆132 · Updated 3 weeks ago
- TT-NN operator library, and TT-Metalium low level kernel programming model. ☆1,069 · Updated this week
- An experimental CPU backend for Triton ☆139 · Updated 2 months ago
- LIBRA: Enabling Workload-aware Multi-dimensional Network Topology Optimization for Distributed Training of Large AI Models ☆11 · Updated last year
- IREE plugin repository for the AMD AIE accelerator ☆100 · Updated last week
- Frontend integration for PyTorch with tt-mlir ☆23 · Updated this week
- Advanced Matrix Extensions (AMX) Guide ☆95 · Updated 3 years ago
- TT-Studio: An all-in-one platform to deploy and manage AI models optimized for Tenstorrent hardware with dedicated front-end demo applic… ☆26 · Updated this week
- RCCL Performance Benchmark Tests ☆72 · Updated last week
- TACOS: [T]opology-[A]ware [Co]llective Algorithm [S]ynthesizer for Distributed Machine Learning ☆25 · Updated 2 months ago
- Tenstorrent Topology (TT-Topology) is a command line utility used to flash multiple NB cards on a system to use specific eth routing conf… ☆13 · Updated last week
- SHARK Inference Modeling and Serving ☆43 · Updated this week