tenstorrent / tt-buda-demosLinks
Repository of model demos using TT-Buda
☆62Updated 3 months ago
Alternatives and similar repositories for tt-buda-demos
Users that are interested in tt-buda-demos are comparing it to the libraries listed below
Sorting:
- Tenstorrent TT-BUDA Repository☆313Updated 3 months ago
- Tenstorrent console based hardware information program☆45Updated last week
- ⭐️ TTNN Compiler for PyTorch 2 ⭐️ Enables running PyTorch models on Tenstorrent hardware using eager or compile path☆51Updated this week
- Tenstorrent's MLIR Based Compiler. We aim to enable developers to run AI on all configurations of Tenstorrent hardware, through an open-s…☆78Updated this week
- Attention in SRAM on Tenstorrent Grayskull☆36Updated 11 months ago
- Buda Compiler Backend for Tenstorrent devices☆29Updated 3 months ago
- Tenstorrent Kernel Module☆46Updated this week
- TT-Studio : An all-in-one platform to deploy and manage AI models optimized for Tenstorrent hardware with dedicated front-end demo applic…☆21Updated this week
- Tenstorrent Firmware repository☆15Updated last week
- The Riallto Open Source Project from AMD☆81Updated 3 months ago
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆43Updated 3 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆109Updated this week
- AMD related optimizations for transformer models☆80Updated 3 weeks ago
- Tenstorrent Firmware Update Utility☆5Updated last week
- Tenstorrent MLIR compiler☆151Updated last week
- ☆48Updated last month
- The TT-Forge FE is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their per…☆46Updated this week
- ☆60Updated last year
- Development repository for the Triton language and compiler☆125Updated this week
- ☆161Updated last week
- Custom PTX Instruction Benchmark☆126Updated 4 months ago
- TVM for Tenstorrent ASICs☆23Updated last week
- Repo for AI Compiler team. The intended purpose of this repo is for implementation of a PJRT device.☆18Updated this week
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆87Updated this week
- ☆86Updated this week
- No-code CLI designed for accelerating ONNX workflows☆201Updated last month
- IREE's PyTorch Frontend, based on Torch Dynamo.☆90Updated this week
- High-Performance SGEMM on CUDA devices☆97Updated 5 months ago
- ☆40Updated this week
- GPUOcelot: A dynamic compilation framework for PTX☆201Updated 5 months ago