tenstorrent / tt-studio
TT-Studio : An all-in-one platform to deploy and manage AI models optimized for Tenstorrent hardware with dedicated front-end demo applications.
☆15Updated this week
Alternatives and similar repositories for tt-studio:
Users that are interested in tt-studio are comparing it to the libraries listed below
- A comprehensive tool for visualizing and analyzing model execution, offering interactive graphs, memory plots, tensor details, buffer ove…☆31Updated this week
- Tenstorrent console based hardware information program☆37Updated this week
- Tenstorrent MLIR compiler☆120Updated this week
- Tenstorrent's MLIR Based Compiler. We aim to enable developers to run AI on all configurations of Tenstorrent hardware, through an open-s…☆43Updated this week
- The TT-Forge FE is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their per…☆40Updated this week
- Repository of model demos using TT-Buda☆63Updated 3 weeks ago
- Repo for AI Compiler team. The intended purpose of this repo is for implementation of a PJRT device.☆15Updated this week
- Tenstorrent TT-BUDA Repository☆313Updated 3 weeks ago
- Attention in SRAM on Tenstorrent Grayskull☆34Updated 9 months ago
- Frontend integration for PyTorch with tt-mlir☆14Updated this week
- ☆11Updated this week
- Tenstorrent Kernel Module☆41Updated this week
- ⭐️ TTNN Compiler for PyTorch 2 ⭐️ It enables running PyTorch models on Tenstorrent hardware using torch.compile path☆36Updated this week
- ☆27Updated last month
- High-Performance SGEMM on CUDA devices☆90Updated 3 months ago
- IREE's PyTorch Frontend, based on Torch Dynamo.☆79Updated this week
- GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing tho…☆109Updated last month
- torchtrail: trace the graph of torch functions and modules for visualization, reports, etc☆25Updated 10 months ago
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆40Updated last month
- Learning about CUDA by writing PTX code.☆128Updated last year
- TT-NN operator library, and TT-Metalium low level kernel programming model.☆829Updated this week
- Tenstorrent Firmware repository☆12Updated this week
- ☆14Updated this week
- Custom PTX Instruction Benchmark☆122Updated 2 months ago
- Train your own small bitnet model☆67Updated 6 months ago
- Inference Llama 2 with a model compiled to native code by TorchInductor☆14Updated last year
- ☆12Updated last year
- RDNA3 emulator☆54Updated last week
- Prepare for DeekSeek R1 inference: Benchmark CPU, DRAM, SSD, iGPU, GPU, ... with efficient code.☆71Updated 2 months ago
- ☆156Updated 3 weeks ago