koyeb / tenstorrent-examples
☆14 Updated 3 months ago
Alternatives and similar repositories for tenstorrent-examples
Users that are interested in tenstorrent-examples are comparing it to the libraries listed below
- Attention in SRAM on Tenstorrent Grayskull ☆38 Updated last year
- Tenstorrent Firmware repository ☆21 Updated this week
- Tenstorrent MLIR compiler ☆185 Updated this week
- ⭐️ TTNN Compiler for PyTorch 2 ⭐️ Enables running PyTorch models on Tenstorrent hardware using eager or compile path ☆56 Updated this week
- Tenstorrent's MLIR Based Compiler. We aim to enable developers to run AI on all configurations of Tenstorrent hardware, through an open-s… ☆117 Updated this week
- High-Performance SGEMM on CUDA devices ☆101 Updated 8 months ago
- Repository of model demos using TT-Buda ☆62 Updated 5 months ago
- Custom PTX Instruction Benchmark ☆127 Updated 6 months ago
- Write a fast kernel and run it on Discord. See how you compare against the best! ☆57 Updated this week
- tiny code to access tenstorrent blackhole ☆59 Updated 3 months ago
- A Data-Centric Compiler for Machine Learning ☆84 Updated last year
- ☆64 Updated this week
- Repo for AI Compiler team. The intended purpose of this repo is for implementation of a PJRT device. ☆27 Updated this week
- Simple experiments on Tenstorrent GraySkull e75 chip ☆13 Updated last year
- Tenstorrent console based hardware information program ☆53 Updated last week
- Buda Compiler Backend for Tenstorrent devices ☆30 Updated 5 months ago
- Tenstorrent TT-BUDA Repository ☆316 Updated 5 months ago
- Repository for AI model benchmarking on TT-Buda ☆15 Updated 6 months ago
- TT-Studio : An all-in-one platform to deploy and manage AI models optimized for Tenstorrent hardware with dedicated front-end demo applic… ☆37 Updated this week
- Samples of good AI generated CUDA kernels ☆90 Updated 3 months ago
- Memory Optimizations for Deep Learning (ICML 2023) ☆107 Updated last year
- ☆52 Updated last year
- PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7) ☆66 Updated 6 months ago
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8. ☆46 Updated last year
- E2E AutoML Model Compression Package ☆46 Updated 6 months ago
- The Riallto Open Source Project from AMD ☆83 Updated 5 months ago
- Official repository of Sparse ISO-FLOP Transformations for Maximizing Training Efficiency ☆25 Updated last year
- An experimental CPU backend for Triton (https://github.com/openai/triton) ☆45 Updated last month
- python package of rocm-smi-lib ☆23 Updated 2 months ago
- ☆15 Updated 2 years ago