koyeb / tenstorrent-examplesLinks
☆13Updated last week
Alternatives and similar repositories for tenstorrent-examples
Users that are interested in tenstorrent-examples are comparing it to the libraries listed below
Sorting:
- Attention in SRAM on Tenstorrent Grayskull☆38Updated last year
- Tenstorrent's MLIR Based Compiler. We aim to enable developers to run AI on all configurations of Tenstorrent hardware, through an open-s…☆135Updated this week
- Tenstorrent MLIR compiler☆206Updated this week
- Tenstorrent console based hardware information program☆54Updated this week
- Tenstorrent Firmware repository☆24Updated this week
- ⭐️ TTNN Compiler for PyTorch 2 ⭐️ Enables running PyTorch models on Tenstorrent hardware using eager or compile path☆60Updated this week
- TT-Studio : An all-in-one platform to deploy and manage AI models optimized for Tenstorrent hardware with dedicated front-end demo applic…☆39Updated last week
- Buda Compiler Backend for Tenstorrent devices☆30Updated 7 months ago
- tiny code to access tenstorrent blackhole☆60Updated 5 months ago
- Tenstorrent TT-BUDA Repository☆315Updated 7 months ago
- ☆76Updated this week
- High-Performance SGEMM on CUDA devices☆107Updated 9 months ago
- Repo for AI Compiler team. The intended purpose of this repo is for implementation of a PJRT device.☆40Updated this week
- A Data-Centric Compiler for Machine Learning☆85Updated last year
- Write a fast kernel and run it on Discord. See how you compare against the best!☆61Updated this week
- Samples of good AI generated CUDA kernels☆91Updated 5 months ago
- Repository of model demos using TT-Buda☆63Updated 7 months ago
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆47Updated 2 months ago
- Simple experiments on Tenstorrent GraySkull e75 chip☆13Updated last year
- The Riallto Open Source Project from AMD☆84Updated 6 months ago
- ☆49Updated last month
- Custom PTX Instruction Benchmark☆131Updated 8 months ago
- An implementation of the transformer architecture onto an Nvidia CUDA kernel☆192Updated 2 years ago
- How to ensure correctness and ship LLM generated kernels in PyTorch☆111Updated this week
- The TT-Forge FE is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their per…☆51Updated this week
- Tenstorrent Kernel Module☆55Updated this week
- Tiny ASIC implementation for "The Era of 1-bit LLMs All Large Language Models are in 1.58 Bits" matrix multiplication unit☆169Updated last year
- A Learning Journey: Micrograd in Mojo 🔥☆63Updated last year
- Repository for AI model benchmarking on TT-Buda☆15Updated 8 months ago
- User-Mode Driver for Tenstorrent hardware☆34Updated last week