koyeb / tenstorrent-examples
☆13 Updated 2 months ago
Alternatives and similar repositories for tenstorrent-examples
Users who are interested in tenstorrent-examples compare it to the repositories listed below.
- Tenstorrent's MLIR-based compiler. We aim to enable developers to run AI on all configurations of Tenstorrent hardware, through an open-s… ☆99 Updated this week
- Attention in SRAM on Tenstorrent Grayskull ☆38 Updated last year
- Write a fast kernel and run it on Discord. See how you compare against the best! ☆50 Updated this week
- ⭐️ TTNN Compiler for PyTorch 2 ⭐️ Enables running PyTorch models on Tenstorrent hardware using the eager or compile path ☆53 Updated this week
- Tenstorrent MLIR compiler ☆169 Updated this week
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools ☆39 Updated last week
- A Data-Centric Compiler for Machine Learning ☆84 Updated last year
- High-Performance SGEMM on CUDA devices ☆98 Updated 6 months ago
- Tenstorrent console-based hardware information program ☆51 Updated last week
- Custom PTX Instruction Benchmark ☆126 Updated 5 months ago
- Tiny code to access Tenstorrent Blackhole ☆58 Updated 2 months ago
- 🏙 Interactive performance profiling and debugging tool for PyTorch neural networks. ☆64 Updated 6 months ago
- TT-Studio: An all-in-one platform to deploy and manage AI models optimized for Tenstorrent hardware with dedicated front-end demo applic… ☆26 Updated this week
- Learn GPU Programming in Mojo🔥 by Solving Puzzles ☆107 Updated last week
- E2E AutoML Model Compression Package ☆45 Updated 5 months ago
- An experimental CPU backend for Triton (https://github.com/openai/triton) ☆44 Updated 4 months ago
- Tenstorrent TT-BUDA Repository ☆315 Updated 4 months ago
- PTX-Tutorial Written Purely By AIs (Deep Research of OpenAI and Claude 3.7) ☆66 Updated 4 months ago
- A lightweight MLIR Python frontend with support for PyTorch ☆25 Updated 11 months ago
- A stand-alone implementation of several NumPy dtype extensions used in machine learning. ☆286 Updated 2 weeks ago
- General Matrix Multiplication using NVIDIA Tensor Cores ☆18 Updated 6 months ago
- Repository of model demos using TT-Buda ☆62 Updated 4 months ago
- PCCL (Prime Collective Communications Library) implements fault-tolerant collective communications over IP ☆100 Updated this week
- An experiment in using Tangent to autodiff Triton ☆80 Updated last year
- Small-scale distributed training of sequential deep learning models, built on NumPy and MPI. ☆137 Updated last year
- Cray-LM unified training and inference stack. ☆22 Updated 6 months ago
- GPU documentation for humans ☆101 Updated 3 weeks ago
- Simple experiments on the Tenstorrent Grayskull e75 chip ☆13 Updated 11 months ago