koyeb / tenstorrent-examples
☆13Updated last month
Alternatives and similar repositories for tenstorrent-examples
Users that are interested in tenstorrent-examples are comparing it to the libraries listed below
- Write a fast kernel and run it on Discord. See how you compare against the best!☆46Updated this week
- Attention in SRAM on Tenstorrent Grayskull☆36Updated last year
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆39Updated 2 months ago
- Tenstorrent's MLIR Based Compiler. We aim to enable developers to run AI on all configurations of Tenstorrent hardware, through an open-s…☆80Updated this week
- TT-Studio : An all-in-one platform to deploy and manage AI models optimized for Tenstorrent hardware with dedicated front-end demo applic…☆22Updated this week
- tiny code to access tenstorrent blackhole☆55Updated last month
- 👷 Build compute kernels☆77Updated this week
- Tenstorrent console based hardware information program☆47Updated last week
- E2E AutoML Model Compression Package☆46Updated 4 months ago
- Samples of good AI generated CUDA kernels☆84Updated last month
- ☆13Updated 4 months ago
- An experimental CPU backend for Triton (https://github.com/openai/triton)☆43Updated 4 months ago
- Cray-LM unified training and inference stack.☆22Updated 5 months ago
- High-Performance SGEMM on CUDA devices☆97Updated 5 months ago
- Open Source Projects from Pallas Lab☆20Updated 3 years ago
- Memory Optimizations for Deep Learning (ICML 2023)☆98Updated last year
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP☆96Updated this week
- Make triton easier☆47Updated last year
- PTX-Tutorial Written Purely By AIs (Deep Research of OpenAI and Claude 3.7)☆66Updated 3 months ago
- ☆28Updated 6 months ago
- Custom PTX Instruction Benchmark☆126Updated 4 months ago
- ☆12Updated 2 weeks ago
- Intel Gaudi's Megatron DeepSpeed Large Language Models for training☆13Updated 7 months ago
- 🏙 Interactive performance profiling and debugging tool for PyTorch neural networks.☆64Updated 5 months ago
- Personal solutions to the Triton Puzzles☆19Updated last year
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆89Updated 2 weeks ago
- torchtrail: trace the graph of torch functions and modules for visualization, reports, etc☆25Updated last month
- GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing tho…☆108Updated 2 months ago
- Supplementary material for our paper "Compute Trends Across Three Eras of Machine Learning".☆40Updated 3 years ago
- Experiment of using Tangent to autodiff triton☆79Updated last year