geohot / tt-twitchLinks
tenstorrent kernel from twitch
☆28Updated last year
Alternatives and similar repositories for tt-twitch
Users that are interested in tt-twitch are comparing it to the libraries listed below
Sorting:
- RDNA3 emulator☆54Updated 2 months ago
- ☆46Updated last week
- Tenstorrent's MLIR Based Compiler. We aim to enable developers to run AI on all configurations of Tenstorrent hardware, through an open-s…☆72Updated this week
- ctypes wrappers for HIP, CUDA, and OpenCL☆130Updated 11 months ago
- Simple experiments on Tenstorrent GraySkull e75 chip☆12Updated 9 months ago
- Tenstorrent system interface library☆24Updated this week
- A lightweight MLIR Python frontend with support for PyTorch☆23Updated 9 months ago
- Embedded Universal DSL: a good DSL for us, by us☆38Updated this week
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆43Updated 3 months ago
- Tenstorrent console based hardware information program☆43Updated this week
- Super fast FP32 matrix multiplication on RDNA3☆64Updated 2 months ago
- tiny code to access tenstorrent blackhole☆52Updated last month
- Tenstorrent MLIR compiler☆140Updated this week
- High-Performance SGEMM on CUDA devices☆95Updated 5 months ago
- ☆18Updated this week
- The TT-Forge FE is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their per…☆44Updated this week
- Repo for AI Compiler team. The intended purpose of this repo is for implementation of a PJRT device.☆18Updated this week
- Nvidia Instruction Set Specification Generator☆278Updated 11 months ago
- Attention in SRAM on Tenstorrent Grayskull☆36Updated 11 months ago
- The missing pieces (as far as boilerplate reduction goes) of the upstream MLIR python bindings.☆102Updated this week
- A lightweight, Pythonic, frontend for MLIR☆81Updated last year
- TT-Studio : An all-in-one platform to deploy and manage AI models optimized for Tenstorrent hardware with dedicated front-end demo applic…☆20Updated this week
- MLIR-based partitioning system☆97Updated this week
- Custom PTX Instruction Benchmark☆126Updated 3 months ago
- materials available to the public☆25Updated 7 months ago
- Tensor library with autograd using only Rust's standard library☆68Updated 11 months ago
- Fork of Triton repository for OpenXLA uses of the Triton language and compiler☆11Updated 2 weeks ago
- ☆29Updated 3 months ago
- asynchronous/distributed speculative evaluation for llama3☆39Updated 10 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆26Updated last week