geohot / tt-twitch
tenstorrent kernel from twitch
☆27Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for tt-twitch
- RDNA3 emulator☆46Updated last week
- FP4 MAC Array☆18Updated 7 months ago
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆35Updated 6 months ago
- ctypes wrappers for HIP, CUDA, and OpenCL☆126Updated 4 months ago
- Nvidia Instruction Set Specification Generator☆215Updated 4 months ago
- ☆52Updated 5 months ago
- Tenstorrent Kernel Module☆33Updated last week
- A high-efficiency system-on-chip for floating-point compute workloads.☆16Updated this week
- Bistra is a domain-specific language designed to generate high-performance kernels (such as GEMMs, convolutions, etc). The program is des…☆6Updated 8 months ago
- asynchronous/distributed speculative evaluation for llama3☆37Updated 3 months ago
- Tenstorrent system interface library☆14Updated 2 weeks ago
- The missing pieces (as far as boilerplate reduction goes) of the upstream MLIR python bindings.☆69Updated this week
- IREE's PyTorch Frontend, based on Torch Dynamo.☆55Updated this week
- Attention in SRAM on Tenstorrent Grayskull☆29Updated 4 months ago
- Tenstorrent MLIR compiler☆76Updated this week
- The Riallto Open Source Project from AMD☆69Updated last week
- Buda Compiler Backend for Tenstorrent devices☆26Updated 2 months ago
- VectorVisor is a vectorizing binary translator for GPUs, designed to make it easy to run many copies of a single-threaded WebAssembly pro…☆147Updated last month
- A lightweight, Pythonic, frontend for MLIR☆80Updated last year
- A lightweight MLIR Python frontend with support for PyTorch☆21Updated 2 months ago
- An experimental CPU backend for Triton☆56Updated last week
- ☆17Updated last month
- Repository for AI model benchmarking.☆11Updated this week
- A Rust Library for High-Performance Tensor Exchange with Python☆39Updated this week
- Repository of model demos using TT-Buda☆55Updated 3 weeks ago
- Tenstorrent console based hardware information program☆23Updated 2 weeks ago
- A pure, low-level tensor program representation enabling tensor program optimization via program rewriting. See the web demo at https://g…☆71Updated 5 months ago
- A framework that support executing unmodified CUDA source code on non-NVIDIA devices.☆105Updated 3 months ago
- Sample Python extension using Rust/PyO3/tch to interact with PyTorch☆32Updated 9 months ago
- ☆65Updated this week