tenstorrent / tt-isa-documentationLinks

☆53

Alternatives and similar repositories for tt-isa-documentation

Users that are interested in tt-isa-documentation are comparing it to the libraries listed below

Sorting:

tenstorrent / tt-mlir
Tenstorrent MLIR compiler
☆165Updated this week
tinygrad / gpuctypes
ctypes wrappers for HIP, CUDA, and OpenCL
☆130Updated last year
tenstorrent / tt-forge
Tenstorrent's MLIR Based Compiler. We aim to enable developers to run AI on all configurations of Tenstorrent hardware, through an open-s…
☆96Updated this week
gpuocelot / gpuocelot
GPUOcelot: A dynamic compilation framework for PTX
☆207Updated 5 months ago
Qazalin / remu
RDNA3 emulator
☆54Updated 3 months ago
tenstorrent / pytorch2.0_ttnn
⭐️ TTNN Compiler for PyTorch 2 ⭐️ Enables running PyTorch models on Tenstorrent hardware using eager or compile path
☆52Updated last week
kuterd / nv_isa_solver
Nvidia Instruction Set Specification Generator
☆285Updated last year
seb-v / fp32_sgemm_amd
Super fast FP32 matrix multiplication on RDNA3
☆70Updated 4 months ago
pytorch-labs / triton-cpu
An experimental CPU backend for Triton (https//github.com/openai/triton)
☆43Updated 4 months ago
iree-org / iree-turbine
IREE's PyTorch Frontend, based on Torch Dynamo.
☆94Updated this week
salykova / sgemm.cu
High-Performance SGEMM on CUDA devices
☆98Updated 6 months ago
moritztng / grayskull-attention
Attention in SRAM on Tenstorrent Grayskull
☆37Updated last year
tenstorrent / tt-forge-fe
The TT-Forge FE is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their per…
☆48Updated this week
geohot / tt-twitch
tenstorrent kernel from twitch
☆28Updated last year
makslevental / mlir-python-extras
The missing pieces (as far as boilerplate reduction goes) of the upstream MLIR python bindings.
☆104Updated last week
openxla / shardy
MLIR-based partitioning system
☆115Updated this week
LaurieWired / BenchmarkCustomPTX
Custom PTX Instruction Benchmark
☆126Updated 5 months ago
makslevental / nelli
A lightweight, Pythonic, frontend for MLIR
☆80Updated last year
gevtushenko / llm.c
LLM training in simple, raw C/CUDA
☆102Updated last year
intel / mlir-extensions
Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.
☆138Updated this week
microsoft / Accera
Open source cross-platform compiler for compute-intensive loops used in AI algorithms, from Microsoft Research
☆109Updated last year
tenstorrent / tt-budabackend
Buda Compiler Backend for Tenstorrent devices
☆29Updated 4 months ago
wpmed92 / shaderpulse
A GLSL compiler targeting SPIR-V mlir
☆20Updated 9 months ago
tenstorrent / tt-xla
Repo for AI Compiler team. The intended purpose of this repo is for implementation of a PJRT device.
☆19Updated this week
tenstorrent / tt-buda-demos
Repository of model demos using TT-Buda
☆62Updated 4 months ago
jaebaek / tenstorrent-tiny-examples
Simple experiments on Tenstorrent GraySkull e75 chip
☆12Updated 11 months ago
NicolaLancellotti / metal-dialect
MLIR metal dialect
☆30Updated 10 months ago
ROCm / rocMLIR
☆148Updated this week
geohot / tt-tiny
tiny code to access tenstorrent blackhole
☆57Updated 2 months ago
iree-org / iree-experimental
Experiments and prototypes associated with IREE or MLIR
☆54Updated 11 months ago