Qazalin / remu

RDNA3 emulator

☆54

Alternatives and similar repositories for remu

Users who are interested in remu are comparing it to the repositories listed below.

kuterd / nv_isa_solver
Nvidia Instruction Set Specification Generator
☆286 Updated last year
tenstorrent / tt-isa-documentation
☆53 Updated this week
tinygrad / gpuctypes
ctypes wrappers for HIP, CUDA, and OpenCL
☆130 Updated last year
geohot / tt-twitch
tenstorrent kernel from twitch
☆29 Updated last year
tenstorrent / luwen
Tenstorrent system interface library
☆30 Updated last week
tenstorrent / tt-smi
Tenstorrent console based hardware information program
☆49 Updated this week
tenstorrent / tt-forge
Tenstorrent's MLIR Based Compiler. We aim to enable developers to run AI on all configurations of Tenstorrent hardware, through an open-s…
☆99 Updated this week
geohot / tt-tiny
tiny code to access tenstorrent blackhole
☆57 Updated 2 months ago
tenstorrent / tt-mlir
Tenstorrent MLIR compiler
☆169 Updated this week
tinygrad / 7900xtx
☆449 Updated 4 months ago
LaurieWired / BenchmarkCustomPTX
Custom PTX Instruction Benchmark
☆126 Updated 5 months ago
tzakharko / m4-sme-exploration
Exploring the scalable matrix extension of the Apple M4 processor
☆193 Updated 9 months ago
geohot / tt06-fp4-mac
FP4 MAC Array
☆19 Updated last year
seb-v / fp32_sgemm_amd
Super fast FP32 matrix multiplication on RDNA3
☆70 Updated 4 months ago
tenstorrent / tt-kmd
Tenstorrent Kernel Module
☆50 Updated this week
gpuocelot / gpuocelot
GPUOcelot: A dynamic compilation framework for PTX
☆207 Updated 6 months ago
geohot / cuda_ioctl_sniffer
Sniff CUDA ioctls
☆204 Updated 2 years ago
exo-lang / exo
Exocompilation for productive programming of hardware accelerators
☆651 Updated this week
spikedoanz / tensor-tic-tac-toe
parallelized hyperdimensional tictactoe
☆123 Updated 11 months ago
makslevental / mlir-python-extras
The missing pieces (as far as boilerplate reduction goes) of the upstream MLIR python bindings.
☆104 Updated this week
LeetArxiv / Finite-Field-Assembly
The Finite Field Assembly Programming Language
☆36 Updated 2 months ago
salykova / sgemm.cu
High-Performance SGEMM on CUDA devices
☆98 Updated 6 months ago
tenstorrent / tensix-isa-simulator
☆29 Updated 4 months ago
moritztng / grayskull-attention
Attention in SRAM on Tenstorrent Grayskull
☆37 Updated last year
amd / fuzzyHSA
☆54 Updated last year
joennlae / halutmatmul
Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera accelerator
☆211 Updated last year
tenstorrent / pytorch2.0_ttnn
⭐️ TTNN Compiler for PyTorch 2 ⭐️ Enables running PyTorch models on Tenstorrent hardware using eager or compile path
☆53 Updated this week
tenstorrent / tt-forge-fe
The TT-Forge FE is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their per…
☆48 Updated this week
tenstorrent / tt-buda
Tenstorrent TT-BUDA Repository
☆315 Updated 4 months ago
iree-org / iree-turbine
IREE's PyTorch Frontend, based on Torch Dynamo.
☆94 Updated this week