geohot / tt-twitch
tenstorrent kernel from twitch
☆27Updated 11 months ago
Alternatives and similar repositories for tt-twitch:
Users that are interested in tt-twitch are comparing it to the libraries listed below
- RDNA3 emulator☆52Updated this week
- ctypes wrappers for HIP, CUDA, and OpenCL☆128Updated 8 months ago
- FP4 MAC Array☆18Updated 10 months ago
- LLM training in simple, raw C/CUDA☆92Updated 10 months ago
- parallelized hyperdimensional tictactoe☆113Updated 6 months ago
- Learning about CUDA by writing PTX code.☆123Updated last year
- The Finite Field Assembly Programming Language☆35Updated 3 weeks ago
- A lightweight MLIR Python frontend with support for PyTorch☆23Updated 6 months ago
- High-Performance SGEMM on CUDA devices☆86Updated last month
- Rust Implementation of micrograd☆51Updated 8 months ago
- Attention in SRAM on Tenstorrent Grayskull☆32Updated 7 months ago
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆39Updated 10 months ago
- Simple experiments on Tenstorrent GraySkull e75 chip☆10Updated 6 months ago
- Tensor library with autograd using only Rust's standard library☆65Updated 8 months ago
- Tenstorrent system interface library☆14Updated this week
- Tenstorrent MLIR compiler☆100Updated this week
- Random number library that generate pseudo-random and quasi-random numbers.☆26Updated this week
- Embedded Universal DSL: a good DSL for us, by us☆32Updated this week
- asynchronous/distributed speculative evaluation for llama3☆38Updated 7 months ago
- Because it's there.☆15Updated 5 months ago
- Standalone commandline CLI tool for compiling Triton kernels☆17Updated 6 months ago
- Exploration into the Firefly algorithm in Pytorch☆35Updated last month
- An implementation of delta-iris in tinygrad☆71Updated 6 months ago
- Better bindings for Python☆17Updated 2 years ago
- ☆54Updated 8 months ago
- Nvidia Instruction Set Specification Generator☆254Updated 8 months ago
- pytorch from scratch in pure C/CUDA and python☆40Updated 5 months ago
- ☆21Updated last week
- CUDA kernels in any language supported by LLVM☆24Updated last year