geohot / tt-tinyLinks
tiny code to access tenstorrent blackhole
☆59Updated 3 months ago
Alternatives and similar repositories for tt-tiny
Users that are interested in tt-tiny are comparing it to the libraries listed below
Sorting:
- RDNA3 emulator☆54Updated 4 months ago
- The Finite Field Assembly Programming Language☆36Updated 3 months ago
- Tensor library & inference framework for machine learning☆107Updated last week
- Tensor library with autograd using only Rust's standard library☆69Updated last year
- ☆58Updated this week
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP☆106Updated this week
- ☆36Updated last week
- Ultra low overhead NVIDIA GPU telemetry plugin for telegraf with memory temperature readings.☆62Updated last year
- Train neural networks that distill into logic circuits, using JAX☆62Updated 2 months ago
- A minimalistic C++ Jinja templating engine for LLM chat templates☆170Updated 2 weeks ago
- An implementation of delta-iris in tinygrad☆72Updated last year
- PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)☆66Updated 5 months ago
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆126Updated 4 months ago
- Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera accelerator☆211Updated last year
- SIMD quantization kernels☆83Updated this week
- Standalone commandline CLI tool for compiling Triton kernels☆17Updated 11 months ago
- High-Performance SGEMM on CUDA devices☆97Updated 7 months ago
- Learning about CUDA by writing PTX code.☆134Updated last year
- ctypes wrappers for HIP, CUDA, and OpenCL☆130Updated last year
- ☆249Updated last year
- Because it's there.☆16Updated 11 months ago
- LLM training in simple, raw C/Metal Shading Language☆55Updated last year
- An implementation of bucketMul LLM inference☆223Updated last year
- Tenstorrent console based hardware information program☆52Updated last week
- parallelized hyperdimensional tictactoe☆124Updated last year
- Custom PTX Instruction Benchmark☆126Updated 6 months ago
- Nvidia Instruction Set Specification Generator☆290Updated last year
- It's a baby compiler. (Lean btw.)☆16Updated 3 months ago
- Samples of good AI generated CUDA kernels☆89Updated 2 months ago
- webgpu autograd library☆31Updated 3 months ago