geohot / tt-tinyLinks
tiny code to access tenstorrent blackhole
☆60Updated 5 months ago
Alternatives and similar repositories for tt-tiny
Users that are interested in tt-tiny are comparing it to the libraries listed below
Sorting:
- RDNA3 emulator☆54Updated 6 months ago
- ☆76Updated last month
- The Finite Field Assembly Programming Language☆36Updated 5 months ago
- Tensor library & inference framework for machine learning☆113Updated 3 weeks ago
- ctypes wrappers for HIP, CUDA, and OpenCL☆130Updated last year
- ☆42Updated 2 weeks ago
- Tensor library with autograd using only Rust's standard library☆70Updated last year
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆124Updated 6 months ago
- An implementation of delta-iris in tinygrad☆72Updated last year
- Standalone commandline CLI tool for compiling Triton kernels☆18Updated last year
- SIMD quantization kernels☆89Updated last month
- Learning about CUDA by writing PTX code.☆144Updated last year
- Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.☆381Updated last week
- Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera accelerator☆214Updated last year
- Nvidia Instruction Set Specification Generator☆297Updated last year
- PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)☆66Updated 7 months ago
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP☆133Updated last month
- ☆248Updated last year
- High-Performance SGEMM on CUDA devices☆107Updated 9 months ago
- Tenstorrent console based hardware information program☆54Updated this week
- A minimalistic C++ Jinja templating engine for LLM chat templates☆190Updated last month
- Tenstorrent's MLIR Based Compiler. We aim to enable developers to run AI on all configurations of Tenstorrent hardware, through an open-s…☆126Updated this week
- ⭐️ TTNN Compiler for PyTorch 2 ⭐️ Enables running PyTorch models on Tenstorrent hardware using eager or compile path☆57Updated this week
- GPEmu, a GPU emulator for faster and cheaper prototyping and evaluation of deep learning system research☆30Updated 10 months ago
- ☆16Updated 4 months ago
- Train neural networks that distill into logic circuits, using JAX☆62Updated 4 months ago
- C API for MLX☆144Updated last month
- Ultra low overhead NVIDIA GPU telemetry plugin for telegraf with memory temperature readings.☆63Updated last year
- ☆448Updated 6 months ago
- Custom PTX Instruction Benchmark☆131Updated 8 months ago