geohot / tt-twitch
tenstorrent kernel from twitch
☆27 Updated last year
Alternatives and similar repositories for tt-twitch:
Users who are interested in tt-twitch are comparing it to the repositories listed below
- RDNA3 emulator ☆54 Updated this week
- Tenstorrent's MLIR Based Compiler. We aim to enable developers to run AI on all configurations of Tenstorrent hardware, through an open-s… ☆39 Updated this week
- FP4 MAC Array ☆17 Updated last year
- Custom PTX Instruction Benchmark ☆123 Updated last month
- ☆27 Updated last month
- ctypes wrappers for HIP, CUDA, and OpenCL ☆129 Updated 9 months ago
- High-Performance SGEMM on CUDA devices ☆90 Updated 3 months ago
- An experimental CPU backend for Triton (https://github.com/openai/triton) ☆40 Updated last month
- Repo for AI Compiler team. The intended purpose of this repo is for implementation of a PJRT device. ☆14 Updated this week
- IREE's PyTorch Frontend, based on Torch Dynamo. ☆78 Updated this week
- Attention in SRAM on Tenstorrent Grayskull ☆33 Updated 9 months ago
- LLM training in simple, raw C/CUDA ☆92 Updated 11 months ago
- asynchronous/distributed speculative evaluation for llama3 ☆39 Updated 8 months ago
- Tenstorrent MLIR compiler ☆119 Updated this week
- Can I make an *optimizing* compiler under 1k lines of code? ☆56 Updated 2 months ago
- Schola is a plugin for enabling Reinforcement Learning (RL) in Unreal Engine. It provides tools to help developers create environments, d… ☆34 Updated 3 weeks ago
- ☆51 Updated last week
- The missing pieces (as far as boilerplate reduction goes) of the upstream MLIR python bindings. ☆89 Updated last week
- Reference Kernels for the Leaderboard ☆29 Updated this week
- Simple experiments on Tenstorrent GraySkull e75 chip ☆11 Updated 7 months ago
- Tenstorrent system interface library ☆17 Updated this week
- A lightweight MLIR Python frontend with support for PyTorch ☆23 Updated 7 months ago
- Experimental GPU language with meta-programming ☆22 Updated 7 months ago
- ☆13 Updated last month
- Tensor library with autograd using only Rust's standard library ☆67 Updated 9 months ago
- Learning about CUDA by writing PTX code. ☆128 Updated last year
- GPUOcelot: A dynamic compilation framework for PTX ☆187 Updated 2 months ago
- The Finite Field Assembly Programming Language ☆36 Updated last week
- Nvidia Instruction Set Specification Generator ☆256 Updated 9 months ago
- Random number library that generates pseudo-random and quasi-random numbers. ☆26 Updated this week