geohot / tt-twitchLinks
tenstorrent kernel from twitch
☆27Updated last year
Alternatives and similar repositories for tt-twitch
Users that are interested in tt-twitch are comparing it to the libraries listed below
Sorting:
- RDNA3 emulator☆54Updated last month
- ☆30Updated this week
- Super fast FP32 matrix multiplication on RDNA3☆61Updated 2 months ago
- Embedded Universal DSL: a good DSL for us, by us☆37Updated this week
- Tenstorrent's MLIR Based Compiler. We aim to enable developers to run AI on all configurations of Tenstorrent hardware, through an open-s…☆59Updated this week
- FP4 MAC Array☆19Updated last year
- ctypes wrappers for HIP, CUDA, and OpenCL☆129Updated 11 months ago
- Learning about CUDA by writing PTX code.☆131Updated last year
- Custom PTX Instruction Benchmark☆126Updated 3 months ago
- ☆29Updated 2 months ago
- Schola is a plugin for enabling Reinforcement Learning (RL) in Unreal Engine. It provides tools to help developers create environments, d…☆42Updated 3 weeks ago
- Tensor library with autograd using only Rust's standard library☆68Updated 11 months ago
- A lightweight MLIR Python frontend with support for PyTorch☆23Updated 9 months ago
- LLM training in simple, raw C/CUDA☆99Updated last year
- A framework that support executing unmodified CUDA source code on non-NVIDIA devices.☆127Updated 5 months ago
- ☆54Updated 11 months ago
- tiny code to access tenstorrent blackhole☆48Updated last week
- LLVM Code Generation, published by Packt☆35Updated last week
- Nvidia Instruction Set Specification Generator☆271Updated 10 months ago
- Attention in SRAM on Tenstorrent Grayskull☆35Updated 10 months ago
- Tenstorrent system interface library☆19Updated last week
- ☆54Updated this week
- A GLSL compiler targeting SPIR-V mlir☆20Updated 7 months ago
- Simple experiments on Tenstorrent GraySkull e75 chip☆11Updated 9 months ago
- The Finite Field Assembly Programming Language☆37Updated 2 weeks ago
- TinyFive is a lightweight RISC-V emulator and assembler written in Python with neural network examples☆62Updated last year
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆43Updated 2 months ago
- asynchronous/distributed speculative evaluation for llama3☆38Updated 9 months ago
- Tenstorrent MLIR compiler☆132Updated this week
- pytorch from scratch in pure C/CUDA and python☆40Updated 7 months ago