geohot / tt06-fp4-mac
FP4 MAC Array
☆18 Updated 11 months ago
Alternatives and similar repositories for tt06-fp4-mac:
Users who are interested in tt06-fp4-mac are comparing it to the repositories listed below
- tenstorrent kernel from twitch ☆27 Updated 11 months ago
- RDNA3 emulator ☆52 Updated this week
- Generate python ctypes classes from C headers. Requires LLVM clang ☆15 Updated 7 months ago
- ctypes wrappers for HIP, CUDA, and OpenCL ☆128 Updated 8 months ago
- An implementation of delta-iris in tinygrad ☆71 Updated 6 months ago
- pytorch from scratch in pure C/CUDA and python ☆40 Updated 5 months ago
- parallelized hyperdimensional tictactoe ☆114 Updated 6 months ago
- Can RL solve simple problems? ☆54 Updated last year
- TinyFive is a lightweight RISC-V emulator and assembler written in Python with neural network examples ☆55 Updated last year
- Learning about CUDA by writing PTX code. ☆123 Updated last year
- Tensor library with autograd using only Rust's standard library ☆65 Updated 8 months ago
- Rust Implementation of micrograd ☆51 Updated 8 months ago
- Simple Byte Pair Encoding mechanism used for tokenization, written purely in C ☆128 Updated 4 months ago
- A minimal Tensor Processing Unit (TPU) inspired by Google's TPUv1. ☆134 Updated 7 months ago
- The Finite Field Assembly Programming Language ☆35 Updated 3 weeks ago
- GGML implementation of BERT model with Python bindings and quantization. ☆24 Updated last year
- Scalar-valued Automatic Differentiation library in C ☆48 Updated last year
- Extensive introductory writeup on Zig language functionalities ☆10 Updated 7 months ago
- A simple Aarch64 hypervisor for Raspberry Pi ☆34 Updated 4 years ago
- GPT-2 in C ☆65 Updated 2 months ago
- gradient-based symbolic execution engine implemented from scratch ☆35 Updated last year
- ☆72 Updated this week
- ☆29 Updated 2 months ago
- Because it's there. ☆15 Updated 5 months ago
- Trying to find the minimal model that can achieve 99% accuracy on the MNIST dataset ☆24 Updated 6 years ago
- Nvidia Instruction Set Specification Generator ☆254 Updated 8 months ago
- Fork of Triton repository for OpenXLA uses of the Triton language and compiler ☆11 Updated this week
- minimal diffusion transformer in pytorch. ☆16 Updated 5 months ago