eevaain / tiny-tpu-oldLinks
A minimal Tensor Processing Unit (TPU) inspired by Google's TPUv1.
☆167Updated last year
Alternatives and similar repositories for tiny-tpu-old
Users that are interested in tiny-tpu-old are comparing it to the libraries listed below
Sorting:
- could we make an ml stack in 100,000 lines of code?☆46Updated last year
- A minimal tensor processing unit (TPU), inspired by Google's TPU V2 and V1☆467Updated this week
- parallelized hyperdimensional tictactoe☆124Updated 11 months ago
- Verilog package manager written in Rust☆144Updated 10 months ago
- Tiny ASIC implementation for "The Era of 1-bit LLMs All Large Language Models are in 1.58 Bits" matrix multiplication unit☆159Updated last year
- Tensor library with autograd using only Rust's standard library☆69Updated last year
- Run 64-bit Linux on LiteX + RocketChip☆201Updated 3 weeks ago
- Solve puzzles to improve your tinygrad skills!☆142Updated 5 months ago
- Nvidia Instruction Set Specification Generator☆289Updated last year
- PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)☆66Updated 4 months ago
- Learning about CUDA by writing PTX code.☆134Updated last year
- Tutorials on tinygrad☆402Updated 2 weeks ago
- RDNA3 emulator☆54Updated 4 months ago
- An implementation of the transformer architecture onto an Nvidia CUDA kernel☆189Updated last year
- ctypes wrappers for HIP, CUDA, and OpenCL☆130Updated last year
- It's a core. Made on Twitch.☆262Updated 3 years ago
- pytorch from scratch in pure C/CUDA and python☆40Updated 10 months ago
- ☆89Updated this week
- A really tiny autograd engine☆95Updated 2 months ago
- Custom PTX Instruction Benchmark☆126Updated 5 months ago
- ☆55Updated last week
- High Quality Resources on GPU Programming/Architecture☆588Updated last year
- tiny code to access tenstorrent blackhole☆59Updated 2 months ago
- ☆449Updated 4 months ago
- A deep dive on the history of robotics and the future of humanoids☆99Updated 8 months ago
- Alex Krizhevsky's original code from Google Code☆196Updated 9 years ago
- Open source machine learning accelerators☆386Updated last year
- Visualization of cache-optimized matrix multiplication☆155Updated 5 months ago
- ☆99Updated last year
- A open source reimplementation of Google's Tensor Processing Unit (TPU).☆697Updated 7 years ago