eevaain / tiny-tpu-old
A minimal Tensor Processing Unit (TPU) inspired by Google's TPUv1.
☆185 Updated last year
Alternatives and similar repositories for tiny-tpu-old
Users who are interested in tiny-tpu-old are comparing it to the libraries listed below.
- could we make an ml stack in 100,000 lines of code? ☆46 Updated last year
- A minimal tensor processing unit (TPU), inspired by Google's TPU V2 and V1 ☆967 Updated 2 months ago
- parallelized hyperdimensional tictactoe ☆125 Updated last year
- Solve puzzles to improve your tinygrad skills! ☆145 Updated last week
- Nvidia Instruction Set Specification Generator ☆297 Updated last year
- Tiny ASIC implementation for "The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits" matrix multiplication unit ☆163 Updated last year
- Run 64-bit Linux on LiteX + RocketChip ☆202 Updated 2 weeks ago
- Verilog package manager written in Rust ☆143 Updated last year
- PTX-Tutorial Written Purely By AIs (Deep Research of OpenAI and Claude 3.7) ☆66 Updated 7 months ago
- Tensor library with autograd using only Rust's standard library ☆70 Updated last year
- Tutorials on tinygrad ☆431 Updated 2 weeks ago
- ☆94 Updated 2 weeks ago
- ☆76 Updated last month
- Open source machine learning accelerators ☆387 Updated last year
- Tenstorrent MLIR compiler ☆199 Updated this week
- [BRH YT CHANNEL] This repo contains all the code and resources you need for the Zynq tutorials, ready to copy and paste. ☆64 Updated 3 months ago
- Learning about CUDA by writing PTX code. ☆144 Updated last year
- A really tiny autograd engine ☆95 Updated 5 months ago
- Machine-Learning Accelerator System Exploration Tools ☆179 Updated 3 weeks ago
- RDNA3 emulator ☆54 Updated 6 months ago
- Visualization of cache-optimized matrix multiplication ☆155 Updated 7 months ago
- Quantized LLM training in pure CUDA/C++. ☆206 Updated last week
- It's a core. Made on Twitch. ☆264 Updated 3 years ago
- Attention in SRAM on Tenstorrent Grayskull ☆38 Updated last year
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch ☆276 Updated 11 months ago
- An open-source reimplementation of Google's Tensor Processing Unit (TPU). ☆706 Updated 7 years ago
- pytorch from scratch in pure C/CUDA and python ☆41 Updated last year
- An implementation of the transformer architecture onto an Nvidia CUDA kernel ☆191 Updated 2 years ago
- A machine learning accelerator core designed for energy-efficient AI at the edge. ☆1,530 Updated this week
- Tenstorrent TT-BUDA Repository ☆316 Updated 6 months ago