eevaain / tiny-tpu-oldLinks
A minimal Tensor Processing Unit (TPU) inspired by Google's TPUv1.
☆189Updated last year
Alternatives and similar repositories for tiny-tpu-old
Users that are interested in tiny-tpu-old are comparing it to the libraries listed below
Sorting:
- A minimal tensor processing unit (TPU), inspired by Google's TPU V2 and V1☆1,042Updated 3 months ago
- ☆80Updated 2 weeks ago
- parallelized hyperdimensional tictactoe☆125Updated last year
- could we make an ml stack in 100,000 lines of code?☆46Updated last year
- Tiny ASIC implementation for "The Era of 1-bit LLMs All Large Language Models are in 1.58 Bits" matrix multiplication unit☆171Updated last year
- Run 64-bit Linux on LiteX + RocketChip☆207Updated last month
- Open source machine learning accelerators☆392Updated last year
- Nvidia Instruction Set Specification Generator☆301Updated last year
- Tensor library with autograd using only Rust's standard library☆70Updated last year
- Verilog package manager written in Rust☆143Updated last year
- Solve puzzles to improve your tinygrad skills!☆164Updated last month
- A open source reimplementation of Google's Tensor Processing Unit (TPU).☆712Updated 8 years ago
- Machine-Learning Accelerator System Exploration Tools☆183Updated last month
- PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)☆66Updated 8 months ago
- Learning about CUDA by writing PTX code.☆148Updated last year
- A really tiny autograd engine☆96Updated 6 months ago
- It's a core. Made on Twitch.☆265Updated 4 years ago
- Visualization of cache-optimized matrix multiplication☆156Updated 8 months ago
- ☆449Updated 8 months ago
- pytorch from scratch in pure C/CUDA and python☆41Updated last year
- RISC-V XV6/Linux SoC, marchID: 0x2b☆1,000Updated last week
- Tutorials on tinygrad☆441Updated last month
- Tenstorrent MLIR compiler☆217Updated this week
- Tenstorrent TT-BUDA Repository☆313Updated 8 months ago
- ☆113Updated last year
- tiny code to access tenstorrent blackhole☆61Updated 6 months ago
- Attention in SRAM on Tenstorrent Grayskull☆39Updated last year
- ctypes wrappers for HIP, CUDA, and OpenCL☆130Updated last year
- An implementation of the transformer architecture onto an Nvidia CUDA kernel☆196Updated 2 years ago
- CORE-V Wally is a configurable RISC-V Processor associated with RISC-V System-on-Chip Design textbook. Contains a 5-stage pipeline, suppo…☆450Updated this week