tiny-tpu-v2 / tiny-tpuLinks
A minimal tensor processing unit (TPU), inspired by Google's TPU V2 and V1
☆1,161Updated 5 months ago
Alternatives and similar repositories for tiny-tpu
Users that are interested in tiny-tpu are comparing it to the libraries listed below
Sorting:
- A machine learning accelerator core designed for energy-efficient AI at the edge.☆2,040Updated this week
- A open source reimplementation of Google's Tensor Processing Unit (TPU).☆731Updated 8 years ago
- A minimal Tensor Processing Unit (TPU) inspired by Google's TPUv1.☆194Updated last year
- ☆308Updated this week
- Run 64-bit Linux on LiteX + RocketChip☆209Updated 3 months ago
- CUDA Tile IR is an MLIR-based intermediate representation and compiler infrastructure for CUDA kernel optimization, focusing on tile-base…☆823Updated 3 weeks ago
- Tenstorrent TT-BUDA Repository☆314Updated 10 months ago
- OpenSource GPU, in Verilog, loosely based on RISC-V ISA☆1,239Updated last year
- Machine-Learning Accelerator System Exploration Tools☆197Updated 2 weeks ago
- ☆161Updated last month
- Tenstorrent's MLIR Based Compiler. We aim to enable developers to run AI on all configurations of Tenstorrent hardware, through an open-s…☆185Updated this week
- Nvidia Instruction Set Specification Generator☆311Updated last year
- An MLIR-based toolchain for AMD AI Engine-enabled devices.☆580Updated this week
- Tiny ASIC implementation for "The Era of 1-bit LLMs All Large Language Models are in 1.58 Bits" matrix multiplication unit☆175Updated last year
- Berkeley's Spatial Array Generator☆1,215Updated this week
- Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O☆550Updated 4 months ago
- a mini 2x2 systolic array and PE demo☆68Updated last month
- TT-NN operator library, and TT-Metalium low level kernel programming model.☆1,343Updated this week
- ☆119Updated 2 years ago
- Open source machine learning accelerators☆397Updated last year
- A custom AI chip to be taped out soon!☆41Updated last month
- Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.☆440Updated last month
- Ocelot: The Berkeley Out-of-Order Machine With V-EXT support☆226Updated 3 weeks ago
- Allo Accelerator Design and Programming Framework (PLDI'24)☆343Updated this week
- Fast and Furious AMD Kernels☆348Updated 2 weeks ago
- ☆1,885Updated this week
- This project aims to enable language model inference on FPGAs, supporting AI applications in edge devices and environments with limited r…☆171Updated last year
- Algebraic enhancements for GEMM & AI accelerators☆287Updated 11 months ago
- GPU documentation for humans☆518Updated last week
- Self checking RISC-V directed tests☆119Updated 8 months ago