Alanma23 / tinytinyTPULinks
a mini 2x2 systolic array and PE demo
☆39Updated last week
Alternatives and similar repositories for tinytinyTPU
Users that are interested in tinytinyTPU are comparing it to the libraries listed below
Sorting:
- The Riallto Open Source Project from AMD☆83Updated 8 months ago
- General Matrix Multiplication using NVIDIA Tensor Cores☆27Updated 11 months ago
- A high-efficiency system-on-chip for floating-point compute workloads.☆44Updated 11 months ago
- Open-source RTL logic simulator with CUDA acceleration☆246Updated 2 months ago
- ☆35Updated 10 months ago
- Machine-Learning Accelerator System Exploration Tools☆183Updated last week
- A minimal Tensor Processing Unit (TPU) inspired by Google's TPUv1.☆192Updated last year
- ☆117Updated last year
- User-Mode Driver for Tenstorrent hardware☆36Updated this week
- ☆27Updated 9 months ago
- Universal Memory Interface (UMI)☆155Updated this week
- Attention in SRAM on Tenstorrent Grayskull☆39Updated last year
- Tiny ASIC implementation for "The Era of 1-bit LLMs All Large Language Models are in 1.58 Bits" matrix multiplication unit☆171Updated last year
- Repo for AI Compiler team. The intended purpose of this repo is for implementation of a PJRT device.☆48Updated this week
- ☆30Updated this week
- Fork of LLVM to support AMD AIEngine processors☆178Updated this week
- DNN Compiler for Heterogeneous SoCs☆58Updated last week
- Tenstorrent MLIR compiler☆226Updated this week
- ☆35Updated this week
- News and Paper Collections for Machine Learning Hardware☆22Updated 3 weeks ago
- Verilog package manager written in Rust☆143Updated last year
- Nvidia Instruction Set Specification Generator☆306Updated last year
- Self checking RISC-V directed tests☆118Updated 6 months ago
- ☆173Updated 2 years ago
- DHLS (Dynamic High-Level Synthesis) compiler based on MLIR☆155Updated this week
- The TT-Forge FE is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their per…☆53Updated this week
- ☆89Updated last week
- This project aims to enable language model inference on FPGAs, supporting AI applications in edge devices and environments with limited r…☆169Updated last year
- Custom PTX Instruction Benchmark☆137Updated 10 months ago
- Example for running IREE in a bare-metal Arm environment.☆40Updated 5 months ago