jofrfu/tinyTPU

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jofrfu/tinyTPU)

jofrfu / tinyTPU

Implementation of a Tensor Processing Unit for embedded systems and the IoT.

☆559

Alternatives and similar repositories for tinyTPU

Users that are interested in tinyTPU are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

cameronshinn / tiny-tpu
View on GitHub
Small-scale Tensor Processing Unit built on an FPGA
☆221Aug 4, 2019Updated 6 years ago
leo47007 / TPU-Tensor-Processing-Unit
View on GitHub
IC implementation of TPU
☆153Dec 18, 2019Updated 6 years ago
embedeep / Free-TPU
View on GitHub
Free TPU for FPGA with compiler supporting Pytorch/Caffe/Darknet/NCNN. An AI processor for using Xilinx FPGA to solve image classificatio…
☆270May 6, 2023Updated 2 years ago
cea-wind / SimpleTPU
View on GitHub
A FPGA Based CNN accelerator, following Google's TPU V1.
☆175Jul 25, 2019Updated 6 years ago
embedeep / FREE-TPU-V3plus-for-FPGA
View on GitHub
FREE TPU V3plus for FPGA is the free version of a commercial AI processor (EEP-TPU) for Deep Learning EDGE Inference
☆174Jun 9, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
UCSBarchlab / OpenTPU
View on GitHub
A open source reimplementation of Google's Tensor Processing Unit (TPU).
☆747Dec 6, 2017Updated 8 years ago
abdelazeem201 / Systolic-array-implementation-in-RTL-for-TPU
View on GitHub
IC implementation of Systolic Array for TPU
☆353Oct 21, 2024Updated last year
dldldlfma / super_small_toy_tpu
View on GitHub
☆53Jan 14, 2021Updated 5 years ago
charley871103 / TPU
View on GitHub
AI Chip project
☆34Jul 14, 2021Updated 4 years ago
ChrisZonghaoLi / cnn_conv_accelerator
View on GitHub
A Fix-pointed Rudimentary CNN Convolution Accelerator
☆16Oct 7, 2020Updated 5 years ago
Dazhuzhu-github / systolic-array
View on GitHub
verilog实现TPU中的脉动阵列计算卷积的module
☆167May 10, 2025Updated 11 months ago
8krisv / CNN-ACCELERATOR
View on GitHub
Hardware accelerator for convolutional neural networks
☆70Aug 9, 2022Updated 3 years ago
IBM / AccDNN
View on GitHub
A compiler from AI model to RTL (Verilog) accelerator in FPGA hardware with auto design space exploration.
☆450Dec 2, 2019Updated 6 years ago
horizon-research / systolic-array-dataflow-optimizer
View on GitHub
A general framework for optimizing DNN dataflow on systolic array
☆39Jan 2, 2021Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
doonny / PipeCNN
View on GitHub
An OpenCL-based FPGA Accelerator for Convolutional Neural Networks
☆1,374Feb 14, 2022Updated 4 years ago
taoyilee / clacc
View on GitHub
Deep Learning Accelerator (Convolution Neural Networks)
☆200Dec 15, 2017Updated 8 years ago
intel / fpga-npu
View on GitHub
☆249Apr 8, 2024Updated 2 years ago
ucb-bar / gemmini
View on GitHub
Berkeley's Spatial Array Generator
☆1,294Mar 29, 2026Updated last month
danielholanda / LeFlow
View on GitHub
Enabling Flexible FPGA High-Level Synthesis of Tensorflow Deep Neural Networks
☆623Jan 3, 2020Updated 6 years ago
ac-optimus / Convolution-using-systolic-arrays
View on GitHub
☆73Dec 12, 2018Updated 7 years ago
lirui-shanghaitech / CNN-Accelerator-VLSI
View on GitHub
Convolutional accelerator kernel, target ASIC & FPGA
☆254Apr 10, 2023Updated 3 years ago
cxdzyq1110 / NPU_on_FPGA
View on GitHub
在FPGA上面实现一个NPU计算单元。能够执行矩阵运算（ADD/ADDi/ADDs/MULT/MULTi/DOT等）、图像处理运算（CONV/POOL等）、非线性映射（RELU/TANH/SIGM等）。
☆308Aug 16, 2018Updated 7 years ago
thousrm / universal_NPU-CNN_accelerator
View on GitHub
hardware design of universal NPU(CNN accelerator) for various convolution neural network
☆171Mar 5, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
hsharma35 / dnnweaver2
View on GitHub
Open Source Specialized Computing Stack for Accelerating Deep Neural Networks.
☆228Apr 22, 2019Updated 7 years ago
freecores / theia_gpu
View on GitHub
Theia: ray graphic processing unit
☆20Jul 17, 2014Updated 11 years ago
LeiWang1999 / ZYNQ-NVDLA
View on GitHub
NVDLA (An Opensource DL Accelerator Framework) implementation on FPGA.
☆385Dec 27, 2023Updated 2 years ago
arasi15 / CNN-Accelerator-Implementation-based-on-Eyerissv2
View on GitHub
☆125Jul 22, 2020Updated 5 years ago
spcl / gemm_hls
View on GitHub
Scalable systolic array-based matrix-matrix multiplication implemented in Vivado HLS for Xilinx FPGAs.
☆382Jan 20, 2025Updated last year
hsiehong / tpu
View on GitHub
AIChip 2021 project, NCKU
☆18May 6, 2021Updated 4 years ago
karthisugumar / CSE240D-Hierarchical_Mesh_NoC-Eyeriss_v2
View on GitHub
A SystemVerilog implementation of Row-Stationary dataflow and Hierarchical Mesh Network-on-Chip Architecture based on Eyeriss CNN Acceler…
☆183Dec 14, 2019Updated 6 years ago
maomran / softmax
View on GitHub
Verilog implementation of Softmax function
☆82Jul 27, 2022Updated 3 years ago
YqGe585 / Neural-Processing-Unit-on-FPGA
View on GitHub
Superscalar Out-of-Order NPU Design on FPGA
☆14May 17, 2024Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
jneless / EyerissF
View on GitHub
An Eyeriss Chip (researched by MIT, a CNN accelerator) simulator and New DNN framework "Hive"
☆224Dec 22, 2020Updated 5 years ago
georgia-tech-synergy-lab / SIGMA
View on GitHub
RTL implementation of Flex-DPE.
☆116Feb 22, 2020Updated 6 years ago
CASR-HKU / MSD-FCCM23
View on GitHub
Open-source of MSD framework
☆16Sep 12, 2023Updated 2 years ago
mpskex / chisel-npu
View on GitHub
Chisel implementation of Neural Processing Unit for System on the Chip
☆27Apr 22, 2026Updated last week
stillwater-sc / RISC-V-TensorCore
View on GitHub
Transactional Verilog design and Verilator Testbench for a RISC-V TensorCore Vector co-processor for reproducible linear algebra
☆63Dec 19, 2021Updated 4 years ago
apache / tvm-vta
View on GitHub
Open, Modular, Deep Learning Accelerator
☆341Apr 10, 2024Updated 2 years ago
evan199893 / TPU_systolic_array_HW_accelerator
View on GitHub
Tensor Processing Unit implementation in Verilog
☆14Mar 18, 2025Updated last year