cea-wind/SimpleTPU

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/cea-wind/SimpleTPU)

cea-wind / SimpleTPU

A FPGA Based CNN accelerator, following Google's TPU V1.

☆175

Alternatives and similar repositories for SimpleTPU

Users that are interested in SimpleTPU are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jofrfu / tinyTPU
View on GitHub
Implementation of a Tensor Processing Unit for embedded systems and the IoT.
☆571Jan 5, 2019Updated 7 years ago
cameronshinn / tiny-tpu
View on GitHub
Small-scale Tensor Processing Unit built on an FPGA
☆228Aug 4, 2019Updated 6 years ago
leo47007 / TPU-Tensor-Processing-Unit
View on GitHub
IC implementation of TPU
☆156Dec 18, 2019Updated 6 years ago
dldldlfma / super_small_toy_tpu
View on GitHub
☆55Jan 14, 2021Updated 5 years ago
embedeep / FREE-TPU-V3plus-for-FPGA
View on GitHub
FREE TPU V3plus for FPGA is the free version of a commercial AI processor (EEP-TPU) for Deep Learning EDGE Inference
☆176Jun 9, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
embedeep / Free-TPU
View on GitHub
Free TPU for FPGA with compiler supporting Pytorch/Caffe/Darknet/NCNN. An AI processor for using Xilinx FPGA to solve image classificatio…
☆273May 6, 2023Updated 3 years ago
charley871103 / TPU
View on GitHub
AI Chip project
☆34Jul 14, 2021Updated 5 years ago
xliu0709 / WinoCNN
View on GitHub
An HLS based winograd systolic CNN accelerator
☆54Jul 18, 2021Updated 5 years ago
lirui-shanghaitech / CNN-Accelerator-VLSI
View on GitHub
Convolutional accelerator kernel, target ASIC & FPGA
☆257Apr 10, 2023Updated 3 years ago
lirui-shanghaitech / A-convolution-kernel-implemented-by-Vivado-HLS
View on GitHub
This project implements a convolution kernel based on vivado HLS on zcu104
☆36Mar 15, 2020Updated 6 years ago
UCSBarchlab / OpenTPU
View on GitHub
A open source reimplementation of Google's Tensor Processing Unit (TPU).
☆775Dec 6, 2017Updated 8 years ago
horizon-research / systolic-array-dataflow-optimizer
View on GitHub
A general framework for optimizing DNN dataflow on systolic array
☆40Jan 2, 2021Updated 5 years ago
abdelazeem201 / Systolic-array-implementation-in-RTL-for-TPU
View on GitHub
IC implementation of Systolic Array for TPU
☆365Oct 21, 2024Updated last year
tirumalnaidu / opencl-hls-cnn-accelerator
View on GitHub
OpenCL HLS based CNN Accelerator on Intel DE10 Nano FPGA.
☆82Oct 3, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
mpskex / chisel-npu
View on GitHub
Chisel implementation of Neural Processing Unit for System on the Chip
☆34May 22, 2026Updated last month
19801201 / SpinalHDL_CNN_Accelerator
View on GitHub
CNN accelerator implemented with Spinal HDL
☆160Jan 29, 2024Updated 2 years ago
zjnyly / TeraFly
View on GitHub
[DATE'2025, TCAD'2025] Terafly : A Multi-Node FPGA Based Accelerator Design for Efficient Cooperative Inference in LLMs
☆38Nov 13, 2025Updated 8 months ago
jiaaom / HPDLA
View on GitHub
Systolic-array based Deep Learning Accelerator generator
☆29Dec 11, 2020Updated 5 years ago
Xilinx / finn-hlslib
View on GitHub
Vitis HLS Library for FINN
☆224May 27, 2026Updated last month
cjg91 / trans-fat
View on GitHub
An FPGA Accelerator for Transformer Inference
☆95Apr 29, 2022Updated 4 years ago
cornell-brg / pymtl-tut-hls
View on GitHub
Tutorial for integrating PyMTL and Vivado HLS
☆20Apr 17, 2016Updated 10 years ago
GATECH-EIC / ViTCoD
View on GitHub
[HPCA 2023] ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design
☆133Jun 27, 2023Updated 3 years ago
CASR-HKU / MSD-FCCM23
View on GitHub
Open-source of MSD framework
☆16Sep 12, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
mit-han-lab / spatten
View on GitHub
[HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
☆136Aug 27, 2024Updated last year
Dazhuzhu-github / systolic-array
View on GitHub
verilog实现TPU中的脉动阵列计算卷积的module
☆174May 10, 2025Updated last year
pp-Innovate / FPGA-ZynqNet
View on GitHub
FPGA-based ZynqNet CNN accelerator developed by Vivado_HLS
☆116Jun 24, 2017Updated 9 years ago
jasonlin316 / Systolic-Array-for-Smith-Waterman
View on GitHub
This work implements a dynamic programming algorithm for performing local sequence alignment. Through parallelism, it can run 136X times …
☆28Jul 4, 2019Updated 7 years ago
doonny / PipeCNN
View on GitHub
An OpenCL-based FPGA Accelerator for Convolutional Neural Networks
☆1,384Feb 14, 2022Updated 4 years ago
yuyuranium / FPGA-Project-2022-simple-tpu
View on GitHub
Systolic array based simple TPU for CNN on PYNQ-Z2
☆50Jun 24, 2022Updated 4 years ago
CASR-HKU / AGNA-FCCM2023
View on GitHub
☆12Nov 24, 2023Updated 2 years ago
DPCEKY / systolic-array
View on GitHub
HLS implemented systolic array structure
☆41Nov 13, 2017Updated 8 years ago
GATECH-EIC / torchshiftadd
View on GitHub
An open-sourced PyTorch library for developing energy efficient multiplication-less models and applications.
☆14Feb 3, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
AIC2021 / AIC2021_TPU_Template
View on GitHub
Template for project1 TPU
☆23May 1, 2021Updated 5 years ago
ucb-bar / gemmini
View on GitHub
Berkeley's Spatial Array Generator
☆1,402Jun 30, 2026Updated 3 weeks ago
patryk-oleniuk / cnn_hw_accelerator
View on GitHub
FPGA accelerator and port of the emotion recognition CNN running in C on Xilinx ZYNQ
☆21Jun 3, 2019Updated 7 years ago
sjtu-zhao-lab / SALO
View on GitHub
An efficient spatial accelerator enabling hybrid sparse attention mechanisms for long sequences
☆32Mar 7, 2024Updated 2 years ago
ZhaoqxCN / PYNQ-CNN-ATTEMPT
View on GitHub
Some attempts to build CNN on PYNQ.
☆25Jun 28, 2019Updated 7 years ago
bonanyan / attentionlego
View on GitHub
Attentionlego
☆13Jan 24, 2024Updated 2 years ago
sharc-lab / Edge-MoE
View on GitHub
Edge-MoE: Memory-Efficient Multi-Task Vision Transformer Architecture with Task-level Sparsity via Mixture-of-Experts
☆140May 10, 2024Updated 2 years ago