bsc-loca/sauria

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bsc-loca/sauria)

bsc-loca / sauria

SAURIA (Systolic-Array tensor Unit for aRtificial Intelligence Acceleration) is an open-source Convolutional Neural Network accelerator based on a GeMM systolic array engine.

☆109

Alternatives and similar repositories for sauria

Users that are interested in sauria are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

pulp-platform / redmule
View on GitHub
GEMM and GEMM-Ops accelerator for PULP systems
☆111Updated this week
skudlur / pes_sysarray
View on GitHub
Systolic Array implementation for ASIC Course
☆15Nov 26, 2023Updated 2 years ago
bsc-loca / sargantana
View on GitHub
☆154Jun 8, 2026Updated last month
abdelazeem201 / Systolic-array-implementation-in-RTL-for-TPU
View on GitHub
IC implementation of Systolic Array for TPU
☆367Oct 21, 2024Updated last year
yuyuranium / FPGA-Project-2022-simple-tpu
View on GitHub
Systolic array based simple TPU for CNN on PYNQ-Z2
☆51Jun 24, 2022Updated 4 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
pulp-platform / iDMA
View on GitHub
A modular, parametrizable, and highly flexible Data Movement Accelerator (DMA)
☆225Updated this week
pulp-platform / ITA
View on GitHub
☆80Apr 22, 2025Updated last year
YqGe585 / Neural-Processing-Unit-on-FPGA
View on GitHub
Superscalar Out-of-Order NPU Design on FPGA
☆16May 17, 2024Updated 2 years ago
merledu / magma-si
View on GitHub
Matrix Accelerator Generator for GeMM Operations based on SIGMA Architecture in CHISEL HDL
☆15Mar 21, 2024Updated 2 years ago
thousrm / universal_NPU-CNN_accelerator
View on GitHub
hardware design of universal NPU(CNN accelerator) for various convolution neural network
☆179Mar 5, 2025Updated last year
lllibano / SystolicArray
View on GitHub
A parametric RTL code generator of an efficient integer MxM Systolic Array implementation for Xilinx FPGAs.
☆37Aug 28, 2025Updated 11 months ago
zlagpacan / LOROF
View on GitHub
Linux on RISC-V on FPGA (LOROF): RV64GC Sv39 Quad-Core Superscalar Out-of-Order Virtual Memory CPU
☆18Updated this week
pulp-platform / hwpe-stream
View on GitHub
IPs for data-plane integration of Hardware Processing Engines (HWPEs) within a PULP system
☆21Updated this week
intel / fpga-npu
View on GitHub
☆261Apr 8, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
shrutiprakashgupta / RISCV_Formal_Verification
View on GitHub
Formal Verification of RISC V IM Processor
☆11Mar 27, 2022Updated 4 years ago
harishsg993010 / tiny-NPU
View on GitHub
opensource NPU for LLM inference (this run gpt2)
☆216Feb 16, 2026Updated 5 months ago
maeri-project / MAERI_bsv
View on GitHub
MAERI: A DNN accelerator with reconfigurable interconnects to support flexible dataflow (http://synergy.ece.gatech.edu/tools/maeri/)
☆67Sep 24, 2021Updated 4 years ago
PSAL-POSTECH / ONNXim
View on GitHub
ONNXim is a fast cycle-level simulator that can model multi-core NPUs for DNN inference
☆209Jan 8, 2026Updated 6 months ago
x-heep / x-heep
View on GitHub
eXtensible Heterogeneous Energy-Efficient Platform based on RISC-V
☆288Updated this week
KULeuven-MICAS / snax_cluster
View on GitHub
A heterogeneous accelerator-centric compute cluster
☆49Updated this week
nikhiledm97 / TheGEMMCoreProject
View on GitHub
SystemVerilog Implementations of CUDA/TensorCore/TPU GEMM Operations
☆22Apr 12, 2026Updated 3 months ago
pulp-platform / neureka
View on GitHub
2-8bit weights, 8-bit activations flexible Neural Processing Engine for PULP clusters
☆34May 20, 2026Updated 2 months ago
kagandikmen / TPU.sv
View on GitHub
Anatomy of a powerhouse: SystemVerilog TPU based on Google TPU v1
☆23Nov 9, 2025Updated 8 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
yonseicasl / NPUsim
View on GitHub
NPUsim: Full-Model, Cycle-Level, and Value-Aware Simulator for DNN Accelerators
☆55Jan 2, 2025Updated last year
vsuresh95 / gemm_accelerator_stratus
View on GitHub
A hardware accelerator for General Matrix Multiply, developed in SystemC using ESP.
☆20May 26, 2021Updated 5 years ago
ic-lab-duth / DRIM-S
View on GitHub
DUTH RISC-V Superscalar Microprocessor
☆35Oct 23, 2024Updated last year
ucb-bar / gemmini
View on GitHub
Berkeley's Spatial Array Generator
☆1,406Jun 30, 2026Updated 3 weeks ago
scalesim-project / SCALE-Sim
View on GitHub
Repository to host and maintain SCALE-Sim code
☆502Jun 28, 2026Updated last month
KULeuven-MICAS / zigzag-llm
View on GitHub
Model LLM inference on single-core dataflow accelerators
☆19Dec 16, 2025Updated 7 months ago
pulp-platform / snitch_cluster
View on GitHub
An energy-efficient RISC-V floating-point compute cluster.
☆141Updated this week
google-coral / coralnpu
View on GitHub
A machine learning accelerator core designed for energy-efficient AI at the edge.
☆2,485Updated this week
taichi-ishitani / rice
View on GitHub
☆22Sep 26, 2025Updated 10 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Dazhuzhu-github / systolic-array
View on GitHub
verilog实现TPU中的脉动阵列计算卷积的module
☆174May 10, 2025Updated last year
UofT-HPRC / Tbps_CRC
View on GitHub
A SytemVerilog implementation of Cyclic Redundancy Check runs at up to Terabits per second
☆20Oct 23, 2023Updated 2 years ago
pulp-platform / rbe
View on GitHub
Reconfigurable Binary Engine
☆18Mar 23, 2021Updated 5 years ago
pulp-platform / common_cells
View on GitHub
Common SystemVerilog components
☆772Updated this week
freecores / theia_gpu
View on GitHub
Theia: ray graphic processing unit
☆20Jul 17, 2014Updated 12 years ago
yhinai / TensorGPGPU
View on GitHub
RISC-V vector and tensor compute extensions for Vortex GPGPU acceleration for ML workloads. Optimized for transformer models, CNNs, and g…
☆25Apr 25, 2025Updated last year
yuezuegu / sosa-compiler
View on GitHub
Repository for compilation and cycle-accurate simulator for scale-out systolic arrays
☆16Jan 4, 2023Updated 3 years ago