DeepWok / maseLinks

Machine-Learning Accelerator System Exploration Tools

☆183

Alternatives and similar repositories for mase

Users that are interested in mase are comparing it to the libraries listed below

Sorting:

kachris / survey_HA_LLM
A survey on Hardware Accelerated LLMs
☆60Updated 10 months ago
actlab-genesys / GeneSys
An open-source parameterizable NPU generator with full-stack multi-target compilation stack for intelligent workloads.
☆69Updated 2 months ago
MartaAndronic / NeuraLUT
NeuraLUT-Assemble
☆43Updated 3 months ago
KULeuven-MICAS / stream
Multi-core HW accelerator mapping optimization framework for layer-fused ML workloads.
☆64Updated 5 months ago
arc-research-lab / CHARM
CHARM: Composing Heterogeneous Accelerators on Heterogeneous SoC Architecture
☆163Updated this week
KULeuven-MICAS / zigzag
HW Architecture-Mapping Design Space Exploration Framework for Deep Learning Accelerators
☆168Updated last month
PSAL-POSTECH / ONNXim
ONNXim is a fast cycle-level simulator that can model multi-core NPUs for DNN inference
☆171Updated last week
cornell-zhang / allo
Allo Accelerator Design and Programming Framework
☆306Updated this week
makslevental / openhls
PyTorch model to RTL flow for low latency inference
☆130Updated last year
HLSTransform / submission
☆113Updated last year
mit-emze / cimloop
☆74Updated 2 months ago
UIUC-ChenLab / ScaleHLS-HIDA
☆61Updated 8 months ago
georgia-tech-synergy-lab / SIGMA
RTL implementation of Flex-DPE.
☆115Updated 5 years ago
maeri-project / FEATHER
A Reconfigurable Accelerator with Data Reordering Support for Low-Cost On-Chip Dataflow Switching
☆71Updated last month
ebby-s / MX-for-FPGA
Implementation of Microscaling data formats in SystemVerilog.
☆28Updated 5 months ago
hngenc / systolic-array
A DSL for Systolic Arrays
☆82Updated 6 years ago
KastnerRG / cgra4ml
An Open Workflow to Build Custom SoCs and run Deep Models at the Edge
☆97Updated this week
UCLA-VAST / Stream-HLS
An MLIR Complier for PyTorch/C/C++ Codes into HLS Dataflow Designs
☆54Updated 4 months ago
pulp-platform / Deeploy
DNN Compiler for Heterogeneous SoCs
☆55Updated 2 weeks ago
cornell-zhang / HiSparse
High-Performance Sparse Linear Algebra on HBM-Equipped FPGAs Using HLS
☆95Updated last year
arc-research-lab / SSR
SSR: Spatial Sequential Hybrid Architecture for Latency Throughput Tradeoff in Transformer Acceleration (Full Paper Accepted in FPGA'24)
☆35Updated this week
UCLA-VAST / AutoSA
AutoSA: Polyhedral-Based Systolic Array Compiler
☆230Updated 3 years ago
Accelergy-Project / accelergy-timeloop-infrastructure
Linux docker for the DNN accelerator exploration infrastructure composed of Accelergy and Timeloop
☆60Updated last month
VCA-EPFL / FSA
FSA: Fusing FlashAttention within a Single Systolic Array
☆68Updated 3 months ago
sharc-lab / LightningSim
A fast, accurate trace-based simulator for High-Level Synthesis.
☆72Updated 8 months ago
linghaosong / Sextans
An FPGA accelerator for general-purpose Sparse-Matrix Dense-Matrix Multiplication (SpMM).
☆91Updated last year
Xilinx / logicnets
Train and deploy LUT-based neural networks on FPGAs
☆102Updated last year
VeriGOOD-ML / public
☆64Updated 7 months ago
MartaAndronic / PolyLUT
PolyLUT is the first quantized neural network training methodology that maps a neuron to a LUT while using multivariate polynomial functi…
☆54Updated last year
tancheng / mlir-cgra
An MLIR dialect to enable the efficient acceleration of ML model on CGRAs.
☆64Updated last year