twaclaw / matmult
A floating-point matrix multiplication implemented in hardware
☆31 Updated 4 years ago
Alternatives and similar repositories for matmult:
Users who are interested in matmult are comparing it to the repositories listed below:
- High-Performance Sparse Linear Algebra on HBM-Equipped FPGAs Using HLS ☆84 Updated 3 months ago
- ☆33 Updated last week
- A Spatial Accelerator Generation Framework for Tensor Algebra. ☆54 Updated 3 years ago
- Provides the hardware code for the paper "EBPC: Extended Bit-Plane Compression for Deep Neural Network Inference and Training Accelerator… ☆24 Updated 4 years ago
- ☆56 Updated 4 years ago
- ☆27 Updated 5 years ago
- FlexASR: A Reconfigurable Hardware Accelerator for Attention-based Seq-to-Seq Networks ☆42 Updated 2 years ago
- [TECS'23] A project on the co-design of Accelerators and CNNs. ☆19 Updated 2 years ago
- SAURIA (Systolic-Array tensor Unit for aRtificial Intelligence Acceleration) is an open-source Convolutional Neural Network accelerator b… ☆26 Updated 3 months ago
- Systolic-array based Deep Learning Accelerator generator ☆25 Updated 4 years ago
- Cycle-accurate Network-on-Chip Simulator ☆25 Updated last year
- TensorCore Vector Processor for Deep Learning - Google Summer of Code Project ☆21 Updated 3 years ago
- ☆33 Updated 3 years ago
- CNN accelerator ☆27 Updated 7 years ago
- Fast, Accurate and Convenient Light-Weight HLS Framework for Academic Design Space Exploration and Evaluation. (LLVM-11) ☆58 Updated 2 years ago
- An FPGA accelerator for general-purpose Sparse-Matrix Dense-Matrix Multiplication (SpMM). ☆75 Updated 5 months ago
- HLS for Networks-on-Chip ☆32 Updated 3 years ago
- A DSL for Systolic Arrays ☆78 Updated 6 years ago
- course design ☆22 Updated 6 years ago
- IC implementation of TPU ☆92 Updated 5 years ago
- Introductory examples for using PYNQ with Alveo ☆49 Updated last year
- ☆52 Updated 11 months ago
- Contains FPGA benchmarks for Vivado HLS and Catapult HLS ☆26 Updated 4 years ago
- Fork of seldridge/rocket-rocc-examples with tests for a systolic array based matmul accelerator ☆54 Updated last month
- Tutorials on HLS Design ☆51 Updated 5 years ago
- An HLS-based Winograd systolic CNN accelerator ☆49 Updated 3 years ago
- ☆15 Updated 3 years ago
- ☆29 Updated last month
- The Verilog source code for DRUM approximate multiplier. ☆29 Updated last year
- FleetRec: Large-Scale Recommendation Inference on Hybrid GPU-FPGA Clusters ☆16 Updated 3 years ago