brigio345 / DaCH
DaCH: dataflow cache for high-level synthesis.
☆14Updated last year
Related projects: ⓘ
- eyeriss-chisel3☆35Updated 2 years ago
- Implementations of Buffets, which are efficient, composable idioms for implementing Explicit Decoupled Data Orchestration.☆62Updated 5 years ago
- High-Performance Sparse Linear Algebra on HBM-Equipped FPGAs Using HLS☆77Updated last month
- [FPGA'21] Microbenchmarks for Demystifying the Memory System of Modern Datacenter FPGAs for Software Programmers☆28Updated 2 years ago
- ☆65Updated last year
- An Automated Framework for Generic Graph Neural Network Accelerator Generation, Simulation, and Optimization☆18Updated 9 months ago
- A systolic array simulator for multi-cycle MACs and varying-byte words, with the paper accepted to HPCA 2022.☆60Updated 2 years ago
- MAERI: A DNN accelerator with reconfigurable interconnects to support flexible dataflow (http://synergy.ece.gatech.edu/tools/maeri/)☆56Updated 2 years ago
- 16-bit Adder Multiplier hardware on Digilent Basys 3☆62Updated last year
- ☆23Updated 3 years ago
- view at https://xupsh.github.io/ccc2021/☆24Updated 2 years ago
- A DSL for Systolic Arrays☆73Updated 5 years ago
- An integrated CGRA design framework☆82Updated 9 months ago
- An FPGA accelerator for general-purpose Sparse-Matrix Dense-Matrix Multiplication (SpMM).☆62Updated last month
- This is a general-purpose simulator for unary computing based on PyTorch, with the paper accepted to ISCA 2020 and awarded IEEE Micro Top…☆39Updated last year
- ☆28Updated 2 weeks ago
- Systolic matrix multiplication kernel implemented on Xilinx PYNQ FPGA board☆9Updated 4 years ago
- ☆58Updated 5 years ago
- A Reconfigurable Accelerator with Data Reordering Support for Low-Cost On-Chip Dataflow Switching☆25Updated last month
- 32 - bit floating point Multiplier Accumulator Unit (MAC)☆21Updated 3 years ago
- Systolic array implementations for Cholesky, LU, and QR decomposition☆38Updated 5 years ago
- An HLS based winograd systolic CNN accelerator☆46Updated 3 years ago
- CHARM: Composing Heterogeneous Accelerators on Versal ACAP Architecture☆119Updated last month
- The RAD flow is an open-source academic architecture exploration and evaluation flow for novel beyond-FPGA reconfigurable acceleration de…☆25Updated last week
- An LLVM pass that can generate CDFG and map the target loops onto a parameterizable CGRA.☆53Updated this week
- dMazeRunner: Dataflow acceleration optimization infrastructure for coarse-grained programmable accelerators☆44Updated 2 years ago
- This is a verilog implementation of 4x4 systolic array multiplier☆29Updated 3 years ago
- ☆63Updated 9 years ago
- ☆47Updated 8 months ago
- Template-based Reconfigurable Architecture Modeling Framework☆13Updated 2 years ago