Zhao-Dongyu / sgemm_riscv
This project records the process of optimizing SGEMM (single-precision floating-point General Matrix Multiplication) on the RISC-V platform.
☆22 Updated 8 months ago
Alternatives and similar repositories for sgemm_riscv
Users who are interested in sgemm_riscv are comparing it to the libraries listed below.
- ARIES: An Agile MLIR-Based Compilation Flow for Reconfigurable Devices with AI Engines (FPGA 2025 Best Paper Nominee) ☆37 Updated last week
- ☆37 Updated last year
- Ventus GPGPU ISA Simulator Based on Spike ☆45 Updated last month
- A collection of RISC-V Vector (RVV) benchmarks to help developers write portably performant RVV code ☆120 Updated last month
- Cycle-accurate C++ & SystemC simulator for the RISC-V GPGPU Ventus ☆28 Updated last month
- ☆104 Updated this week
- LLVM OpenCL C compiler suite for ventus GPGPU ☆52 Updated 3 weeks ago
- RISC-V Matrix Specification ☆22 Updated 8 months ago
- FlexGripPlus: an open-source GPU model for reliability evaluation and micro architectural simulation ☆108 Updated 2 years ago
- Cluster-level matrix unit integration into GPUs, implemented in Chipyard SoC ☆36 Updated 2 months ago
- RiVEC Benchmark Suite ☆120 Updated 8 months ago
- PiDRAM is the first flexible end-to-end framework that enables system integration studies and evaluation of real Processing-using-Memory … ☆69 Updated last year
- Chisel RISC-V Vector 1.0 Implementation ☆108 Updated 3 months ago
- Lab assignments for the Agile Hardware Design course ☆16 Updated 2 months ago
- ☆26 Updated this week
- Fork of upstream onnxruntime focused on supporting risc-v accelerators ☆87 Updated 2 years ago
- ☆60 Updated this week
- RISC-V Integrated Matrix Development Repository ☆16 Updated 10 months ago
- An LLVM pass that can generate CDFG and map the target loops onto a parameterizable CGRA. ☆74 Updated last week
- Benchmarks / test cases accompanying the PLCT Lab rvv-llvm implementation ☆22 Updated 4 years ago
- Heterogeneous Research Platform (HERO) for exploration of heterogeneous computers consisting of programmable many-core accelerators and a… ☆111 Updated last year
- ☆53 Updated 2 weeks ago
- Example for running IREE in a bare-metal Arm environment. ☆38 Updated 3 weeks ago
- Some sample caffemodel, prototxt, test images, and precompiled loadables. ☆13 Updated 4 years ago
- ☆33 Updated 4 months ago
- NPUsim: Full-Model, Cycle-Level, and Value-Aware Simulator for DNN Accelerators ☆39 Updated 7 months ago
- Alveo Collective Communication Library: MPI-like communication operations for Xilinx Alveo accelerators ☆97 Updated last month
- Fast and accurate DRAM power and energy estimation tool ☆173 Updated last week
- ☆71 Updated 10 months ago
- ☆92 Updated last year