kazutomo / Chisel-MatMul
☆18 · Updated last week
Alternatives and similar repositories for Chisel-MatMul
Users who are interested in Chisel-MatMul are comparing it to the repositories listed below.
- Fork of upstream onnxruntime focused on supporting RISC-V accelerators ☆88 · Updated 2 years ago
- An LLVM pass that can generate a CDFG and map the target loops onto a parameterizable CGRA. ☆79 · Updated last month
- An MLIR dialect to enable the efficient acceleration of ML models on CGRAs. ☆65 · Updated last year
- PyTorch model to RTL flow for low latency inference ☆131 · Updated last year
- AutoSA: Polyhedral-Based Systolic Array Compiler ☆237 · Updated 3 years ago
- Release of stream-specialization software/hardware stack. ☆121 · Updated 2 years ago
- A DSL for Systolic Arrays ☆83 · Updated 7 years ago
- Cluster-level matrix unit integration into GPUs, implemented in Chipyard SoC ☆46 · Updated 3 weeks ago
- ☆36 · Updated 4 years ago
- FlexGripPlus: an open-source GPU model for reliability evaluation and micro architectural simulation ☆118 · Updated 2 years ago
- A GPU acceleration flow for RTL simulation with batch stimulus ☆117 · Updated last year
- Ventus GPGPU ISA Simulator Based on Spike ☆48 · Updated last month
- ☆62 · Updated 10 months ago
- CIRCT-based HLS compilation flows, debugging, and cosimulation tools. ☆53 · Updated 2 years ago
- Hands-on experience programming AI Engines using Vitis Unified Software Platform ☆40 · Updated last year
- ☆11 · Updated 3 years ago
- RISC-V Matrix Specification ☆23 · Updated last year
- ☆109 · Updated last year
- A matrix extension proposal for AI applications under RISC-V architecture ☆161 · Updated 11 months ago
- ☆90 · Updated this week
- A scheduler for spatial DNN accelerators that generates high-performance schedules in one shot using mixed integer programming (MIP) ☆85 · Updated 2 years ago
- PiDRAM is the first flexible end-to-end framework that enables system integration studies and evaluation of real Processing-using-Memory … ☆70 · Updated 2 years ago
- ☆62 · Updated this week
- Implementations of Buffets, which are efficient, composable idioms for implementing Explicit Decoupled Data Orchestration. ☆82 · Updated 6 years ago
- Full support for the 32-bit RISC-V Vector Extension in LLVM and Clang ☆44 · Updated 5 years ago
- A simple MIPS-like CPU demo in C++ for Xilinx Vivado HLS ☆18 · Updated 6 years ago
- [FPGA 2022, Best Paper Award] Parallel placement and routing of Vivado HLS dataflow designs. ☆128 · Updated 3 years ago
- ☆52 · Updated last year
- Linux docker for the DNN accelerator exploration infrastructure composed of Accelergy and Timeloop ☆62 · Updated 3 months ago
- A speculative mechanism to accelerate long-latency off-chip load requests by removing on-chip cache access latency from their critical pa… ☆76 · Updated 5 months ago