pku-liang / hestia
☆14Updated last month
Alternatives and similar repositories for hestia:
Users that are interested in hestia are comparing it to the libraries listed below
- ☆40Updated 3 months ago
- ArchExplorer: Microarchitecture Exploration Via Bottleneck Analysis☆31Updated last year
- The Next-gen Language & Compiler Powering Efficient Hardware Design☆27Updated 3 months ago
- Artifact evaluation of PLDI'24 paper "Allo: A Programming Model for Composable Accelerator Design"☆23Updated last year
- ☆27Updated 6 months ago
- A hardware synthesis framework with multi-level paradigm☆38Updated 3 months ago
- Domain-Specific Architecture Generator 2☆21Updated 2 years ago
- ☆50Updated last month
- An MLIR dialect to enable the efficient acceleration of ML model on CGRAs.☆58Updated 6 months ago
- HyFiSS: A Hybrid Fidelity Stall-Aware Simulator for GPGPUs☆30Updated 4 months ago
- ☆25Updated last year
- STONNE Simulator integrated into SST Simulator☆19Updated last year
- CGRA framework with vectorization support.☆29Updated 2 weeks ago
- A binary instrumentation tool to analyze load instructions in any off-the-shelf x86(-64) program. Described by Bera et al. in https://arx…☆20Updated 10 months ago
- EQueue Dialect☆40Updated 3 years ago
- ☆16Updated 3 years ago
- ☆30Updated last month
- Xiangshan deterministic workloads generator☆18Updated 2 months ago
- agile hardware-software co-design☆46Updated 3 years ago
- ☆14Updated 2 years ago
- CIRCT-based HLS compilation flows, debugging, and cosimulation tools.☆49Updated last year
- NPUsim: Full-Model, Cycle-Level, and Value-Aware Simulator for DNN Accelerators☆34Updated 4 months ago
- ☆19Updated last year
- A scheduler for spatial DNN accelerators that generate high-performance schedules in one shot using mixed integer programming (MIP)☆79Updated last year
- Cluster-level matrix unit integration into GPUs, implemented in Chipyard SoC☆26Updated last month
- About the source code of "Merging Similar Patterns for Hardware Prefetching" paper, which is accepted in MICRO 2022.☆14Updated 2 years ago
- ☆18Updated 2 months ago
- ☆11Updated 2 weeks ago
- Transactional Verilog design and Verilator Testbench for a RISC-V TensorCore Vector co-processor for reproducible linear algebra☆53Updated 3 years ago
- An LLVM pass that can generate CDFG and map the target loops onto a parameterizable CGRA.☆68Updated last week