SwarmArch / T4
Code released to accompany the ISCA paper: "T4: Compiling Sequential Code for Effective Speculative Parallelization in Hardware"
☆28 · Updated 3 years ago
Alternatives and similar repositories for T4:
Users who are interested in T4 are comparing it to the repositories listed below:
- ☆13 · Updated 4 years ago
- ☆28 · Updated 2 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs ☆28 · Updated 5 months ago
- Polyhedral High-Level Synthesis in MLIR ☆30 · Updated last year
- A PIM instrumentation, compilation, execution, simulation, and evaluation repository for BLIMP-style architectures. ☆17 · Updated 2 years ago
- ordspecsim: The Swarm architecture simulator ☆24 · Updated 2 years ago
- The Splash-3 benchmark suite ☆42 · Updated last year
- ☆33 · Updated 2 years ago
- A Speculation-Aware Collaborative Dependence Analysis Framework ☆28 · Updated 7 months ago
- HeteroHalide: From Image Processing DSL to Efficient FPGA Acceleration ☆15 · Updated 4 years ago
- This adds partial support of AVX2 and AVX-512 to gem5. ☆13 · Updated last year
- Creating beautiful gem5 simulations ☆47 · Updated 3 years ago
- Multi-target compiler for Sum-Product Networks, based on MLIR and LLVM. ☆23 · Updated 2 months ago
- ETHZ Heterogeneous Accelerated Compute Cluster. ☆31 · Updated this week
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches ☆15 · Updated 5 years ago
- CUDAAdvisor: a GPU profiling tool ☆48 · Updated 6 years ago
- ☆13 · Updated 3 years ago
- EQueue Dialect ☆40 · Updated 3 years ago
- HeteroCL-MLIR dialect for accelerator design ☆41 · Updated 5 months ago
- Race detector for NVIDIA GPUs, published in SOSP 2021. ☆19 · Updated 8 months ago
- agile hardware-software co-design ☆47 · Updated 3 years ago
- compiling DSLs to high-level hardware instructions ☆22 · Updated 2 years ago
- ☆23 · Updated 4 years ago
- ☆30 · Updated 2 years ago
- Bridging polyhedral analysis tools to the MLIR framework ☆107 · Updated last year
- The Chronos FPGA Framework to accelerate ordered applications ☆22 · Updated 4 years ago
- Languages, Tools, and Techniques for Accelerator Design ☆33 · Updated 3 years ago
- Productive and portable performance programming across spatial architectures (FPGAs, etc.) and vector architectures (GPUs, etc.) ☆31 · Updated 9 months ago
- ☆24 · Updated last year
- A GPU FP32 computation method with Tensor Cores. ☆20 · Updated 2 years ago