SwarmArch / T4Links
Code released to accompany the ISCA paper: "T4: Compiling Sequential Code for Effective Speculative Parallelization in Hardware"
☆28Updated 3 years ago
Alternatives and similar repositories for T4
Users that are interested in T4 are comparing it to the libraries listed below
Sorting:
- ☆15Updated 5 years ago
- Polyhedral High-Level Synthesis in MLIR☆34Updated 2 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆30Updated last year
- Multi-target compiler for Sum-Product Networks, based on MLIR and LLVM.☆24Updated 10 months ago
- ☆31Updated 2 years ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆15Updated 6 years ago
- A PIM instrumentation, compilation, execution, simulation, and evaluation repository for BLIMP-style architectures.☆18Updated 3 years ago
- ETHZ Heterogeneous Accelerated Compute Cluster.☆38Updated 2 weeks ago
- ☆38Updated 3 years ago
- Creating beautiful gem5 simulations☆49Updated 4 years ago
- HeteroHalide: From Image Processing DSL to Efficient FPGA Acceleration☆15Updated 5 years ago
- SST Architectural Simulation Components and Libraries☆103Updated 2 weeks ago
- ordspecsim: The Swarm architecture simulator☆24Updated 2 years ago
- This adds partial support of AVX2 and AVX-512 to gem5.☆15Updated last year
- EQueue Dialect☆39Updated 3 years ago
- ☆18Updated 4 months ago
- A retargetable and extensible synthesis-based compiler for modern hardware architectures☆13Updated 5 months ago
- SparseP is the first open-source Sparse Matrix Vector Multiplication (SpMV) software package for real-world Processing-In-Memory (PIM) ar…☆77Updated 3 years ago
- Bridging polyhedral analysis tools to the MLIR framework☆117Updated 2 years ago
- The Splash-3 benchmark suite☆44Updated 2 years ago
- ☆27Updated 5 years ago
- CUDAAdvisor: a GPU profiling tool☆51Updated 7 years ago
- HeteroCL-MLIR dialect for accelerator design☆41Updated last year
- ☆16Updated last week
- An MLIR-based compiler from C/C++ to AMD-Xilinx Versal AIE☆16Updated 3 years ago
- A unified programming framework for high and portable performance across FPGAs and GPUs☆11Updated 7 months ago
- The Chronos FPGA Framework to accelerate ordered applications☆22Updated 5 years ago
- ☆22Updated 8 months ago
- ☆18Updated 3 years ago
- Tutorial Material from the SST Team☆23Updated 2 months ago