SwarmArch / T4
Code released to accompany the ISCA paper: "T4: Compiling Sequential Code for Effective Speculative Parallelization in Hardware"
☆28 Updated 3 years ago
Alternatives and similar repositories for T4
Users who are interested in T4 are comparing it to the repositories listed below.
- Multi-target compiler for Sum-Product Networks, based on MLIR and LLVM. ☆23 Updated 7 months ago
- Polyhedral High-Level Synthesis in MLIR ☆33 Updated 2 years ago
- ☆14 Updated 5 years ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches ☆15 Updated 6 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs ☆30 Updated 9 months ago
- Creating beautiful gem5 simulations ☆49 Updated 4 years ago
- EQueue Dialect ☆40 Updated 3 years ago
- ordspecsim: The Swarm architecture simulator ☆24 Updated 2 years ago
- A PIM instrumentation, compilation, execution, simulation, and evaluation repository for BLIMP-style architectures. ☆18 Updated 3 years ago
- Tutorial Material from the SST Team ☆21 Updated last month
- ☆38 Updated 3 years ago
- ETHZ Heterogeneous Accelerated Compute Cluster. ☆36 Updated 3 months ago
- SST Architectural Simulation Components and Libraries ☆96 Updated last week
- HeteroCL-MLIR dialect for accelerator design ☆41 Updated 9 months ago
- The Splash-3 benchmark suite ☆44 Updated 2 years ago
- HeteroHalide: From Image Processing DSL to Efficient FPGA Acceleration ☆15 Updated 4 years ago
- This adds partial support of AVX2 and AVX-512 to gem5. ☆15 Updated last year
- Bridging polyhedral analysis tools to the MLIR framework ☆113 Updated last year
- SparseP is the first open-source Sparse Matrix Vector Multiplication (SpMV) software package for real-world Processing-In-Memory (PIM) ar… ☆76 Updated 3 years ago
- Artifact, reproducibility, and testing utilities for gem5 ☆22 Updated 4 years ago
- A unified programming framework for high and portable performance across FPGAs and GPUs ☆11 Updated 3 months ago
- ☆21 Updated 4 months ago
- The Chronos FPGA Framework to accelerate ordered applications ☆22 Updated 5 years ago
- ☆30 Updated 2 years ago
- ☆17 Updated 3 years ago
- ☆24 Updated 4 years ago
- Productive and portable performance programming across spatial architectures (FPGAs, etc.) and vector architectures (GPUs, etc.) ☆31 Updated last year
- CUDAAdvisor: a GPU profiling tool ☆49 Updated 6 years ago
- A novel spatial accelerator for horizontal diffusion weather stencil computation, as described in ICS 2023 paper by Singh et al. (https:/… ☆20 Updated last year
- ☆64 Updated 6 years ago