SwarmArch / T4Links
Code released to accompany the ISCA paper: "T4: Compiling Sequential Code for Effective Speculative Parallelization in Hardware"
☆28Updated 3 years ago
Alternatives and similar repositories for T4
Users that are interested in T4 are comparing it to the libraries listed below
Sorting:
- ☆14Updated 5 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆30Updated 8 months ago
- Multi-target compiler for Sum-Product Networks, based on MLIR and LLVM.☆23Updated 6 months ago
- Polyhedral High-Level Synthesis in MLIR☆31Updated 2 years ago
- ☆29Updated 2 years ago
- ordspecsim: The Swarm architecture simulator☆24Updated 2 years ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆15Updated 5 years ago
- HeteroGen: transpiling C to heterogeneous HLS code with automated test generation and program repair (ASPLOS 2022)☆17Updated 8 months ago
- ☆35Updated 3 years ago
- A Speculation-Aware Collaborative Dependence Analysis Framework☆28Updated 11 months ago
- Productive and portable performance programming across spatial architectures (FPGAs, etc.) and vector architectures (GPUs, etc.)☆31Updated last year
- This adds partial support of AVX2 and AVX-512 to gem5.☆15Updated last year
- HeteroCL-MLIR dialect for accelerator design☆40Updated 8 months ago
- A unified programming framework for high and portable performance across FPGAs and GPUs☆11Updated 2 months ago
- A PIM instrumentation, compilation, execution, simulation, and evaluation repository for BLIMP-style architectures.☆18Updated 3 years ago
- The Splash-3 benchmark suite☆44Updated 2 years ago
- compiling DSLs to high-level hardware instructions☆22Updated 2 years ago
- HeteroHalide: From Image Processing DSL to Efficient FPGA Acceleration☆15Updated 4 years ago
- Creating beautiful gem5 simulations☆49Updated 4 years ago
- CUDAAdvisor: a GPU profiling tool☆49Updated 6 years ago
- EQueue Dialect☆40Updated 3 years ago
- Haystack is an analytical cache model that given a program computes the number of cache misses.☆46Updated 5 years ago
- agile hardware-software co-design☆47Updated 3 years ago
- Bridging polyhedral analysis tools to the MLIR framework☆111Updated last year
- ☆41Updated 2 weeks ago
- Tutorial Material from the SST Team☆19Updated last year
- ☆17Updated 3 years ago
- ETHZ Heterogeneous Accelerated Compute Cluster.☆34Updated 2 months ago
- ☆21Updated 3 months ago
- ☆25Updated last year