SwarmArch / T4Links
Code released to accompany the ISCA paper: "T4: Compiling Sequential Code for Effective Speculative Parallelization in Hardware"
☆29Updated 3 years ago
Alternatives and similar repositories for T4
Users that are interested in T4 are comparing it to the libraries listed below
Sorting:
- Polyhedral High-Level Synthesis in MLIR☆33Updated 2 years ago
- Multi-target compiler for Sum-Product Networks, based on MLIR and LLVM.☆24Updated 8 months ago
- HeteroHalide: From Image Processing DSL to Efficient FPGA Acceleration☆15Updated 4 years ago
- A unified programming framework for high and portable performance across FPGAs and GPUs☆11Updated 4 months ago
- ☆38Updated 3 years ago
- A PIM instrumentation, compilation, execution, simulation, and evaluation repository for BLIMP-style architectures.☆18Updated 3 years ago
- EQueue Dialect☆40Updated 3 years ago
- SST Architectural Simulation Components and Libraries☆96Updated this week
- ☆14Updated 5 years ago
- ordspecsim: The Swarm architecture simulator☆25Updated 2 years ago
- HeteroCL-MLIR dialect for accelerator design☆41Updated 10 months ago
- Bridging polyhedral analysis tools to the MLIR framework☆116Updated last year
- ETHZ Heterogeneous Accelerated Compute Cluster.☆36Updated 4 months ago
- ☆31Updated 3 years ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆15Updated 6 years ago
- ☆40Updated 2 weeks ago
- SparseP is the first open-source Sparse Matrix Vector Multiplication (SpMV) software package for real-world Processing-In-Memory (PIM) ar…☆75Updated 3 years ago
- ☆21Updated 5 months ago
- This adds partial support of AVX2 and AVX-512 to gem5.☆15Updated last year
- Tutorial Material from the SST Team☆21Updated 2 months ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆30Updated 10 months ago
- Creating beautiful gem5 simulations☆49Updated 4 years ago
- ☆30Updated 2 years ago
- ☆18Updated last month
- Haystack is an analytical cache model that given a program computes the number of cache misses.☆46Updated 6 years ago
- PIM-ML is a benchmark for training machine learning algorithms on the UPMEM architecture, which is the first publicly-available real-worl…☆24Updated 6 months ago
- Domain-Specific Architecture Generator 2☆21Updated 2 years ago
- Data-Centric MLIR dialect☆42Updated last year
- ☆24Updated 4 years ago
- Public Release of Stream-Dataflow☆14Updated 6 years ago