nadavrot / pgo_ml
Source code for the paper "Profile Guided Optimization without Profiles: A Machine Learning Approach"
☆23Updated 3 years ago
Alternatives and similar repositories for pgo_ml:
Users that are interested in pgo_ml are comparing it to the libraries listed below
- ☆28Updated 2 years ago
- Collaborative Parallelization Framework (CPF)☆32Updated last year
- compiling DSLs to high-level hardware instructions☆22Updated 2 years ago
- A translation validation framework for MLIR☆81Updated last week
- Tutorial for LLVM Dev Conference 2019.☆15Updated 5 years ago
- Benchmarks for auto-vectorization and revectorization, including both hand-vectorized and scalar code☆28Updated 6 years ago
- A Speculation-Aware Collaborative Dependence Analysis Framework☆28Updated 8 months ago
- Bridging polyhedral analysis tools to the MLIR framework☆109Updated last year
- outline and links for PLDI 2022 tutorial☆17Updated 2 years ago
- A fast and accurate reuse distance analyzer for multi-threaded applications. It leverages existing hardware features in commodity CPUs.☆16Updated 2 years ago
- Data-Centric MLIR dialect☆40Updated last year
- Website for CS 265☆28Updated 3 months ago
- CUDAAdvisor: a GPU profiling tool☆48Updated 6 years ago
- A enumerator for MLIR, relying on the information given by IRDL.☆19Updated last week
- Ocolos is the first online code layout optimization system for unmodified applications written in unmanaged languages.☆52Updated last year
- The missing pieces (as far as boilerplate reduction goes) of the upstream MLIR python bindings.☆84Updated this week
- Updated C version of the Test Suite for Vectorising Compilers☆56Updated last year
- Artifact Evaluation Reproduction for "Software Prefetching for Indirect Memory Accesses", CGO 2017, using CK.☆38Updated 3 years ago
- Polyhedral Extraction Tool (source repository: http://repo.or.cz/w/pet.git)☆39Updated 2 years ago
- UB-aware interpreter for LLVM debugging☆26Updated this week
- Haystack is an analytical cache model that given a program computes the number of cache misses.☆46Updated 5 years ago
- A compiler to automatically transform applications into disaggregated memory apps.☆16Updated last year
- TPP experimentation on MLIR for linear algebra☆121Updated last week
- Library to plot integer sets and maps☆49Updated 8 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆28Updated 6 months ago
- Interprocedural Basic Block Code Layout Optimization☆18Updated 6 years ago
- A GPU FP32 computation method with Tensor Cores.☆20Updated 2 years ago
- Code released to accompany the ISCA paper: "T4: Compiling Sequential Code for Effective Speculative Parallelization in Hardware"☆28Updated 3 years ago
- HeteroCL-MLIR dialect for accelerator design☆41Updated 6 months ago
- ☆53Updated 5 years ago