ithemal / Ithemal
Instruction THroughput Estimator using MAchine Learning (ITHEMAL)
☆146Updated 3 years ago
Alternatives and similar repositories for Ithemal:
Users that are interested in Ithemal are comparing it to the libraries listed below
- A framework that helps implementing swizzle GPU kernels☆41Updated 5 years ago
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.☆130Updated last year
- Polyhedral Parallel Code Generation (source repository: http://repo.or.cz/ppcg.git)☆125Updated 2 years ago
- RV: A Unified Region Vectorizer for LLVM☆107Updated 2 months ago
- Library to plot integer sets and maps☆49Updated 8 years ago
- Conversions to MLIR EmitC☆128Updated 4 months ago
- Bridging polyhedral analysis tools to the MLIR framework☆109Updated last year
- Artifact Evaluation Reproduction for "Software Prefetching for Indirect Memory Accesses", CGO 2017, using CK.☆38Updated 3 years ago
- Flexible GPGPU instrumentation☆86Updated 5 years ago
- Haystack is an analytical cache model that given a program computes the number of cache misses.☆46Updated 5 years ago
- ☆51Updated 5 years ago
- Third party assembler and GEMM library for NVIDIA Kepler GPU☆81Updated 5 years ago
- Pluto: An automatic polyhedral parallelizer and locality optimizer☆288Updated 2 weeks ago
- ☆53Updated 5 years ago
- Tapir extension to LLVM for optimizing Parallel Programs☆133Updated 4 years ago
- Polyhedral Extraction Tool (source repository: http://repo.or.cz/w/pet.git)☆39Updated 2 years ago
- Enabling on-the-fly manipulations with LLVM IR code of CUDA sources☆110Updated 2 years ago
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆134Updated this week
- MLIRX is now defunct. Please see PolyBlocks - https://docs.polymagelabs.com☆38Updated last year
- ☆38Updated 3 years ago
- ☆240Updated 2 months ago
- Kernel Fusion and Runtime Compilation Based on NNVM☆70Updated 8 years ago
- An out-of-tree MLIR dialect template.☆101Updated 7 months ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆28Updated 6 months ago
- CUDAAdvisor: a GPU profiling tool☆48Updated 6 years ago
- An MLIR frontend for tensor expressions☆24Updated 4 years ago
- development repository for the open earth compiler☆79Updated 4 years ago
- TPP experimentation on MLIR for linear algebra☆126Updated this week
- The Splash-3 benchmark suite☆43Updated last year
- The quantitative performance comparison among DL compilers on CNN models.☆74Updated 4 years ago