karrenberg / wfv
IMPORTANT NOTICE: This implementation is long outdated. The new libwfv will be released soon. Whole-Function Vectorization is an algorithm that transforms a scalar function in such a way that it computes W executions of the original code in parallel using SIMD instructions (W is the target architecture's SIMD width). This implementation of the a…
☆22Updated 12 years ago
Related projects ⓘ
Alternatives and complementary repositories for wfv
- RV: A Unified Region Vectorizer for LLVM☆105Updated 3 weeks ago
- Sample programs for the LLVM PTX back-end☆34Updated 9 years ago
- A framework that helps implementing swizzle GPU kernels☆41Updated 4 years ago
- Data Dependence Analyzer in the Polyhedral Model☆19Updated last year
- ☆27Updated last year
- Library to plot integer sets and maps☆47Updated 7 years ago
- ☆68Updated 4 years ago
- GPUVerify: a Verifier for GPU Kernels☆58Updated 2 years ago
- LLVM AMDGPU Assembler Helper Tools☆111Updated 7 years ago
- ☆75Updated last year
- A Specification and a Library for Data Exchange in Polyhedral Compilation Tools☆26Updated 3 months ago
- CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of the assembly.☆107Updated last year
- Generates CIL MLIR dialect from C/C++ source.☆31Updated 3 years ago
- Enabling on-the-fly manipulations with LLVM IR code of CUDA sources☆98Updated last year
- CUDAAdvisor: a GPU profiling tool☆48Updated 6 years ago
- Integer Set Library (source repository: http://repo.or.cz/w/isl.git)☆64Updated last year
- ROCm - AMDGPU Compute Application Binary Interface☆40Updated 2 years ago
- Haystack is an analytical cache model that given a program computes the number of cache misses.☆42Updated 5 years ago
- a Halide language To MLIR compiler.☆26Updated 3 years ago
- Tools for parsing, assembling, and disassembling HSAIL.☆70Updated 4 years ago
- c++ posit implementation☆43Updated last year
- Benchmarks for auto-vectorization and revectorization, including both hand-vectorized and scalar code☆25Updated 5 years ago
- ☆47Updated 5 years ago
- Develop toolchain based on llvm to for Cpu0 processor☆46Updated 10 months ago
- CERE: Codelet Extractor and REplayer☆41Updated last year
- Bridging polyhedral analysis tools to the MLIR framework☆102Updated last year
- A source-to-source compiler for automatic parallelization of C programs through code annotation.☆61Updated 4 years ago
- Tutorial for LLVM Dev Conference 2019.☆14Updated 5 years ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆42Updated 10 months ago
- ☆53Updated last week