karrenberg / wfvLinks

IMPORTANT NOTICE: This implementation is long outdated. Whole-Function Vectorization is an algorithm that transforms a scalar function in such a way that it computes W executions of the original code in parallel using SIMD instructions (W is the target architecture's SIMD width).

☆22

Alternatives and similar repositories for wfv

Users that are interested in wfv are comparing it to the libraries listed below

Sorting:

cdl-saarland / rv
RV: A Unified Region Vectorizer for LLVM
☆111Updated last month
ROCm / LLVM-AMDGPU-Assembler-Extra
LLVM AMDGPU Assembler Helper Tools
☆113Updated 8 years ago
ogiroux / freestanding
☆70Updated 5 years ago
cpc / hipcl
OpenCL/SPIR-V implementation of HIP
☆104Updated 2 years ago
mangpo / swizzle-inventor
A framework that helps implementing swizzle GPU kernels
☆42Updated 5 years ago
jholewinski / llvm-ptx-samples
Sample programs for the LLVM PTX back-end
☆40Updated 9 years ago
intel / cm-compiler
☆151Updated last week
nvidia-compiler-sdk / nvvmir-samples
☆75Updated 2 years ago
ROCm / ROCm-ComputeABI-Doc
ROCm - AMDGPU Compute Application Binary Interface
☆41Updated 3 years ago
eruffaldi / cppPosit
c++ posit implementation
☆44Updated last year
4vtomat / HTM
a Halide language To MLIR compiler.
☆26Updated 3 years ago
apc-llc / nvcc-llvm-ir
Enabling on-the-fly manipulations with LLVM IR code of CUDA sources
☆112Updated 3 months ago
SunsetQuest / CudaPAD
CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of the assembly.
☆119Updated 2 years ago
KhronosGroup / SYCL-CTS
SYCL Conformance Tests
☆70Updated this week
revec / VectorBench
Benchmarks for auto-vectorization and revectorization, including both hand-vectorized and scalar code
☆28Updated 6 years ago
ProGTX / sycl-gtx
Implementation of the SYCL specification.
☆66Updated last year
KhronosGroup / LLVM-SPIRV-Backend
An LLVM backend generating SPIR-V binary.
☆88Updated last year
intel / vc-intrinsics
☆58Updated last month
laanwj / decuda
Decuda and cudasm, the CUDA binary utilities package. Low-level tools for NVidia G80 GPUs.
☆102Updated 15 years ago
sderek / CUDAAdvisor
CUDAAdvisor: a GPU profiling tool
☆49Updated 6 years ago
mc-imperial / gpuverify
GPUVerify: a Verifier for GPU Kernels
☆63Updated 2 years ago
upenn-acg / ocolos-public
Ocolos is the first online code layout optimization system for unmodified applications written in unmanaged languages.
☆52Updated last month
canonizer / halloc
A fast and highly scalable GPU dynamic memory allocator
☆105Updated 10 years ago
ithemal / Ithemal
Instruction THroughput Estimator using MAchine Learning (ITHEMAL)
☆148Updated 3 years ago
gleisonsdm / DawnCC-Compiler
A source-to-source compiler for automatic parallelization of C programs through code annotation.
☆62Updated 5 years ago
opencompl / dyn-dialect
A repository to test dialects defined dynamically.
☆12Updated 2 years ago
Xilinx / triSYCL
Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group
☆77Updated 4 years ago
unisa-hpc / sycl-bench
SYCL Benchmark Suite
☆65Updated last month
KhronosGroup / SYCL_Reference
SYCL Reference Manual
☆28Updated last year
intel / opencl-clang
☆141Updated last week