mangpo / swizzle-inventor
A framework that helps implement swizzle GPU kernels
☆42 Updated 5 years ago
Alternatives and similar repositories for swizzle-inventor
Users who are interested in swizzle-inventor are comparing it to the libraries listed below.
- ☆64 Updated 6 years ago
- CUDAAdvisor: a GPU profiling tool ☆49 Updated 6 years ago
- Enabling on-the-fly manipulations with LLVM IR code of CUDA sources ☆112 Updated 2 months ago
- Library to plot integer sets and maps ☆49 Updated 8 years ago
- Polyhedral Parallel Code Generation (source repository: http://repo.or.cz/ppcg.git) ☆126 Updated 2 years ago
- ☆30 Updated 2 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs ☆30 Updated 9 months ago
- Instruction THroughput Estimator using MAchine Learning (ITHEMAL) ☆148 Updated 3 years ago
- Bridging polyhedral analysis tools to the MLIR framework ☆113 Updated last year
- ☆52 Updated 5 years ago
- Artifact Evaluation Reproduction for "Software Prefetching for Indirect Memory Accesses", CGO 2017, using CK. ☆38 Updated 3 years ago
- MLIRX is now defunct. Please see PolyBlocks - https://docs.polymagelabs.com ☆38 Updated last year
- Development repository for the Open Earth Compiler ☆80 Updated 4 years ago
- Sample programs for the LLVM PTX back-end ☆40 Updated 9 years ago
- Conversions to MLIR EmitC ☆129 Updated 7 months ago
- RV: A Unified Region Vectorizer for LLVM ☆111 Updated last month
- Polyhedral Extraction Tool (source repository: http://repo.or.cz/w/pet.git) ☆39 Updated 2 years ago
- Third party assembler and GEMM library for NVIDIA Kepler GPU ☆81 Updated 5 years ago
- A Benchmark Suite for Heterogeneous System Computation ☆53 Updated 4 months ago
- Python wrapper for isl, an integer set library ☆77 Updated this week
- Flexible GPGPU instrumentation ☆88 Updated 5 years ago
- An experimental ahead of time compiler for Relay. ☆50 Updated 5 years ago
- Tapir extension to LLVM for optimizing Parallel Programs ☆134 Updated 5 years ago
- Integer Set Library (source repository: http://repo.or.cz/w/isl.git) ☆70 Updated 5 months ago
- Evaluating different memory managers for dynamic GPU memory ☆25 Updated 4 years ago
- An out-of-tree MLIR dialect template. ☆103 Updated 10 months ago
- The StreamIt compiler infrastructure. ☆71 Updated 8 years ago
- Assembler for NVIDIA Fermi. Imported from Google Code ☆72 Updated 10 years ago
- IMPORTANT NOTICE: This implementation is long outdated. Whole-Function Vectorization is an algorithm that transforms a scalar function in… ☆22 Updated 13 years ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth ☆106 Updated 7 years ago