mangpo / swizzle-inventorLinks
A framework that helps implementing swizzle GPU kernels
☆42Updated 5 years ago
Alternatives and similar repositories for swizzle-inventor
Users that are interested in swizzle-inventor are comparing it to the libraries listed below
Sorting:
- Sample programs for the LLVM PTX back-end☆39Updated 9 years ago
- ☆61Updated 6 years ago
- Library to plot integer sets and maps☆49Updated 8 years ago
- CUDAAdvisor: a GPU profiling tool☆49Updated 6 years ago
- ☆30Updated 2 years ago
- Polyhedral Parallel Code Generation (source repository: http://repo.or.cz/ppcg.git)☆126Updated 2 years ago
- Enabling on-the-fly manipulations with LLVM IR code of CUDA sources☆111Updated 2 months ago
- MLIRX is now defunct. Please see PolyBlocks - https://docs.polymagelabs.com☆38Updated last year
- development repository for the open earth compiler☆80Updated 4 years ago
- Data Dependence Analyzer in the Polyhedral Model☆20Updated last year
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆30Updated 9 months ago
- ☆52Updated 5 years ago
- Polyhedral Extraction Tool (source repository: http://repo.or.cz/w/pet.git)☆39Updated 2 years ago
- Bridging polyhedral analysis tools to the MLIR framework☆113Updated last year
- A repository to test dialects defined dynamically.☆12Updated 2 years ago
- Python wrapper for isl, an integer set library☆77Updated this week
- An experimental ahead of time compiler for Relay.☆50Updated 5 years ago
- Declarative MLIR compilers in Python!☆35Updated 4 years ago
- Chunky Loop Interaction☆24Updated 5 years ago
- Conversions to MLIR EmitC☆128Updated 6 months ago
- Artifact Evaluation Reproduction for "Software Prefetching for Indirect Memory Accesses", CGO 2017, using CK.☆38Updated 3 years ago
- Third party assembler and GEMM library for NVIDIA Kepler GPU☆81Updated 5 years ago
- ☆35Updated 3 years ago
- Data-Centric MLIR dialect☆42Updated last year
- IMPORTANT NOTICE: This implementation is long outdated. The new libwfv will be released soon. Whole-Function Vectorization is an algorith…☆23Updated 13 years ago
- RV: A Unified Region Vectorizer for LLVM☆110Updated 2 weeks ago
- A fast and highly scalable GPU dynamic memory allocator☆104Updated 10 years ago
- LonestarGPU: Irregular algorithms parallelized for GPUs☆35Updated 5 years ago
- A library to benchmark CUDA code, similar to google benchmark.☆29Updated 4 years ago
- Evaluating different memory managers for dynamic GPU memory☆25Updated 4 years ago