mangpo / swizzle-inventorLinks
A framework that helps implementing swizzle GPU kernels
☆41Updated 5 years ago
Alternatives and similar repositories for swizzle-inventor
Users that are interested in swizzle-inventor are comparing it to the libraries listed below
Sorting:
- CUDAAdvisor: a GPU profiling tool☆49Updated 6 years ago
- ☆54Updated 6 years ago
- Library to plot integer sets and maps☆49Updated 8 years ago
- Enabling on-the-fly manipulations with LLVM IR code of CUDA sources☆111Updated last month
- Sample programs for the LLVM PTX back-end☆37Updated 9 years ago
- Polyhedral Parallel Code Generation (source repository: http://repo.or.cz/ppcg.git)☆125Updated 2 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆30Updated 8 months ago
- Bridging polyhedral analysis tools to the MLIR framework☆111Updated last year
- An experimental ahead of time compiler for Relay.☆50Updated 5 years ago
- ☆29Updated 2 years ago
- Python wrapper for isl, an integer set library☆77Updated this week
- Third party assembler and GEMM library for NVIDIA Kepler GPU☆81Updated 5 years ago
- development repository for the open earth compiler☆80Updated 4 years ago
- Intel Heterogeneous Research Compiler (iHRC)☆25Updated 2 years ago
- ☆51Updated 5 years ago
- MLIRX is now defunct. Please see PolyBlocks - https://docs.polymagelabs.com☆38Updated last year
- Polyhedral Extraction Tool (source repository: http://repo.or.cz/w/pet.git)☆39Updated 2 years ago
- IMPORTANT NOTICE: This implementation is long outdated. The new libwfv will be released soon. Whole-Function Vectorization is an algorith…☆22Updated 13 years ago
- Evaluating different memory managers for dynamic GPU memory☆25Updated 4 years ago
- Data Dependence Analyzer in the Polyhedral Model☆20Updated last year
- LonestarGPU: Irregular algorithms parallelized for GPUs☆35Updated 5 years ago
- Artifact Evaluation Reproduction for "Software Prefetching for Indirect Memory Accesses", CGO 2017, using CK.☆38Updated 3 years ago
- Chunky Loop Interaction☆24Updated 5 years ago
- assembler for NVIDIA FERMI. Imported from Google Code☆72Updated 10 years ago
- Declarative MLIR compilers in Python!☆35Updated 4 years ago
- Conversions to MLIR EmitC☆128Updated 5 months ago
- GPUVerify: a Verifier for GPU Kernels☆62Updated 2 years ago
- Haystack is an analytical cache model that given a program computes the number of cache misses.☆46Updated 5 years ago
- ☆35Updated 3 years ago
- RV: A Unified Region Vectorizer for LLVM☆108Updated last week