nitro-tuner / nitro
Nitro Autotuning Framework
☆9 · Updated 8 years ago
Alternatives and similar repositories for nitro
Users that are interested in nitro are comparing it to the libraries listed below
- The SparseX sparse kernel optimization library · ☆40 · Updated 6 years ago
- "Synthesizing Benchmarks for Predictive Modeling" (CGO'17 Best Paper) · ☆22 · Updated 2 years ago
- Intel Heterogeneous Research Compiler (iHRC) · ☆25 · Updated 2 years ago
- Program analysis tool based on software performance counters · ☆57 · Updated 4 years ago
- Flexible GPGPU instrumentation · ☆88 · Updated 5 years ago
- The OpenDwarfs project provides a benchmark suite consisting of different computation/communication idioms, i.e., dwarfs, for state-of-ar… · ☆98 · Updated 5 years ago
- Compute applications. · ☆24 · Updated 5 years ago
- The SHOC Benchmark Suite · ☆256 · Updated 3 years ago
- A Benchmark Suite for Heterogeneous System Computation · ☆53 · Updated 5 months ago
- a heterogeneous multiGPU level-3 BLAS library · ☆45 · Updated 5 years ago
- ☆34 · Updated 3 years ago
- LonestarGPU: Irregular algorithms parallelized for GPUs · ☆35 · Updated 5 years ago
- Loop Kernel Analysis and Performance Modeling Toolkit · ☆94 · Updated 4 months ago
- The Surprisingly ParalleL spArse Tensor Toolkit. · ☆71 · Updated 3 years ago
- Artifact Evaluation Reproduction for "Software Prefetching for Indirect Memory Accesses", CGO 2017, using CK. · ☆39 · Updated 3 years ago
- GraphMat graph analytics framework · ☆102 · Updated 2 years ago
- GPUVerify: a Verifier for GPU Kernels · ☆63 · Updated 2 years ago
- Caffe deep learning framework - optimized for Xeon Phi · ☆14 · Updated 10 years ago
- CSR-based SpMV on Heterogeneous Processors (Intel Broadwell, AMD Kaveri and nVidia Tegra K1) · ☆27 · Updated 10 years ago
- OpenCL tool to detect buffer overflows in GPU kernels · ☆21 · Updated 6 years ago
- OpenCL extension for csmith. · ☆24 · Updated 8 years ago
- sparse matrix pre-processing library · ☆83 · Updated last year
- A tuning assistant tool to find a lower floating-point precision that can be used in any part of a program. Precimonious performs a searc… · ☆36 · Updated 8 years ago
- A task benchmark · ☆43 · Updated 11 months ago
- COBAYN: Compiler Autotuning Framework Using Bayesian Networks · ☆20 · Updated 3 years ago
- Nanos++ is a runtime designed to serve as runtime support in parallel environments. It is mainly used to support OmpSs, an extension to O… · ☆37 · Updated 3 years ago
- Instantiate the Cache Aware Roofline Model on single socket and multisocket systems. · ☆27 · Updated 6 years ago
- CLTune: An automatic OpenCL & CUDA kernel tuner · ☆180 · Updated 2 years ago
- Barcelona OpenMP Task Suite is a collection of applications that allow to test OpenMP tasking implementations and compare its behaviour u… · ☆46 · Updated 5 years ago
- Chunky Loop Analyzer: A Polyhedral Representation Extraction Tool for High Level Programs · ☆24 · Updated 2 years ago