nitro-tuner / nitroLinks
Nitro Autotuning Framework
β9Updated 8 years ago
Alternatives and similar repositories for nitro
Users that are interested in nitro are comparing it to the libraries listed below
Sorting:
- Intel Heterogeneous Research Compiler (iHRC)β25Updated 2 years ago
- π "Synthesizing Benchmarks for Predictive Modeling" (π₯ CGO'17 Best Paper)β22Updated 2 years ago
- The OpenDwarfs project provides a benchmark suite consisting of different computation/communication idioms, i.e., dwarfs, for state-of-arβ¦β99Updated 5 years ago
- GPUVerify: a Verifier for GPU Kernelsβ62Updated 2 years ago
- The SparseX sparse kernel optimization libraryβ39Updated 6 years ago
- A Benchmark Suite for Heterogeneous System Computationβ53Updated 4 months ago
- GraphMat graph analytics frameworkβ102Updated 2 years ago
- LonestarGPU: Irregular algorithms parallelized for GPUsβ35Updated 5 years ago
- Flexible GPGPU instrumentationβ87Updated 5 years ago
- Compute applications.β24Updated 5 years ago
- Orio is an open-source extensible framework for the definition of domain-specific languages and generation of optimized code for multipleβ¦β37Updated 3 years ago
- CUDAAdvisor: a GPU profiling toolβ49Updated 6 years ago
- Benchmark for Co-running Single Applications on Integrated Architecturesβ12Updated 8 years ago
- GPU Optimization and Memory Abstraction Frameworkβ32Updated 5 years ago
- Library to plot integer sets and mapsβ49Updated 8 years ago
- A domain-specific language and compiler for image processingβ76Updated 4 years ago
- Chaiβ44Updated last year
- sparse matrix pre-processing libraryβ82Updated last year
- Program analysis tool based on software performance countersβ57Updated 4 years ago
- Loop Kernel Analysis and Performance Modeling Toolkitβ93Updated 3 months ago
- Reference workloads for modern deep learning methods.β73Updated 2 years ago
- CUDA and OpenMP implementations of C2R/R2C inplace transpositionβ46Updated 10 years ago
- a heterogeneous multiGPU level-3 BLAS libraryβ45Updated 5 years ago
- Asynchronous Multi-GPU Programming Frameworkβ46Updated 4 years ago
- Chunky Loop Analyzer: A Polyhedral Representation Extraction Tool for High Level Programsβ24Updated 2 years ago
- A Distributed Multi-GPU System for Fast Graph Processingβ65Updated 6 years ago
- Artifact of paper "Exploiting Recent SIMD Architectural Advances for Irregular Applications"β11Updated 9 years ago
- OpenCL tool to detect buffer overflows in GPU kernelsβ21Updated 6 years ago
- A tuning assistant tool to find a lower floating-point precision that can be used in any part of a program. Precimonious performs a searcβ¦β35Updated 8 years ago
- β34Updated 3 years ago