iitd-plos / unicorn
Unicorn - An HPC Library for hybrid CPU-GPU clusters (TPDS 2016 paper)
☆12Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for unicorn
- ibmgraphblas☆27Updated 6 years ago
- DSL for stencils and image processing☆13Updated 8 years ago
- Apollo: Online Machine Learning for Performance Portability☆22Updated 2 months ago
- compiler for fortran stencils using verified lifting,☆17Updated 2 years ago
- A Sound and Complete Verification Tool for Warp-Specialized GPU Kernels☆18Updated 9 years ago
- Python bindings for libNVVM☆37Updated 10 years ago
- Intel Heterogeneous Research Compiler (iHRC)☆25Updated last year
- Extended Roofline Model - LLVM source tree with additional libraries for the analysis of the dynamic execution in the interpreter☆17Updated 7 years ago
- Simplified Interface to Complex Memory☆26Updated last year
- TAU Performance System Public Mirror (Updated every night at midnight, USA Pacific Time)☆39Updated 2 weeks ago
- Artifact of paper "Exploiting Recent SIMD Architectural Advances for Irregular Applications"☆11Updated 8 years ago
- A tuning assistant tool to find a lower floating-point precision that can be used in any part of a program. Precimonious performs a searc…☆34Updated 8 years ago
- Reference implementation of Deep Neural Network primitives using LIBXSMM's Tensor Processing Primitives (TPP)☆12Updated 3 months ago
- Orio is an open-source extensible framework for the definition of domain-specific languages and generation of optimized code for multiple…☆36Updated 3 years ago
- Official BOLT Repository☆27Updated 3 months ago
- Scout -- Domain Specific Language & Toolchain☆15Updated 8 years ago
- The implementation of the Elevate language☆29Updated 3 weeks ago
- Experimental Linear Algebra Performance Studies☆12Updated 7 years ago
- Heterogeneous Active Messages C++ library☆21Updated 5 years ago
- A CUDA implementation of the PageRank Pipeline Benchmark☆32Updated 7 years ago
- The rep contains my experiments with state of the art NVM programming abstractions during my internship at Regal Lab of Inria Paris under…☆11Updated 4 years ago
- Visualization tool for analyzing call trees and graphs☆31Updated last year
- A copy of the Intel Cilk Plus runtime system with modifications to work with OpenCilk and its associated tools.☆12Updated 3 years ago
- HiCMA: Hierarchical Computations on Manycore Architectures☆28Updated last year
- Configurable Runtime Analysis for Floating-point Tuning☆12Updated 4 years ago
- thread-safe sparse matrix data structure☆25Updated 10 years ago
- The SparseX sparse kernel optimization library☆39Updated 5 years ago
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆80Updated this week
- A mirror of cinch's internal gitlab repository.☆22Updated 2 years ago
- TTC: A high-performance Compiler for Tensor Transpositions☆20Updated 7 years ago