apc-llc / liboffloadmicLinks
Standalone libgomp with MIC backend for explicit CUDA-like Xeon Phi device programming
☆12Updated 9 years ago
Alternatives and similar repositories for liboffloadmic
Users that are interested in liboffloadmic are comparing it to the libraries listed below
Sorting:
- An implementation of ARMCI using MPI one-sided communication (RMA)☆15Updated last year
- Partitioned Global Address Space (PGAS) library for distributed arrays☆106Updated last week
- OpenSHMEM Implementation on MPI☆28Updated 6 months ago
- A unified framework across multiple programming platforms☆41Updated 4 months ago
- The Task-Aware MPI (TAMPI) library extends the functionality of standard MPI libraries by providing new mechanisms for improving the inte…☆25Updated 4 months ago
- CUDA and OpenMP implementations of C2R/R2C inplace transposition☆48Updated 10 years ago
- Global Memory and Threading runtime system☆25Updated last year
- Sandia OpenSHMEM is an implementation of the OpenSHMEM specification over multiple Networking APIs, including Portals 4, the Open Fabric …☆73Updated last month
- Classical molecular dynamics proxy application.☆32Updated 5 years ago
- Linux Cross-Memory Attach☆94Updated last year
- QCD for Intel Xeon Phi and Xeon processors☆14Updated last year
- High Performance Linpack for GPUs (Using OpenCL, CUDA, CAL)☆91Updated 9 years ago
- A BUDE virtual-screening benchmark, in many programming models☆29Updated 11 months ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆32Updated 6 months ago
- sparse matrix pre-processing library☆83Updated last year
- The ultimate bandwidth benchmark☆56Updated this week
- The SparseX sparse kernel optimization library☆42Updated 6 years ago
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)☆111Updated 2 years ago
- ☆60Updated 3 years ago
- Official BOLT Repository☆31Updated last year
- YASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-d…☆109Updated 2 months ago
- Simplified Interface to Complex Memory☆28Updated 2 years ago
- ☆14Updated 4 years ago
- BLAS-like Library Instantiation Software Framework☆151Updated 2 weeks ago
- Omni Compiler for C and Fortran programs with XcalableMP and OpenACC directives☆61Updated 2 years ago
- ☆58Updated 3 weeks ago
- Parallel Tensor Infrastructure (ParTI!)☆30Updated 5 years ago
- TTC: A high-performance Compiler for Tensor Transpositions☆21Updated 7 years ago
- Autonomic Performance Environment for eXascale (APEX)☆49Updated 2 months ago
- High-performance, GPU-aware communication library☆86Updated 9 months ago