SimeonEhrig / CUDA-Runtime-Interpreter
It's a prototype for an interpreter, which can interpret the host code of a CUDA Program, written with the runtime API.
☆9Updated 5 years ago
Alternatives and similar repositories for CUDA-Runtime-Interpreter:
Users that are interested in CUDA-Runtime-Interpreter are comparing it to the libraries listed below
- The repository contains container recipes to build the entire stack of Xeus-Cling and Cling including cuda extension with just a few comm…☆9Updated 4 years ago
- An alternative to Boost.MPI for a user friendly C++ interface for MPI (MPICH).☆19Updated 7 years ago
- CuPy Benchmark☆12Updated 5 years ago
- ☆14Updated 2 years ago
- Library for the GPU-accelerated spatial indexing and processing of particles in 2D and 3D with OpenCL. Currently offers trees based on sp…☆26Updated 8 months ago
- Legate Hello World Pedagogical Library☆10Updated 2 years ago
- Exploring using stdpar and Cython☆33Updated 4 years ago
- Yaksa: High-performance Noncontiguous Data Management☆13Updated 6 months ago
- tokenizer and parser for circle projects☆11Updated 5 years ago
- Benchmark Suite for Heterogenuous FFT Implementations☆35Updated last year
- Standalone Patatrack pixel tracking☆17Updated 6 months ago
- Dynamic execution environments for coupled, thread-heterogeneous MPI+X applications☆21Updated last month
- The parallel API to be utilized by AllScale projects to express parallelism.☆9Updated 6 years ago
- Mirror of https://gitlab.kitware.com/vtk/vtk-m☆31Updated this week
- This repository contains components that will support percolation via OpenCL and CUDA☆32Updated 3 years ago
- GPU Optimization and Memory Abstraction Framework☆32Updated 5 years ago
- A thread safe simple C++ wrapper for FFTW & MKL☆15Updated 3 years ago
- An implementation of ARMCI using MPI one-sided communication (RMA)☆14Updated 6 months ago
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆43Updated this week
- ☆11Updated 2 years ago
- A portable high-level API with CUDA or OpenCL back-end☆54Updated 7 years ago
- ☆11Updated 5 years ago
- Binomial model☆12Updated 5 years ago
- Scientific algorithms implemented on top of the x-stack (xtensor, xsimd ...)☆9Updated 5 years ago
- TTC: A high-performance Compiler for Tensor Transpositions☆20Updated 7 years ago
- GPU Automatically Tuned Linear Algebra Software☆28Updated 9 years ago
- Sorting libraries for pyculib☆14Updated 6 years ago
- CMake FindLAPACK.cmake that works with Intel MKL, Atlas, OpenBLAS, Netlib, LAPACK95 for C / C++ / Fortran☆15Updated 2 years ago
- Library for length agnostic SIMD intrinsic support and the corresponding math operations☆20Updated 3 years ago
- Generic exascale-ready library for halo-exchange operations on variety of grids/meshes☆10Updated 3 weeks ago