SimeonEhrig / CUDA-Runtime-Interpreter
It's a prototype for an interpreter, which can interpret the host code of a CUDA Program, written with the runtime API.
☆9Updated 5 years ago
Alternatives and similar repositories for CUDA-Runtime-Interpreter:
Users that are interested in CUDA-Runtime-Interpreter are comparing it to the libraries listed below
- The repository contains container recipes to build the entire stack of Xeus-Cling and Cling including cuda extension with just a few comm…☆9Updated 4 years ago
- cmake-easyinstall git+https://github.com/org/repo.git☆25Updated last year
- tokenizer and parser for circle projects☆11Updated 5 years ago
- Cooperative Primitives for CUDA C++ Kernel Authors. This repository contains CUB PRs from Q4 2019 until Q4 2020.☆22Updated 4 years ago
- C++ User interface for the Platform independent Library Alpaka☆37Updated 6 months ago
- CuPy Benchmark☆12Updated 5 years ago
- An alternative to Boost.MPI for a user friendly C++ interface for MPI (MPICH).☆19Updated 7 years ago
- Library for the GPU-accelerated spatial indexing and processing of particles in 2D and 3D with OpenCL. Currently offers trees based on sp…☆25Updated 7 months ago
- Easy to use benchmarks for linear algebra frameworks☆24Updated 4 years ago
- A portable high-level API with CUDA or OpenCL back-end☆54Updated 7 years ago
- This repository contains components that will support percolation via OpenCL and CUDA☆31Updated 3 years ago
- GPU Automatically Tuned Linear Algebra Software☆28Updated 9 years ago
- Range-based for loops to iterate over a range of numbers or values☆35Updated 8 years ago
- The Hybrid Task Graph Scheduler API☆40Updated 3 years ago
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆36Updated last week
- VIGRA2 based on xtensor☆10Updated 6 years ago
- Project ARES represents a joint effort between LANL and ORNL to introduce a common compiler representation and tool-chain for HPC applica…☆10Updated 8 years ago
- SYCL-ML is a C++ library, implementing classical machine learning algorithms using SYCL.☆66Updated 5 years ago
- An implementation of ARMCI using MPI one-sided communication (RMA)☆14Updated 5 months ago
- Resource-based, Declarative task-Graphs for Parallel, Event-driven Scheduling☆21Updated 4 months ago
- Scientific algorithms implemented on top of the x-stack (xtensor, xsimd ...)☆9Updated 5 years ago
- Legate Hello World Pedagogical Library☆10Updated last year
- ☆14Updated 2 years ago
- ☆17Updated last week
- ☆11Updated 2 years ago
- associative floating point addition☆17Updated 10 months ago
- Library for length agnostic SIMD intrinsic support and the corresponding math operations☆20Updated 3 years ago
- Yaksa: High-performance Noncontiguous Data Management☆13Updated 5 months ago
- C++ Header-Only Library for High-Performance Tensor-Vector Multiplication☆21Updated 2 months ago
- ☆10Updated 2 years ago