oneapi-src / oneAPI-samples
Samples for Intel® oneAPI Toolkits
☆961Updated this week
Related projects ⓘ
Alternatives and complementary repositories for oneAPI-samples
- oneAPI Math Kernel Library (oneMKL) Interfaces☆624Updated this week
- SYCL Academy, a set of learning materials for SYCL heterogeneous programming☆459Updated this week
- Source code for 'Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems using C++ and SYCL' by James Reinders, Ben A…☆252Updated last month
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi…☆202Updated last week
- CUDA Kernel Benchmarking Library☆519Updated this week
- Implementation of SYCL and C++ standard parallelism for CPUs and GPUs from all vendors: The independent, community-driven compiler for C+…☆1,397Updated this week
- CUDA Core Compute Libraries☆1,299Updated this week
- ☆233Updated this week
- oneAPI Specification source files☆191Updated 2 weeks ago
- STREAM, for lots of devices written in many programming models☆326Updated 2 months ago
- Intel staging area for llvm.org contribution. Home for Intel LLVM-based projects.☆1,259Updated this week
- Intel® Extension for TensorFlow*☆320Updated last month
- HIPIFY: Convert CUDA to Portable C++ Code☆523Updated this week
- oneAPI Level Zero Specification Headers and Loader☆222Updated this week
- ☆486Updated this week
- The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou…☆308Updated this week
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster☆561Updated 3 weeks ago
- ☆218Updated last week
- cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it☆455Updated last month
- An Awesome list of oneAPI projects☆126Updated 3 months ago
- Library for specialized dense and sparse matrix operations, and deep learning primitives.☆850Updated this week
- An implementation of BLAS using the SYCL open standard.☆259Updated 3 weeks ago
- Next generation BLAS implementation for ROCm platform☆346Updated this week
- Main Book repository for the Parallel and High Performance Computing book, Manning Publications☆176Updated 2 years ago
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators☆315Updated this week
- RAJA Performance Portability Layer (C++)☆491Updated this week
- Examples for HIP☆200Updated 2 weeks ago
- QUDA is a library for performing calculations in lattice QCD on GPUs.☆294Updated this week
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆92Updated 2 years ago
- collection of benchmarks to measure basic GPU capabilities☆264Updated 5 months ago