oneapi-src / oneAPI-samples
Samples for Intel® oneAPI Toolkits
☆948Updated this week
Related projects ⓘ
Alternatives and complementary repositories for oneAPI-samples
- Source code for 'Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems using C++ and SYCL' by James Reinders, Ben A…☆251Updated 3 weeks ago
- oneAPI Math Kernel Library (oneMKL) Interfaces☆620Updated this week
- SYCL Academy, a set of learning materials for SYCL heterogeneous programming☆454Updated this week
- ☆228Updated this week
- oneAPI Specification source files☆190Updated this week
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi…☆201Updated 2 weeks ago
- oneAPI Level Zero Specification Headers and Loader☆218Updated last week
- CUDA Kernel Benchmarking Library☆516Updated 2 weeks ago
- Intel staging area for llvm.org contribution. Home for Intel LLVM-based projects.☆1,251Updated this week
- An Awesome list of oneAPI projects☆124Updated 2 months ago
- HIPIFY: Convert CUDA to Portable C++ Code☆523Updated this week
- CUDA Core Compute Libraries☆1,252Updated this week
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster☆555Updated last week
- CUDA Library Samples☆1,604Updated last week
- STREAM, for lots of devices written in many programming models☆325Updated 2 months ago
- Intercept Layer for Debugging and Analyzing OpenCL Applications☆314Updated last week
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl☆1,680Updated last year
- Examples for HIP☆201Updated this week
- Python SYCL bindings and SYCL-based Python Array API library☆101Updated this week
- ☆215Updated this week
- Implementation of SYCL and C++ standard parallelism for CPUs and GPUs from all vendors: The independent, community-driven compiler for C+…☆1,383Updated this week
- OpenCL API, OpenCL C, Extensions, SPIR-V Environment Specs, Ref page, and C++ for OpenCL doc sources.☆359Updated 2 weeks ago
- A collection of examples for the ROCm software stack☆167Updated this week
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).☆517Updated 5 months ago
- cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it☆449Updated 2 weeks ago
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators☆309Updated this week
- ROCm Communication Collectives Library (RCCL)☆267Updated this week
- RAJA Performance Portability Layer (C++)☆486Updated this week
- Intel® Extension for TensorFlow*☆317Updated last month
- An implementation of BLAS using the SYCL open standard.☆259Updated last week