oneapi-src / SYCLomatic
☆250 · Updated this week
Alternatives and similar repositories for SYCLomatic:
Users who are interested in SYCLomatic are comparing it to the libraries listed below.
- oneAPI Level Zero Specification Headers and Loader ☆237 · Updated this week
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement. ☆263 · Updated last month
- Stretching GPU performance for GEMMs and tensor contractions. ☆233 · Updated this week
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi… ☆217 · Updated this week
- CUDA Kernel Benchmarking Library ☆561 · Updated 3 months ago
- Source code for 'Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems using C++ and SYCL' by James Reinders, Ben A… ☆259 · Updated last month
- AMD's graph optimization engine. ☆208 · Updated this week
- GPUOcelot: A dynamic compilation framework for PTX ☆166 · Updated last week
- SYCL Academy, a set of learning materials for SYCL heterogeneous programming ☆473 · Updated 3 weeks ago
- AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releas… ☆211 · Updated this week
- ☆60 · Updated 2 months ago
- SYCL Open Source Specification ☆127 · Updated last week
- An implementation of HIP that works on CPUs, across OSes. ☆115 · Updated 11 months ago
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators ☆349 · Updated this week
- chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs. ☆251 · Updated this week
- OpenAI Triton backend for Intel® GPUs ☆165 · Updated this week
- oneAPI Specification source files ☆195 · Updated this week
- This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific… ☆135 · Updated this week
- oneAPI Collective Communications Library (oneCCL) ☆222 · Updated 3 weeks ago
- RAND library for HIP programming language ☆115 · Updated this week
- HIPIFY: Convert CUDA to Portable C++ Code ☆552 · Updated this week
- Intercept Layer for Debugging and Analyzing OpenCL Applications ☆322 · Updated 2 weeks ago
- ROCm Parallel Primitives ☆169 · Updated last week
- ☆81 · Updated this week
- oneAPI Technical Advisory Board (TAB) Meeting Notes ☆72 · Updated last year
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain. ☆130 · Updated this week
- ROCm BLAS marshalling library ☆131 · Updated last week
- rocWMMA ☆100 · Updated this week
- Next generation BLAS implementation for ROCm platform ☆359 · Updated this week
- Examples for HIP ☆202 · Updated 2 months ago