intel / vc-intrinsics
☆56 · Updated this week
Alternatives and similar repositories for vc-intrinsics:
Users that are interested in vc-intrinsics are comparing it to the libraries listed below
- ☆150 · Updated this week
- SYCL Reference Manual ☆27 · Updated 10 months ago
- SYCL Conformance Tests ☆68 · Updated this week
- ROCm Thrust - run Thrust dependent software on AMD GPUs ☆106 · Updated this week
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator. ☆37 · Updated 3 years ago
- SYCL Open Source Specification ☆130 · Updated this week
- ☆138 · Updated 2 months ago
- SYCL Benchmark Suite ☆64 · Updated last month
- ☆44 · Updated this week
- RAND library for HIP programming language ☆117 · Updated this week
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis. ☆50 · Updated this week
- Intel® GPU Compute Samples ☆105 · Updated 2 weeks ago
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain. ☆129 · Updated last week
- DO NOT USE : Deprecated : Mirror of AMD llvm-project : The source repo is https://github.com/RadeonOpenCompute/llvm-project. Several ti… ☆13 · Updated last year
- SYCL-ML is a C++ library, implementing classical machine learning algorithms using SYCL. ☆66 · Updated 5 years ago
- ☆138 · Updated this week
- oneAPI Level Zero Conformance & Performance test content ☆48 · Updated this week
- nvptx-tools: a collection of tools for use with nvptx-none GCC toolchains. ☆49 · Updated 6 months ago
- Reusable software components for ROCm developers ☆83 · Updated this week
- ROCm Device Libraries ☆97 · Updated 10 months ago
- oneAPI Data Parallel C++ (DPC++) language reference ☆26 · Updated 2 years ago
- assembler for NVIDIA FERMI. Imported from Google Code ☆72 · Updated 10 years ago
- Enabling on-the-fly manipulations with LLVM IR code of CUDA sources ☆110 · Updated 2 years ago
- Simple OpenCL Samples that Build with Khronos Headers and Libs ☆100 · Updated this week
- Codeplay project for contributions to the LLVM SYCL implementation ☆30 · Updated 4 years ago
- This is a mirror of the official libpfm4 git repository, https://sourceforge.net/p/perfmon2/libpfm4/ci/master/tree/ with some local branc… ☆56 · Updated 5 months ago
- AMD’s C++ library for accelerating tensor primitives ☆39 · Updated this week
- Emulating DMA Engines on GPUs for Performance and Portability ☆39 · Updated 9 years ago
- CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of the assembly. ☆116 · Updated 2 years ago
- Library with JIT (Just-in-time) compilation support to optimize performance of small and medium matrix multiplication ☆14 · Updated 3 years ago