intel / vc-intrinsics
☆56Updated this week
Alternatives and similar repositories for vc-intrinsics:
Users that are interested in vc-intrinsics are comparing it to the libraries listed below
- SYCL Reference Manual☆27Updated 10 months ago
- SYCL Conformance Tests☆68Updated last week
- ☆149Updated last month
- SYCL Open Source Specification☆130Updated this week
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆106Updated this week
- SYCL Benchmark Suite☆63Updated 2 weeks ago
- Tools for parsing, assembling, and disassembling HSAIL.☆71Updated 4 years ago
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.☆36Updated 3 years ago
- SYCL-ML is a C++ library, implementing classical machine learning algorithms using SYCL.☆66Updated 5 years ago
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆130Updated this week
- RAND library for HIP programming language☆117Updated this week
- oneAPI Data Parallel C++ (DPC++) language reference☆26Updated 2 years ago
- ☆137Updated last month
- CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of the assembly.☆115Updated 2 years ago
- Information about AVX-512 support on recent Intel processors☆44Updated 2 years ago
- Simple OpenCL Samples that Build with Khronos Headers and Libs☆97Updated this week
- ☆43Updated this week
- ☆81Updated this week
- ROCm Parallel Primitives☆170Updated this week
- Reusable software components for ROCm developers☆83Updated this week
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆84Updated 2 weeks ago
- ROCm - AMDGPU Compute Application Binary Interface☆41Updated 2 years ago
- ROC profiler library. Profiling with perf-counters and derived metrics.☆135Updated this week
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆50Updated last year
- Intel® GPU Compute Samples☆104Updated this week
- ☆52Updated 5 years ago
- Compute Benchmarks for oneAPI Level Zero and OpenCL™ Driver☆34Updated last week
- assembler for NVIDIA FERMI. Imported from Google Code☆72Updated 9 years ago
- ROCm Device Libraries☆97Updated 10 months ago
- Library with JIT (Just-in-time) compilation support to optimize performance of small and medium matrix multiplication☆14Updated 3 years ago