intel / vc-intrinsicsLinks
☆58Updated 2 weeks ago
Alternatives and similar repositories for vc-intrinsics
Users that are interested in vc-intrinsics are comparing it to the libraries listed below
Sorting:
- ☆153Updated this week
- SYCL Conformance Tests☆70Updated 3 weeks ago
- SYCL Reference Manual☆28Updated last year
- ☆141Updated last week
- SYCL Open Source Specification☆136Updated 3 weeks ago
- Intel® GPU Compute Samples☆109Updated 3 weeks ago
- CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of the assembly.☆123Updated 2 years ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆56Updated 6 months ago
- RV: A Unified Region Vectorizer for LLVM☆112Updated 4 months ago
- SYCL Benchmark Suite☆65Updated 3 months ago
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.☆43Updated 3 years ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆84Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆123Updated this week
- SYCL-ML is a C++ library, implementing classical machine learning algorithms using SYCL.☆66Updated 5 years ago
- An implementation of HIP that works on CPUs, across OSes.☆126Updated last year
- ROCm Device Libraries☆96Updated last year
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆119Updated this week
- LLVM AMDGPU Assembler Helper Tools☆112Updated 8 years ago
- immintrin_dbg.h is an include file, a wrapper around immintrin.h. It implements most of AVX, AVX2, AVX-512 vector intrinsics to enable so…☆58Updated 2 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆84Updated last week
- ☆16Updated 4 years ago
- ☆92Updated this week
- Open source cross-platform compiler for compute-intensive loops used in AI algorithms, from Microsoft Research☆109Updated last year
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆175Updated this week
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆152Updated last week
- Intel® Instrumentation and Tracing Technology (ITT) and Just-In-Time (JIT) APIs☆121Updated last week
- ☆64Updated 6 years ago
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)☆47Updated this week
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆145Updated this week
- AMD’s C++ library for accelerating tensor primitives☆46Updated this week