quic / software-kit-for-qualcomm-cloud-ai-100Links
Software kit for Qualcomm Cloud AI 100
☆18Updated 5 months ago
Alternatives and similar repositories for software-kit-for-qualcomm-cloud-ai-100
Users that are interested in software-kit-for-qualcomm-cloud-ai-100 are comparing it to the libraries listed below
Sorting:
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.☆43Updated 4 years ago
- ☆28Updated 4 years ago
- A framework that support executing unmodified CUDA source code on non-NVIDIA devices.☆136Updated 9 months ago
- Example for running IREE in a bare-metal Arm environment.☆39Updated 3 months ago
- SYCL Reference Manual☆28Updated last year
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆84Updated this week
- Machine Intelligence Shader Autogen. AMDGPU ML shader code generator. (previously iGEMMgen)☆35Updated 3 months ago
- ☆94Updated last week
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆56Updated 7 months ago
- The Riallto Open Source Project from AMD☆84Updated 6 months ago
- SYCL for Vitis: Experimental fusion of triSYCL with Intel SYCL oneAPI DPC++ up-streaming effort into Clang/LLVM☆121Updated 11 months ago
- Tenstorrent Kernel Module☆54Updated this week
- SYCL Benchmark Suite☆65Updated 4 months ago
- Intel® GPU Compute Samples☆109Updated last month
- oneAPI Data Parallel C++ (DPC++) language reference☆26Updated 2 years ago
- ☆19Updated 3 weeks ago
- MLPerf™ Mobile models☆26Updated last month
- CMake modules used within the ROCm libraries☆67Updated this week
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆154Updated this week
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)☆50Updated this week
- CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of the assembly.☆124Updated 2 years ago
- Fork of LLVM to support AMD AIEngine processors☆171Updated this week
- Open source cross-platform compiler for compute-intensive loops used in AI algorithms, from Microsoft Research☆112Updated 2 years ago
- Tenstorrent Firmware repository☆23Updated last week
- ☆157Updated this week
- Utilities for accessing AMD's Machine-Readable GPU ISA Specifications.☆42Updated last month
- rocWMMA☆136Updated this week
- TVM for Tenstorrent ASICs☆27Updated last month
- ☆25Updated last year
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆69Updated last week