quic / software-kit-for-qualcomm-cloud-ai-100
Software kit for Qualcomm Cloud AI 100
☆17Updated last month
Related projects ⓘ
Alternatives and complementary repositories for software-kit-for-qualcomm-cloud-ai-100
- Software kit for Qualcomm Cloud AI 100 cc☆10Updated 9 months ago
- ☆18Updated 3 years ago
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.☆33Updated 3 years ago
- a clone of POCL that includes RISC-V newlib devices support and Vortex☆37Updated 5 months ago
- ☆129Updated this week
- A framework that support executing unmodified CUDA source code on non-NVIDIA devices.☆105Updated 3 months ago
- MLPerf™ Mobile models☆24Updated last month
- ☆54Updated this week
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆43Updated 10 months ago
- Test suite for probing the numerical behavior of NVIDIA tensor cores☆30Updated 4 months ago
- rocWMMA☆92Updated this week
- ☆37Updated 3 years ago
- OpenAI Triton backend for Intel® GPUs☆143Updated this week
- Machine Intelligence Shader Autogen. AMDGPU ML shader code generator. (previously iGEMMgen)☆34Updated 2 months ago
- ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs☆75Updated last week
- Data Dependence Analyzer in the Polyhedral Model☆19Updated last year
- Conversions to MLIR EmitC☆124Updated 3 months ago
- ☆30Updated this week
- SYCL Reference Manual☆26Updated 6 months ago
- Stretching GPU performance for GEMMs and tensor contractions.☆223Updated this week
- ☆20Updated 9 months ago
- A sandbox for quick iteration and experimentation on projects related to IREE, MLIR, and LLVM☆55Updated 2 months ago
- Examples showing how to utilize the NVML library for GPU monitoring☆26Updated 2 years ago
- Bandwidth test for ROCm☆49Updated this week
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators☆68Updated 10 months ago
- IREE plugin repository for the AMD AIE accelerator☆69Updated this week
- portDNN is a library implementing neural network algorithms written using SYCL☆109Updated 6 months ago
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆35Updated 6 months ago
- An extension library of WMMA API (Tensor Core API)☆84Updated 4 months ago