☆27Oct 25, 2021Updated 4 years ago
Alternatives and similar repositories for NVPTX-SPIRV-Translator
Users that are interested in NVPTX-SPIRV-Translator are comparing it to the libraries listed below
Sorting:
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.☆44Oct 25, 2021Updated 4 years ago
- ☆11Apr 3, 2023Updated 2 years ago
- Presentation for JuliaCon 2022 on precompilation☆15Aug 18, 2023Updated 2 years ago
- ☆21Updated this week
- XuanTie vendor extension Instruction Set spec☆44May 30, 2025Updated 9 months ago
- General Purpose Graphics Processing Unit (GPGPU) IP Core☆11Jul 4, 2014Updated 11 years ago
- Generate versal system design from ONNX model. AI engine kernels. Sub-microsecond speeds for autoencoders.☆16Dec 29, 2024Updated last year
- ☆13May 3, 2019Updated 6 years ago
- ☆14May 28, 2019Updated 6 years ago
- CAKE Library for constant-bandwidth matrix multiplication on CPUs☆14Apr 6, 2024Updated last year
- study of Ampere' Sparse Matmul☆18Jan 10, 2021Updated 5 years ago
- TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure.☆19May 12, 2024Updated last year
- x86 Hardware Performance Counter monitoring in Julia☆20Apr 26, 2022Updated 3 years ago
- USB stack for Atmel ATxmega32A4U and related parts☆37Feb 26, 2017Updated 9 years ago
- ☆20Mar 14, 2023Updated 2 years ago
- Fundamental Sources for Water Wave Animation☆20Dec 8, 2022Updated 3 years ago
- The Yao compiler project☆21Dec 12, 2021Updated 4 years ago
- bhSPARSE: A Sparse BLAS Library☆17Nov 6, 2015Updated 10 years ago
- Clang compiler infrastructure for Julia☆22Jul 5, 2025Updated 8 months ago
- 如何做技术演讲(how to give a talk)的slide☆22Feb 8, 2021Updated 5 years ago
- Testing new ideas for array iteration☆20Dec 13, 2020Updated 5 years ago
- A version of the STREAM benchmark which measures the sustainable memory bandwidth.☆28Dec 15, 2025Updated 2 months ago
- ☆27Nov 27, 2023Updated 2 years ago
- Parsers for CUDA binary files☆24Dec 29, 2023Updated 2 years ago
- Evaluating different memory managers for dynamic GPU memory☆26Dec 16, 2020Updated 5 years ago
- An MLIR-based toy DL compiler for TVM Relay.☆61Oct 16, 2022Updated 3 years ago
- Exercises in avoiding common performance traps with Julia☆28May 15, 2023Updated 2 years ago
- Official implementation of Neurips 2020 "Sparse Weight Activation Training" paper.☆29Jul 23, 2021Updated 4 years ago
- [ICCV 2021] Code release for "Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks"☆32Jul 24, 2022Updated 3 years ago
- ☆64Feb 10, 2025Updated last year
- linux bsp app & sample for axpi pro (ax650n)☆31Nov 12, 2024Updated last year
- GEMM and Winograd based convolutions using CUTLASS☆28Jul 15, 2020Updated 5 years ago
- Sample programs for the LLVM PTX back-end☆41Aug 27, 2015Updated 10 years ago
- ☆27Oct 26, 2019Updated 6 years ago
- Repository holding the code base to AC-SpGEMM : "Adaptive Sparse Matrix-Matrix Multiplication on the GPU"☆31Jul 7, 2020Updated 5 years ago
- Utilities for accessing AMD's Machine-Readable GPU ISA Specifications.☆46Sep 24, 2025Updated 5 months ago
- Various examples for Chisel HDL☆30Mar 20, 2022Updated 3 years ago
- The note of Qualcomm OpenCL SDK☆37Nov 8, 2018Updated 7 years ago
- A framework that support executing unmodified CUDA source code on non-NVIDIA devices.☆143Jan 3, 2025Updated last year