khaki3 / ptxas-wrapperLinks
A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code
☆15Updated 2 years ago
Alternatives and similar repositories for ptxas-wrapper
Users that are interested in ptxas-wrapper are comparing it to the libraries listed below
Sorting:
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.☆39Updated 3 years ago
- ☆41Updated 2 weeks ago
- Streaming Message Interface: High-Performance Distributed Memory Programming on Reconfigurable Hardware☆16Updated 3 years ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆52Updated 2 months ago
- SYCL Reference Manual☆28Updated last year
- A novel spatial accelerator for horizontal diffusion weather stencil computation, as described in ICS 2023 paper by Singh et al. (https:/…☆19Updated last year
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆30Updated 8 months ago
- ☆52Updated 5 years ago
- ☆35Updated 3 years ago
- FPGA acceleration of arbitrary precision floating point computations.☆40Updated 3 years ago
- TAPA is a dataflow HLS framework that features fast compilation, expressive programming model and generates high-frequency FPGA accelerat…☆19Updated 9 months ago
- HeteroCL-MLIR dialect for accelerator design☆41Updated 8 months ago
- Polyhedral High-Level Synthesis in MLIR☆31Updated 2 years ago
- Example for running IREE in a bare-metal Arm environment.☆33Updated 3 months ago
- ☆21Updated 3 years ago
- ☆23Updated 3 weeks ago
- CUDAAdvisor: a GPU profiling tool☆49Updated 6 years ago
- ☆55Updated 6 years ago
- ☆29Updated 2 years ago
- 📥 🎯 (1,4/4) an MLIR-based toolchain with Vitis HLS LLVM input/output targeting FPGAs.☆14Updated 2 years ago
- ☆23Updated 3 years ago
- ☆19Updated last week
- A Benchmark Toolkit for Assembly Instructions Using the LLVM JIT☆16Updated 4 years ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆15Updated 5 years ago
- GPU Performance Advisor☆65Updated 2 years ago
- Multi-target compiler for Sum-Product Networks, based on MLIR and LLVM.☆23Updated 6 months ago
- LLVM-Canon aims to transform LLVM modules into a canonical form by reordering and renaming instructions while preserving the same semanti…☆15Updated last year
- Lake is a framework for generating synthesizable memory modules from a high-level behavioral specification and widely-available memory ma…☆22Updated last week
- Parendi: Thousand-way Parallel RTL Simulation on the Graphcore IPU☆22Updated last year
- HeteroHalide: From Image Processing DSL to Efficient FPGA Acceleration☆15Updated 4 years ago