☆74Jun 29, 2023Updated 2 years ago
Alternatives and similar repositories for nvvmir-samples
Users that are interested in nvvmir-samples are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Python bindings for libNVVM☆38Apr 3, 2014Updated 11 years ago
- Enabling on-the-fly manipulations with LLVM IR code of CUDA sources☆124Apr 18, 2025Updated 11 months ago
- An llvm pass for counting global uncoalesced acceses for cuda code via dynamic analysis.☆14Nov 17, 2018Updated 7 years ago
- cuASR: CUDA Algebra for Semirings☆45Aug 22, 2022Updated 3 years ago
- LLVM Plugin to Instrument Global Memory Accesses in CUDA Kernels☆10Jun 8, 2020Updated 5 years ago
- SYSU-ARCH is a LAB that focuses on the use and extending of simulators.☆10Dec 19, 2022Updated 3 years ago
- A framework for pipelined computing on GPU☆30Jul 17, 2019Updated 6 years ago
- ☆67Oct 10, 2024Updated last year
- CUPTI GPU Profiler☆40Feb 26, 2019Updated 7 years ago
- Project ARES represents a joint effort between LANL and ORNL to introduce a common compiler representation and tool-chain for HPC applica…☆10Nov 30, 2016Updated 9 years ago
- outline and links for PLDI 2022 tutorial☆17Jun 13, 2022Updated 3 years ago
- nvptx-tools: a collection of tools for use with nvptx-none GCC toolchains.☆52Mar 2, 2026Updated 3 weeks ago
- ☆80Nov 16, 2020Updated 5 years ago
- GLSL code generator to aid use of Vulkan's descriptor set indexing☆14Apr 20, 2019Updated 6 years ago
- Multiple 1-stencil implementations using nvidia cuda.☆12Dec 2, 2017Updated 8 years ago
- ☆19Nov 21, 2022Updated 3 years ago
- ☆21May 17, 2015Updated 10 years ago
- some RL algorithms☆19Dec 9, 2016Updated 9 years ago
- ngAP's artifact for ASPLOS'24☆26Jul 29, 2025Updated 7 months ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆32Apr 2, 2025Updated 11 months ago
- GPUOCelot: A dynamic compilation framework for PTX☆290Jul 31, 2023Updated 2 years ago
- Colby Hall's C++ Standard Library☆11Jan 13, 2020Updated 6 years ago
- Sample programs for the LLVM PTX back-end☆41Aug 27, 2015Updated 10 years ago
- Multi-GPU dynamic scheduler using PGAS style cross-GPU communication☆29Jul 23, 2023Updated 2 years ago
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.☆45Oct 25, 2021Updated 4 years ago
- Source code of the simulator used in the Mosaic paper from MICRO 2017: "Mosaic: A GPU Memory Manager with Application-Transparent Support…☆50Aug 21, 2018Updated 7 years ago
- The SHOC Benchmark Suite☆259Oct 6, 2025Updated 5 months ago
- ☆25Mar 26, 2025Updated 11 months ago
- GPGPU-SIM 使用篇☆14Nov 12, 2022Updated 3 years ago
- ☆20Feb 21, 2022Updated 4 years ago
- crossplatform work with serial port☆22Nov 21, 2022Updated 3 years ago
- Fast Point Overlap Test☆18Jun 17, 2018Updated 7 years ago
- A Taichi implementation of WCSPH☆16Dec 3, 2021Updated 4 years ago
- Adobe's C++ Performance Benchmarks for modern compilers (and build systems)☆12Aug 3, 2019Updated 6 years ago
- CUDA Flux is a profiler for GPU applications which reports the basic block executions frequencies of compute kernels☆33Mar 15, 2021Updated 5 years ago
- A decentralized unique ID generator (int64)☆22Jun 15, 2016Updated 9 years ago
- LHCSim is a 3D physics simulation engine developed based on taichi☆17Jul 20, 2022Updated 3 years ago
- A GPU FP32 computation method with Tensor Cores.☆26Dec 8, 2025Updated 3 months ago
- A cron job wrapper that wraps jobs and enables better error reproting and command timeouts.☆29Feb 1, 2022Updated 4 years ago