SourceryTools / nvptx-tools
nvptx-tools: a collection of tools for use with nvptx-none GCC toolchains.
☆46Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for nvptx-tools
- SYCL Reference Manual☆26Updated 6 months ago
- OpenSHMEM Application Programming Interface☆51Updated last week
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆80Updated this week
- Codeplay project for contributions to the LLVM SYCL implementation☆30Updated 3 years ago
- SYCL Benchmark Suite☆56Updated 2 months ago
- Implementation of AMD HIP for CPUs☆22Updated 4 years ago
- This tool serves as a test harness for different optimization techniques to improve stencil computations performance in shared and distri…☆20Updated 2 years ago
- Reusable software components for ROCm developers☆79Updated this week
- ROCm - AMDGPU Compute Application Binary Interface☆40Updated 2 years ago
- Compute applications.☆25Updated 4 years ago
- Kernel Tuning Toolkit☆55Updated 3 weeks ago
- An implementation of HIP that works on CPUs, across OSes.☆112Updated 8 months ago
- SYCL Conformance Tests☆62Updated last week
- Loop Kernel Analysis and Performance Modeling Toolkit☆89Updated 2 months ago
- Official BOLT Repository☆28Updated 3 months ago
- Next generation LAPACK implementation for ROCm platform☆95Updated this week
- Asynchronous Task and Memory Interface, or ATMI, is a runtime framework and programming model for heterogeneous CPU-GPU systems. It provi…☆66Updated 9 months ago
- YASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-d…☆104Updated 3 months ago
- A dynamic analysis tool to detect floating-point errors in HPC applications.☆33Updated 2 years ago
- ☆68Updated 4 years ago
- Intel Heterogeneous Research Compiler (iHRC)☆25Updated last year
- mallocMC: Memory Allocator for Many Core Architectures☆51Updated last week
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆100Updated this week
- Par4All is an automatic parallelizing and optimizing compiler (workbench) for C and Fortran sequential programs☆51Updated 9 years ago
- Autonomic Performance Environment for eXascale (APEX)☆38Updated 3 weeks ago
- SYCL Open Source Specification☆116Updated last week
- Automatically exported from code.google.com/p/patus☆15Updated 9 years ago
- ☆14Updated 4 years ago
- An OpenMP runtime implemented using HPX☆23Updated 2 years ago
- Next generation FFT implementation for ROCm☆177Updated this week