llvm-gpu-news / llvm-gpu-news.github.io
☆15Updated last year
Related projects: ⓘ
- SYCL Conformance Tests☆60Updated last week
- SYCL Reference Manual☆25Updated 4 months ago
- Synchronous, single-threaded, library-only SYCL implementation for debugging and verification.☆25Updated this week
- SYCL Benchmark Suite☆57Updated last week
- Advanced Profiling and Analytics for AMD Hardware☆132Updated last week
- SYCL Open Source Specification☆109Updated this week
- Intel staging area for llvm.org contribution. Home for Intel LLVM-based projects.☆19Updated 3 years ago
- Reusable software components for ROCm developers☆81Updated last week
- AMD’s C++ library for accelerating tensor primitives☆35Updated this week
- Codeplay project for contributions to the LLVM SYCL implementation☆30Updated 3 years ago
- floating-point errors checker☆47Updated 2 months ago
- mallocMC: Memory Allocator for Many Core Architectures☆50Updated 3 weeks ago
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆104Updated last week
- An implementation of HIP that works on CPUs, across OSes.☆109Updated 6 months ago
- nvptx-tools: a collection of tools for use with nvptx-none GCC toolchains.☆46Updated 2 weeks ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆39Updated 8 months ago
- OpenMP Offloading Validation & Verification Suite; Official repository. We have migrated from bitbucket!! For documentation, results, pub…☆52Updated last week
- Experimental OpenCL SPIR-V to OpenCL C translator☆24Updated last week
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆78Updated last month
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆100Updated this week
- hipFFT is a FFT marshalling library.☆52Updated this week
- ☆44Updated 7 months ago
- AMD lab notes with code examples to demonstrate use of AMD GPUs☆89Updated 2 months ago
- Unit benchmarks of CUDA event APIs.☆17Updated 4 months ago
- RAJA Performance Suite☆110Updated last week
- Utilities for accessing AMD's Machine-Readable GPU ISA Specifications.☆15Updated 3 weeks ago
- Next generation LAPACK implementation for ROCm platform☆91Updated this week
- ROCm Parallel Primitives☆156Updated this week
- CUDA and OpenMP implementations of C2R/R2C inplace transposition☆44Updated 9 years ago
- Implementation of AMD HIP for CPUs☆22Updated 4 years ago