Tohoku-University-Takizawa-Lab / neoSYCL
A SYCL Implementation for CPU and SX-Aurora TSUBASA
☆53 · Updated 2 years ago
Alternatives and similar repositories for neoSYCL
Users who are interested in neoSYCL are comparing it to the libraries listed below.
- This is the git repository for RIKEN simulator designed to simulate the binary code for Fujitsu A64FX. ☆36 · Updated 5 years ago
- Omni Compiler for C and Fortran programs with XcalableMP and OpenACC directives ☆61 · Updated last year
- SYCL Reference Manual ☆28 · Updated last year
- Library of High Precision Sparse Matrix Operations Accelerated by SIMD ☆42 · Updated 3 years ago
- MPI accelerator-integrated communication extensions ☆33 · Updated 2 years ago
- Reusable software components for ROCm developers ☆84 · Updated this week
- World championship code for Graph500 ☆25 · Updated last year
- High Performance Linpack for Next-Generation AMD HPC Accelerators ☆54 · Updated 2 weeks ago
- ROCm SPARSE marshalling library ☆67 · Updated this week
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems. ☆56 · Updated last month
- Tutorials for ARM SVE on Docker ☆43 · Updated 2 years ago
- HPCG benchmark based on ROCm platform ☆37 · Updated last week
- ROCm Thrust - run Thrust dependent software on AMD GPUs ☆120 · Updated this week
- A unified framework across multiple programming platforms ☆38 · Updated last week
- instruction-bench ☆36 · Updated 2 years ago
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda ☆87 · Updated 3 weeks ago
- RAJA Performance Suite ☆117 · Updated last week
- An HPL-AI implementation for Fugaku ☆21 · Updated 3 years ago
- Official BOLT Repository ☆28 · Updated 9 months ago
- Next generation LAPACK implementation for ROCm platform ☆101 · Updated last week
- This tool serves as a test harness for different optimization techniques to improve stencil computations performance in shared and distri… ☆20 · Updated 2 years ago
- AMD’s C++ library for accelerating tensor primitives ☆41 · Updated this week
- SYCL Benchmark Suite ☆64 · Updated 3 months ago
- ☆18 · Updated last year
- mallocMC: Memory Allocator for Many Core Architectures ☆55 · Updated 3 weeks ago
- VEDA (VE Driver API) ☆17 · Updated 3 months ago
- Itoyori: A distributed multi-threading runtime system for global-view fork-join task parallelism ☆20 · Updated last year
- Autonomic Performance Environment for eXascale (APEX) ☆48 · Updated 2 weeks ago
- ☆36 · Updated last week
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH) ☆108 · Updated 2 years ago