Tohoku-University-Takizawa-Lab / neoSYCL
A SYCL Implementation for CPU and SX-Aurora TSUBASA
☆52 Updated 2 years ago
Alternatives and similar repositories for neoSYCL:
Users who are interested in neoSYCL are comparing it to the libraries listed below.
- This is the git repository for RIKEN simulator designed to simulate the binary code for Fujitsu A64FX. ☆35 Updated 4 years ago
- Library of High Precision Sparse Matrix Operations Accelerated by SIMD ☆42 Updated 3 years ago
- ☆15 Updated 2 years ago
- ROCm SPARSE marshalling library ☆67 Updated last week
- Official BOLT Repository ☆28 Updated 7 months ago
- Omni Compiler for C and Fortran programs with XcalableMP and OpenACC directives ☆61 Updated last year
- ROCm Thrust - run Thrust dependent software on AMD GPUs ☆106 Updated this week
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems. ☆51 Updated last month
- ASM generation tool for GAS/NASM/MASM with Xbyak-like syntax in Python ☆12 Updated 2 weeks ago
- Reusable software components for ROCm developers ☆83 Updated this week
- Tutorials for ARM SVE on Docker ☆43 Updated 2 years ago
- ☆17 Updated last year
- Another|Alternative|Awesome VE Offloading stack using ve-urpc ☆14 Updated last year
- HPCG benchmark based on ROCm platform ☆37 Updated last week
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH) ☆106 Updated last year
- RAJA Performance Suite ☆118 Updated this week
- Advanced Profiling and Analytics for AMD Hardware ☆142 Updated this week
- instruction-bench ☆36 Updated 2 years ago
- AMD’s C++ library for accelerating tensor primitives ☆39 Updated this week
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs) ☆39 Updated this week
- High Performance Linpack for Next-Generation AMD HPC Accelerators ☆48 Updated this week
- SYCL Benchmark Suite ☆64 Updated last month
- A tracing infrastructure for heterogeneous computing applications. ☆30 Updated this week
- Itoyori: A distributed multi-threading runtime system for global-view fork-join task parallelism ☆20 Updated last year
- CUDA Template Functions ☆19 Updated 3 months ago
- OpenMP Offloading Validation & Verification Suite; Official repository. We have migrated from bitbucket!! For documentation, results, pub… ☆58 Updated last week
- World championship code for Graph500 ☆25 Updated last year
- VEDA (VE Driver API) ☆17 Updated last month
- Distributed View Extension for Kokkos ☆45 Updated 3 months ago
- ☆44 Updated this week