SX-Aurora / veda
VEDA (VE Driver API)
☆15 · Updated 2 months ago
Alternatives and similar repositories for veda:
Users who are interested in veda compare it to the libraries listed below.
- Another|Alternative|Awesome VE Offloading stack using ve-urpc ☆14 · Updated last year
- llvm-project cloned from https://github.com/llvm/llvm-project and modified for VE ☆17 · Updated last month
- This is the git repository for the RIKEN simulator, designed to simulate binary code for the Fujitsu A64FX. ☆34 · Updated 4 years ago
- SX-Aurora TSUBASA Vector Engine Operating System core ☆13 · Updated 2 months ago
- Python bindings for VE Offloading (VEO) for SX-Aurora Vector Engine ☆16 · Updated last year
- A unified framework across multiple programming platforms ☆33 · Updated 6 months ago
- A SYCL Implementation for CPU and SX-Aurora TSUBASA ☆50 · Updated last year
- Directed Acyclic Graph Execution Engine (DAGEE) is a C++ library that enables programmers to express computation and data movement, as ta… ☆45 · Updated 3 years ago
- PMIx Reference RunTime Environment (PRRTE) ☆36 · Updated this week
- ☆50 · Updated 4 years ago
- Base container for developing C++ and Fortran HPC applications ☆17 · Updated 2 years ago
- OpenSHMEM Application Programming Interface ☆51 · Updated 2 months ago
- Omni Compiler for C and Fortran programs with XcalableMP and OpenACC directives ☆61 · Updated last year
- Simplified Interface to Complex Memory ☆27 · Updated last year
- Data Dependence Analyzer in the Polyhedral Model ☆19 · Updated last year
- Bandwidth test for ROCm ☆52 · Updated this week
- RAJA Performance Suite ☆117 · Updated this week
- ASM generation tool for GAS/NASM/MASM with Xbyak-like syntax in Python ☆12 · Updated 3 weeks ago
- Lightweight thread library ☆65 · Updated 2 months ago
- Official BOLT Repository ☆28 · Updated 5 months ago
- This is a mirror of https://gitlab.inria.fr/starpu/starpu where our development happens, but contributions are welcome here too! ☆68 · Updated this week
- Copy-hiding array abstraction to automatically migrate data between memory spaces ☆106 · Updated this week
- Instrumentation framework to generate execution traces of the most used parallel runtimes. ☆66 · Updated 2 months ago
- ☆37 · Updated 2 months ago
- An HPL-AI implementation for Fugaku ☆19 · Updated 3 years ago
- Extended Roofline Model - LLVM source tree with additional libraries for the analysis of the dynamic execution in the interpreter ☆17 · Updated 7 years ago
- A framework that supports executing unmodified CUDA source code on non-NVIDIA devices. ☆112 · Updated 2 weeks ago
- Sandia OpenSHMEM is an implementation of the OpenSHMEM specification over multiple Networking APIs, including Portals 4, the Open Fabric … ☆62 · Updated last month
- Compiler-agnostic metaprogramming library providing concepts, type operations and tuples for C++ and CUDA ☆82 · Updated this week
- Logger for MPI communication ☆26 · Updated last year