SX-Aurora / py-veo
Python bindings for VE Offloading (VEO) for SX-Aurora Vector Engine
☆16 Updated last year
Alternatives and similar repositories for py-veo:
Users who are interested in py-veo often compare it to the libraries listed below.
- Another|Alternative|Awesome VE Offloading stack using ve-urpc ☆14 Updated last year
- VEDA (VE Driver API) ☆17 Updated last month
- Library of High Precision Sparse Matrix Operations Accelerated by SIMD ☆42 Updated 3 years ago
- NLCPy : NumPy-like API accelerated with SX-Aurora TSUBASA ☆15 Updated last year
- Dynamic execution environments for coupled, thread-heterogeneous MPI+X applications ☆21 Updated 3 weeks ago
- Omni Compiler for C and Fortran programs with XcalableMP and OpenACC directives ☆61 Updated last year
- Base container for developing C++ and Fortran HPC applications ☆18 Updated 2 years ago
- A SYCL Implementation for CPU and SX-Aurora TSUBASA ☆52 Updated 2 years ago
- SX-Aurora TSUBASA Vector Engine Operating System core ☆13 Updated 3 weeks ago
- This repository mirrors the principal Gitlab repository of the Chebyshev Accelerated Subspace iteration Eigensolver. If you want to contr… ☆16 Updated last month
- A BUDE virtual-screening benchmark, in many programming models ☆27 Updated 5 months ago
- Department of Energy Standard Utility Library ☆31 Updated 3 weeks ago
- Autonomic Performance Environment for eXascale (APEX) ☆44 Updated this week
- Linux Cross-Memory Attach ☆17 Updated 4 months ago
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support ☆51 Updated 2 weeks ago
- Distributed-memory, arbitrary-precision, dense and sparse-direct linear algebra, conic optimization, and lattice reduction ☆66 Updated last week
- TTG: Template Task Graph C++ API ☆19 Updated last month
- Performance engineering for the rest of us. ☆30 Updated last year
- PMIx Reference RunTime Environment (PRRTE) ☆37 Updated this week
- Comb is a communication performance benchmarking tool. ☆24 Updated 2 years ago
- Basic Tensor Algebra Subroutines ☆47 Updated last week
- An MPI ABI compatibility layer ☆32 Updated 2 weeks ago
- An implementation of ARMCI using MPI one-sided communication (RMA) ☆14 Updated 5 months ago
- QMCPACK miniapp: a simplified real space QMC code for algorithm development, performance portability testing, and computer science experi… ☆27 Updated 8 months ago
- Double precision SIMD-oriented Fast Mersenne Twister ☆39 Updated 2 years ago
- ☆34 Updated 4 years ago
- High-level framework for stencil computations ☆40 Updated 9 years ago
- instruction-bench ☆36 Updated 2 years ago
- Partitioned Global Address Space (PGAS) library for distributed arrays ☆101 Updated this week
- ReMPI (MPI Record-and-Replay) ☆39 Updated 9 months ago