SX-Aurora / py-veo
Python bindings for VE Offloading (VEO) for SX-Aurora Vector Engine
☆16Updated last year
Alternatives and similar repositories for py-veo:
Users that are interested in py-veo are comparing it to the libraries listed below
- VEDA (VE Driver API)☆17Updated 2 months ago
- Base container for developing C++ and Fortran HPC applications☆18Updated 2 years ago
- Another|Alternative|Awesome VE Offloading stack using ve-urpc☆14Updated last year
- Omni Compiler for C and Fortran programs with XcalableMP and OpenACC directives☆61Updated last year
- Library of High Precision Sparse Matrix Operations Accelerated by SIMD☆42Updated 3 years ago
- NLCPy : NumPy-like API accelerated with SX-Aurora TSUBASA☆15Updated last year
- QCD for Intel Xeon Phi and Xeon processors☆14Updated last year
- ☆34Updated 5 years ago
- Double precision SIMD-oriented Fast Mersenne Twister☆39Updated 2 years ago
- This repository mirrors the principal Gitlab repository of the Chebyshev Accelerated Subspace iteration Eigensolver. If you want to contr…☆17Updated last week
- A BUDE virtual-screening benchmark, in many programming models☆28Updated 6 months ago
- Mersenne Twister for Graphic Processors☆9Updated 3 years ago
- Yaksa: High-performance Noncontiguous Data Management☆13Updated 7 months ago
- P3DFFT stands for Parallel Three-Dimensional Fast Fourier Transforms. It is a library for large-scale computer simulations on parallel pl…☆58Updated 2 years ago
- Parallel fast Fourier transforms☆55Updated 6 years ago
- ☆51Updated 4 years ago
- An MPI ABI compatibility layer☆32Updated last month
- ☆39Updated this week
- floating-point errors checker☆56Updated last week
- Distributed-memory, arbitrary-precision, dense and sparse-direct linear algebra, conic optimization, and lattice reduction☆67Updated last month
- Fortran interfaces for ROCm libraries☆75Updated this week
- Dynamic execution environments for coupled, thread-heterogeneous MPI+X applications☆21Updated last month
- QMCPACK miniapp: a simplified real space QMC code for algorithm development, performance portability testing, and computer science experi…☆27Updated 9 months ago
- Multiple-precision GPU accelerated linear algebra routines (dense and sparse) based on residue number system☆17Updated 2 years ago
- Basic Tensor Algebra Subroutines☆48Updated this week
- A SYCL Implementation for CPU and SX-Aurora TSUBASA☆52Updated 2 years ago
- The MPLAPACK: multiple precision version of BLAS and LAPACK☆92Updated 10 months ago
- OpenMP Offloading Validation & Verification Suite; Official repository. We have migrated from bitbucket!! For documentation, results, pub…☆58Updated last week
- OCCA Python API: JIT Compilation for Multiple Architectures☆11Updated 5 years ago
- instruction-bench☆36Updated 2 years ago