andre-wojtowicz / blas-benchmarks
Timing results for BLAS (Basic Linear Algebra Subprograms) libraries in R
☆31 Updated 8 years ago
Alternatives and similar repositories for blas-benchmarks:
Users who are interested in blas-benchmarks compare it to the libraries listed below.
- Recursive LAPACK Collection ☆42 Updated 3 years ago
- xtensor plugin to read and write images, audio files, numpy (compressed) npz and HDF5 ☆85 Updated 11 months ago
- Generating Families of Practical Fast Matrix Multiplication Algorithms ☆12 Updated 7 years ago
- Full-speed Array of Structures access ☆164 Updated last year
- ☆41 Updated 6 years ago
- ulmBLAS ☆105 Updated 2 years ago
- Flexible Library for Efficient Numerical Solutions ☆127 Updated 3 years ago
- CUDA kernel author's tools ☆110 Updated 2 years ago
- ☆31 Updated 3 years ago
- Codebase associated with the PyTorch compiler tutorial ☆46 Updated 5 years ago
- Benchmark of expression templates libraries ☆40 Updated 4 years ago
- C++ multidimensional arrays in the spirit of the STL ☆200 Updated 2 months ago
- BLAS extension to xtensor ☆162 Updated 7 months ago
- npcomp - An aspirational MLIR based numpy compiler ☆51 Updated 4 years ago
- High-Performance Tensor Transpose library ☆190 Updated last year
- sparse matrix pre-processing library ☆81 Updated 10 months ago
- Fast integer division with divisor not known at compile time. To be used primarily in CUDA kernels. ☆70 Updated 9 years ago
- Easy to use benchmarks for linear algebra frameworks ☆24 Updated 4 years ago
- portDNN is a library implementing neural network algorithms written using SYCL ☆111 Updated 10 months ago
- Generate simple index ranges in C++ and CUDA C++ ☆39 Updated last year
- DLPack for Tensorflow ☆36 Updated 4 years ago
- summary page for Armadillo - https://arma.sourceforge.net ☆44 Updated 2 years ago
- Test winograd convolution written in TVM for CUDA and AMDGPU ☆40 Updated 6 years ago
- FlexiBLAS - A BLAS and LAPACK wrapper library with runtime exchangeable backends. This is a read-only mirror of https://gitlab.mpi-magdeb… ☆43 Updated 2 weeks ago
- A portable high-level API with CUDA or OpenCL back-end ☆54 Updated 7 years ago
- Some CUDA design patterns and a bit of template magic for CUDA ☆149 Updated last year
- Symbolic Expression and Statement Module for new DSLs ☆205 Updated 4 years ago
- Sparse matrix computation library for GPU ☆54 Updated 4 years ago
- NumPy-compatible multidimensional arrays in C++ ☆160 Updated 5 months ago
- Blazing-fast Expression Templates Library (ETL) with GPU support, in C++ ☆222 Updated last year