andre-wojtowicz / blas-benchmarks
Timing results for BLAS (Basic Linear Algebra Subprograms) libraries in R
☆31 Updated 8 years ago
Alternatives and similar repositories for blas-benchmarks
Users who are interested in blas-benchmarks compare it to the libraries listed below.
- Flexible Library for Efficient Numerical Solutions ☆127 Updated 2 weeks ago
- BLAS extension to xtensor ☆167 Updated 2 months ago
- xtensor plugin to read and write images, audio files, numpy (compressed) npz and HDF5 ☆86 Updated last year
- Benchmark of expression templates libraries ☆40 Updated 5 years ago
- sparse matrix pre-processing library ☆82 Updated last year
- ☆31 Updated 4 years ago
- ☆41 Updated 6 years ago
- summary page for Armadillo - https://arma.sourceforge.net ☆45 Updated 2 years ago
- portDNN is a library implementing neural network algorithms written using SYCL ☆113 Updated last year
- ulmBLAS ☆107 Updated 2 weeks ago
- Full-speed Array of Structures access ☆171 Updated 2 years ago
- Fast integer division with divisor not known at compile time. To be used primarily in CUDA kernels. ☆71 Updated 9 years ago
- High-Performance Tensor Transpose library ☆199 Updated 2 years ago
- Fast matrix multiplication ☆29 Updated 3 years ago
- Generating Families of Practical Fast Matrix Multiplication Algorithms ☆12 Updated 7 years ago
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement. ☆261 Updated 5 months ago
- nGraph™ Backend for ONNX ☆42 Updated 2 years ago
- TTC: A high-performance Compiler for Tensor Transpositions ☆20 Updated 7 years ago
- A portable high-level API with CUDA or OpenCL back-end ☆54 Updated 7 years ago
- Recursive LAPACK Collection ☆42 Updated 3 years ago
- Distributed NMF/NTF Library ☆46 Updated 6 months ago
- Generate simple index ranges in C++ and CUDA C++ ☆39 Updated 2 years ago
- A fast tensor library for c++. ☆11 Updated 9 years ago
- ☆14 Updated 2 years ago
- Generalized Histograms for CUDA-capable GPUs ☆42 Updated 9 years ago
- NumPy-compatible multidimensional arrays in C++ ☆161 Updated 8 months ago
- Range-based for loops to iterate over a range of numbers or values ☆35 Updated 8 years ago
- Implementation of the SYCL specification. ☆66 Updated last year
- Automatic Differentiation C++ Library ☆57 Updated 4 years ago
- Some CUDA design patterns and a bit of template magic for CUDA ☆154 Updated 2 years ago