andre-wojtowicz / blas-benchmarks
Timing results for BLAS (Basic Linear Algebra Subprograms) libraries in R
☆31Updated 8 years ago
Alternatives and similar repositories for blas-benchmarks:
Users that are interested in blas-benchmarks are comparing it to the libraries listed below
- Generating Families of Practical Fast Matrix Multiplication Algorithms☆12Updated 7 years ago
- sparse matrix pre-processing library☆81Updated 8 months ago
- ☆31Updated 3 years ago
- ulmBLAS☆104Updated 2 years ago
- Generate simple index ranges in C++ and CUDA C++☆39Updated last year
- Fast matrix multiplication☆29Updated 3 years ago
- Codebase associated with the PyTorch compiler tutorial☆44Updated 5 years ago
- Recursive LAPACK Collection☆42Updated 2 years ago
- Flexible Library for Efficient Numerical Solutions☆127Updated 3 years ago
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆93Updated 3 years ago
- Kernel Tuning Toolkit☆56Updated 2 months ago
- xtensor plugin to read and write images, audio files, numpy (compressed) npz and HDF5☆84Updated 9 months ago
- Generalized Histograms for CUDA-capable GPUs☆43Updated 9 years ago
- Full-speed Array of Structures access☆164Updated last year
- Benchmark of expression templates libraries☆40Updated 4 years ago
- A library to benchmark CUDA code, similar to google benchmark.☆28Updated 3 years ago
- Easy to use benchmarks for linear algebra frameworks☆24Updated 4 years ago
- BLAS extension to xtensor☆159Updated 5 months ago
- Polymorphic multidimensional array view☆36Updated 4 years ago
- Test winograd convolution written in TVM for CUDA and AMDGPU☆40Updated 6 years ago
- SYCL-ML is a C++ library, implementing classical machine learning algorithms using SYCL.☆66Updated 5 years ago
- portDNN is a library implementing neural network algorithms written using SYCL☆109Updated 8 months ago
- A portable high-level API with CUDA or OpenCL back-end☆54Updated 7 years ago
- C++ library for numerical arrays and tensor objects and operations with them, designed to allow Matlab-style programming.☆52Updated last year
- Range-based for loops to iterate over a range of numbers or values☆35Updated 8 years ago
- Blazing-fast Expression Templates Library (ETL) with GPU support, in C++☆221Updated last year
- Portable 128-bit SIMD intrinsics☆57Updated last year
- High-Performance Tensor Transpose library☆190Updated last year
- ☆67Updated 2 years ago
- C++ multidimensional arrays in the spirit of the STL☆199Updated 3 weeks ago