andre-wojtowicz / blas-benchmarks
Timing results for BLAS (Basic Linear Algebra Subprograms) libraries in R
☆30 · Updated 7 years ago
Related projects:
- ulmBLAS ☆102 · Updated 2 years ago
- sparse matrix pre-processing library ☆81 · Updated 4 months ago
- xtensor plugin to read and write images, audio files, numpy (compressed) npz and HDF5 ☆85 · Updated 5 months ago
- Benchmark of expression templates libraries ☆39 · Updated 4 years ago
- Generating Families of Practical Fast Matrix Multiplication Algorithms ☆12 · Updated 7 years ago
- Flexible Library for Efficient Numerical Solutions ☆126 · Updated 2 years ago
- ☆41 · Updated 5 years ago
- Deep Learning With C++ ☆29 · Updated 6 years ago
- ☆30 · Updated 3 years ago
- Full-speed Array of Structures access ☆155 · Updated last year
- Range-based for loops to iterate over a range of numbers or values ☆35 · Updated 7 years ago
- summary page for Armadillo - https://arma.sourceforge.net ☆44 · Updated 2 years ago
- BLAS extension to xtensor ☆155 · Updated last month
- High-Performance Tensor Transpose library ☆183 · Updated last year
- a heterogeneous multiGPU level-3 BLAS library ☆45 · Updated 4 years ago
- Automatic Differentiation C++ Library ☆56 · Updated 3 years ago
- Library for fast image convolution in neural networks on Intel Architecture ☆29 · Updated 7 years ago
- Fork of magma to include more BLAS ☆28 · Updated 7 years ago
- Fast integer division with divisor not known at compile time. To be used primarily in CUDA kernels. ☆70 · Updated 8 years ago
- A portable high-level API with CUDA or OpenCL back-end ☆53 · Updated 6 years ago
- Vectorizable implementations of some mathematical functions ☆102 · Updated 4 years ago
- A Light-weight and Fast Template Matrix Library ☆131 · Updated 11 years ago
- Easy to use benchmarks for linear algebra frameworks ☆24 · Updated 4 years ago
- C++ library for numerical arrays and tensor objects and operations with them, designed to allow Matlab-style programming. ☆51 · Updated last year
- ☆34 · Updated this week
- 3D Tensors for Blaze (https://bitbucket.org/blaze-lib/blaze) ☆35 · Updated 3 years ago
- Some C++ codes for computing a 1D and 2D convolution product using the FFT implemented with the GSL or FFTW ☆57 · Updated 11 years ago
- Recursive LAPACK Collection ☆42 · Updated 2 years ago
- NumPy-compatible multidimensional arrays in C++ ☆160 · Updated last year
- Parallel network flows using OpenMP and CUDA. ☆27 · Updated 5 years ago