springer13 / hptt
High-Performance Tensor Transpose library
☆195Updated last year
Alternatives and similar repositories for hptt:
Users that are interested in hptt are comparing it to the libraries listed below
- Cyclops Tensor Framework: parallel arithmetic on multidimensional arrays☆203Updated 9 months ago
- Tensor Contraction C++ Library☆52Updated 5 years ago
- A massively-parallel, block-sparse tensor framework written in C++☆287Updated this week
- Full-speed Array of Structures access☆169Updated 2 years ago
- CUDA and OpenMP implementations of C2R/R2C inplace transposition☆46Updated 10 years ago
- TBLIS is a library and framework for performing tensor operations, especially tensor contraction, using efficient native algorithms.☆121Updated last month
- CUDA Tensor Transpose (cuTT) library☆51Updated 7 years ago
- A C++ library for computing large scale tensor contractions.☆38Updated 6 years ago
- ulmBLAS☆106Updated 3 years ago
- Tensor Contraction Code Generator☆37Updated 7 years ago
- sparse matrix pre-processing library☆81Updated last year
- C++ library for numerical arrays and tensor objects and operations with them, designed to allow Matlab-style programming.☆52Updated last year
- TTC: A high-performance Compiler for Tensor Transpositions☆20Updated 7 years ago
- a heterogeneous multiGPU level-3 BLAS library☆45Updated 5 years ago
- ArrayFire's Machine Learning Library.☆104Updated 6 years ago
- Vector Math Library☆79Updated 8 years ago
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆261Updated 3 months ago
- BLAS extension to xtensor☆167Updated 3 weeks ago
- Simple utilities to enable code reuse and portability between CUDA C/C++ and standard C/C++.☆347Updated 3 years ago
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).☆533Updated last month
- Flexible Library for Efficient Numerical Solutions☆127Updated 3 years ago
- Fast automatic differentiation library in C++☆106Updated 3 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆135Updated 8 years ago
- CUSP : A C++ Templated Sparse Matrix Library☆412Updated 6 months ago
- Partitioned Global Address Space (PGAS) library for distributed arrays☆102Updated this week
- Fork of magma to include more BLAS☆28Updated 8 years ago
- Developer repository for ViennaCL. Visit http://viennacl.sourceforge.net/ for the latest releases.☆286Updated 3 years ago
- Data parallel C++ mathematical object library☆163Updated last week
- IPython / Jupyter integration for pybind11☆67Updated 7 years ago
- Automatic differentiation in C++; infinite differentiability of conditionals, loops, recursion and all things C++☆151Updated 6 years ago