michael-lehn / ulmBLAS
ulmBLAS
☆104 · Updated 2 years ago
Related projects
Alternatives and complementary repositories for ulmBLAS
- sparse matrix pre-processing library ☆81 · Updated 6 months ago
- CUDA and OpenMP implementations of C2R/R2C inplace transposition ☆45 · Updated 9 years ago
- High-Performance Tensor Transpose library ☆185 · Updated last year
- ☆11 · Updated 8 years ago
- CUDA Tensor Transpose (cuTT) library ☆50 · Updated 7 years ago
- Full-speed Array of Structures access ☆162 · Updated last year
- CUSP : A C++ Templated Sparse Matrix Library ☆404 · Updated 2 weeks ago
- Sympiler is a Code Generator for Transforming Sparse Matrix Codes ☆42 · Updated last year
- a heterogeneous multiGPU level-3 BLAS library ☆45 · Updated 4 years ago
- A massively-parallel, block-sparse tensor framework written in C++ ☆260 · Updated this week
- An implementation of BLAS using the SYCL open standard. ☆259 · Updated 3 weeks ago
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH) ☆102 · Updated last year
- Tensor Contraction Code Generator ☆36 · Updated 7 years ago
- Python wrapper for isl, an integer set library ☆73 · Updated last week
- Julia ports of the Rodinia benchmark suite for heterogeneous computing infrastructures ☆48 · Updated last year
- ☆132 · Updated last year
- A Sound and Complete Verification Tool for Warp-Specialized GPU Kernels ☆18 · Updated 9 years ago
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm ☆196 · Updated 2 weeks ago
- Kernel Tuning Toolkit ☆55 · Updated 3 weeks ago
- a software library containing Sparse functions written in OpenCL ☆173 · Updated 4 years ago
- Distributed-memory, arbitrary-precision, dense and sparse-direct linear algebra, conic optimization, and lattice reduction ☆65 · Updated last month
- Library to plot integer sets and maps ☆47 · Updated 7 years ago
- Recursive LAPACK Collection ☆42 · Updated 2 years ago
- Polyhedral Parallel Code Generation (source repository: http://repo.or.cz/ppcg.git) ☆117 · Updated 2 years ago
- High-performance object-based library for DLA computations ☆235 · Updated 6 months ago
- Cyclops Tensor Framework: parallel arithmetic on multidimensional arrays ☆201 · Updated 3 months ago
- Fast matrix multiplication ☆28 · Updated 3 years ago
- Automatically Tuned Linear Algebra Software (ATLAS) ☆174 · Updated 4 years ago
- This is a set of simple programs that can be used to explore the features of a parallel platform. ☆415 · Updated last week
- Autonomic Performance Environment for eXascale (APEX) ☆38 · Updated 3 weeks ago