flame / blis
BLAS-like Library Instantiation Software Framework
☆2,378Updated this week
Alternatives and similar repositories for blis:
Users that are interested in blis are comparing it to the libraries listed below
- Library for specialized dense and sparse matrix operations, and deep learning primitives.☆864Updated this week
- The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs☆1,279Updated 10 months ago
- Patterns and behaviors for GPU computing☆1,699Updated 2 years ago
- Kokkos C++ Performance Portability Programming Ecosystem: The Programming Model - Parallel Execution and Memory Abstraction☆2,109Updated this week
- SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT☆697Updated this week
- CUDA Core Compute Libraries☆1,468Updated this week
- Tuned OpenCL BLAS☆1,084Updated 3 months ago
- LAPACK development repository☆1,583Updated 2 weeks ago
- Thin, unified, C++-flavored wrappers for the CUDA APIs☆821Updated this week
- oneAPI Math Library (oneMath)☆645Updated 3 weeks ago
- A lightweight high performance tensor algebra framework for modern C++☆771Updated 10 months ago
- C++ tensors with broadcasting and lazy computing☆3,444Updated this week
- The official SuiteSparse library: a suite of sparse matrix algorithms authored or co-authored by Tim Davis, Texas A&M University.☆1,237Updated this week
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl☆1,721Updated last year
- OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.☆6,584Updated this week
- Intel® Implicit SPMD Program Compiler☆2,593Updated this week
- ArrayFire: a general purpose GPU library.☆4,629Updated this week
- [ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl☆2,297Updated last year
- A header-only C++ library for numerical optimization --☆766Updated last month
- High-performance object-based library for DLA computations☆239Updated 9 months ago
- ☆1,813Updated last year
- Assembler for NVIDIA Maxwell architecture☆968Updated 2 years ago
- a software library containing BLAS functions written in OpenCL☆851Updated 6 months ago
- The Legion Parallel Programming System☆707Updated last month
- High-performance automatic differentiation of LLVM and MLIR.☆1,338Updated this week
- An efficient C++17 GPU numerical computing library with Python-like syntax☆1,262Updated this week
- Implementation of SYCL and C++ standard parallelism for CPUs and GPUs from all vendors: The independent, community-driven compiler for C+…☆1,510Updated this week
- A code generator for array-based code on CPUs and GPUs☆597Updated this week
- This is a set of simple programs that can be used to explore the features of a parallel platform.☆420Updated 2 months ago
- C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE))☆2,309Updated this week