halide / Halide
a language for fast, portable data-parallel computation
☆5,947Updated this week
Alternatives and similar repositories for Halide:
Users that are interested in Halide are comparing it to the libraries listed below
- ArrayFire: a general purpose GPU library.☆4,602Updated this week
- [ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl☆4,946Updated 11 months ago
- A retargetable MLIR-based machine learning compiler and runtime toolkit.☆2,929Updated this week
- "Multi-Level Intermediate Representation" Compiler Infrastructure☆1,735Updated 3 years ago
- HIP: C++ Heterogeneous-Compute Interface for Portability☆3,834Updated this week
- header only, dependency-free deep learning framework in C++14☆5,873Updated 2 years ago
- oneAPI Deep Neural Network Library (oneDNN)☆3,677Updated this week
- Abseil Common Libraries (C++)☆15,311Updated this week
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆11,930Updated last month
- Intel® Implicit SPMD Program Compiler☆2,570Updated this week
- [ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl☆2,296Updated 11 months ago
- Patterns and behaviors for GPU computing☆1,690Updated 2 years ago
- Compiler for Neural Network hardware accelerators☆3,257Updated 8 months ago
- C++ image processing and machine learning library with using of SIMD: SSE, AVX, AVX-512, AMX for x86/x64, NEON for ARM.☆2,100Updated this week
- A C++ GPU Computing Library for OpenCL☆1,570Updated last month
- OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.☆6,519Updated this week
- High-efficiency floating-point neural network inference operators for mobile, server, and Web☆1,924Updated this week
- oneAPI Threading Building Blocks (oneTBB)☆5,870Updated this week
- The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologi…☆2,899Updated 3 weeks ago
- C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE))☆2,284Updated this week
- The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs☆1,267Updated 9 months ago
- A superoptimizer for LLVM IR☆2,195Updated 4 months ago
- nGraph has moved to OpenVINO☆1,350Updated 4 years ago
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl☆1,690Updated last year
- SWIG is a software development tool that connects programs written in C and C++ with a variety of high-level programming languages.☆5,865Updated last month
- ☆1,656Updated 6 years ago
- Conan - The open-source C and C++ package manager☆8,438Updated this week
- Main gperftools repository☆8,537Updated last month
- A domain specific language to express machine learning workloads.☆1,759Updated last year
- AddressSanitizer, ThreadSanitizer, MemorySanitizer☆11,679Updated last week