sphericalcylinder / MetalComputeLinks
A C++ wrapper for the Apple metal-cpp library to make it easier to run compute kernels on the GPU
☆10Updated 7 months ago
Alternatives and similar repositories for MetalCompute
Users that are interested in MetalCompute are comparing it to the libraries listed below
Sorting:
- Emulating double-precision arithmetic on Apple GPUs☆58Updated 2 years ago
- a verbose example on using metal with C++ to perform arbitrary compute on GPUs☆16Updated 4 months ago
- Scientific computing with Metal in C++: Matrix multiplication example☆46Updated 3 years ago
- A python library to run metal compute kernels on macOS☆87Updated last year
- Metal Shading Language on Apple M1's GPU for scientific C++.☆106Updated 2 years ago
- portFFT is a library implementing Fast Fourier Transforms using SYCL☆19Updated 11 months ago
- Study and Implementations of Numerical Algorithms on Apple M1 and A* Devices☆149Updated 3 years ago
- Running linear algebra as fast as possible on Apple silicon☆28Updated 2 years ago
- Software library for FDTD of viscoelastic equation using a staggered grid arrangement with support for GPU and CPU backends☆58Updated last week
- SYCL for Vitis: Experimental fusion of triSYCL with Intel SYCL oneAPI DPC++ up-streaming effort into Clang/LLVM☆124Updated last year
- A small C OpenCL wrapper☆17Updated 8 years ago
- C++ HPC Tutorial materials☆54Updated 3 months ago
- Synchronous, single-threaded, library-only SYCL implementation for debugging and verification.☆38Updated 4 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆198Updated this week
- A header only library implementing common mathematical functions using SIMD intrinsics☆114Updated 4 months ago
- BLIS fork with kernels for Apple M1. (Perhaps) The first open-source BLAS with Apple Matrix Coprocessor support.☆36Updated 3 years ago
- my bookmarks☆55Updated last year
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆97Updated last month
- SYCL Benchmark Suite☆67Updated 7 months ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆57Updated 10 months ago
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆95Updated 4 years ago
- An Awesome list of oneAPI projects☆158Updated 5 months ago
- A collection of samples written using the SYCL standard for C++.☆25Updated last month
- Counter-based random number generators for C, C++ and CUDA.☆112Updated last year
- A Collection of Articles and other OpenCL Papers☆60Updated 6 years ago
- BLAS implementation for Intel FPGA☆78Updated 5 years ago
- CMake modules to support compiling Apple Metal shaders as part of a CMake build system.☆22Updated 8 months ago
- AMD’s C++ library for accelerating tensor primitives☆48Updated last week
- An implementation of HIP that works on CPUs, across OSes.☆131Updated last year
- nvptx-tools: a collection of tools for use with nvptx-none GCC toolchains.☆51Updated last year