noppoMan / python-metal-benchmarkLinks
An experimental repo for accessing Metal API from Python (OSX Only)
☆23Updated 5 years ago
Alternatives and similar repositories for python-metal-benchmark
Users that are interested in python-metal-benchmark are comparing it to the libraries listed below
Sorting:
- A python library to run metal compute kernels on macOS☆83Updated 8 months ago
- Study and Implementations of Numerical Algorithms on Apple M1 and A* Devices☆145Updated 2 years ago
- FlashAttention (Metal Port)☆534Updated last year
- Renderer for molecular nanotechnology☆71Updated this week
- Swift for NNC☆76Updated last week
- Print all known information about the GPU on Apple-designed chips☆88Updated last year
- Python + Apple Metal Framework = ?☆34Updated 6 years ago
- C API for MLX☆132Updated 2 weeks ago
- ☆52Updated 3 years ago
- ☆52Updated last year
- Playing with the Metal Performance Shaders matrix multiplication kernel☆25Updated 8 years ago
- Apple GPU microarchitecture☆550Updated last year
- Emulating double-precision arithmetic on Apple GPUs☆55Updated 2 years ago
- Implementation of Karpathy's micrograd in Mojo☆76Updated last year
- JAX-like Neural Network Training Library in Python with CPU/GPU Acceleration via Mojo and MAX☆279Updated 2 weeks ago
- ctypes wrappers for HIP, CUDA, and OpenCL☆130Updated last year
- Swift for TensorFlow☆31Updated 3 years ago
- Software library for FDTD of viscoelastic equation using a staggered grid arrangement with support for GPU and CPU backends☆55Updated last week
- Metal Shading Language on Apple M1's GPU for scientific C++.☆101Updated last year
- LLM inference in Fortran☆62Updated last year
- High-Performance SGEMM on CUDA devices☆101Updated 8 months ago
- Convert StableHLO models into Apple Core ML format☆19Updated last month
- 1D, 2D, and 3D variations of Fast Fourier Transforms☆34Updated 3 years ago
- Running linear algebra as fast as possible on Apple silicon☆21Updated 2 years ago
- Scientific computing with Metal in C++: Matrix multiplication example☆40Updated 3 years ago
- Benchmarking OpenBLAS on the Apple M1☆18Updated 4 years ago
- 3X speedup over Apple’s TensorFlow plugin by using Apache TVM on M1☆137Updated 3 years ago
- N dimensional array package for numeric computing in swift.☆25Updated last year
- FP64 equivalent GEMM via Int8 Tensor Cores using the Ozaki scheme☆87Updated 6 months ago
- Python bindings for ggml☆146Updated last year