noppoMan / python-metal-benchmarkLinks

An experimental repo for accessing Metal API from Python (OSX Only)

☆23

Alternatives and similar repositories for python-metal-benchmark

Users that are interested in python-metal-benchmark are comparing it to the libraries listed below

Sorting:

ShoYamanishi / AppleNumericalComputing
Study and Implementations of Numerical Algorithms on Apple M1 and A* Devices
☆144Updated 2 years ago
philipturner / amx-benchmarks
Running linear algebra as fast as possible on Apple silicon
☆20Updated last year
wtnb75 / runmetal
Python + Apple Metal Framework = ?
☆34Updated 6 years ago
hollance / MPS-Matrix-Multiplication
Playing with the Metal Performance Shaders matrix multiplication kernel
☆25Updated 8 years ago
philipturner / metal-float64
Emulating double-precision arithmetic on Apple GPUs
☆54Updated 2 years ago
tzakharko / m4-sme-exploration
Exploring the scalable matrix extension of the Apple M4 processor
☆180Updated 7 months ago
s4tf / s4tf
Swift for TensorFlow
☆31Updated 2 years ago
iree-org / iree-jax
☆52Updated 10 months ago
tlkh / m1-cpu-benchmarks
☆52Updated 3 years ago
philipturner / applegpuinfo
Print all known information about the GPU on Apple-designed chips
☆81Updated 10 months ago
philipturner / molecular-renderer
Renderer for molecular nanotechnology
☆67Updated last week
octoml / Apple-M1-BERT
3X speedup over Apple’s TensorFlow plugin by using Apache TVM on M1
☆136Updated 3 years ago
kongzii / SwiftXGBoost
Swift wrapper for XGBoost gradient boosting machine learning framework with Numpy and TensorFlow support.
☆26Updated 4 years ago
ProteusMRIgHIFU / BabelViscoFDTD
Software library for FDTD of viscoelastic equation using a staggered grid arrangement with support for GPU and CPU backends
☆55Updated 3 months ago
larsgeb / m1-gpu-cpp
Metal Shading Language on Apple M1's GPU for scientific C++.
☆93Updated last year
philipturner / metal-fft
1D, 2D, and 3D variations of Fast Fourier Transforms
☆32Updated 3 years ago
kasper0406 / stablehlo-coreml
Convert StableHLO models into Apple Core ML format
☆19Updated 2 months ago
salykova / sgemm.cu
High-Performance SGEMM on CUDA devices
☆96Updated 5 months ago
bkvogel / metal_performance_testing
Scientific computing with Metal in C++: Matrix multiplication example
☆31Updated 2 years ago
ml-explore / mlx-c
C API for MLX
☆115Updated 2 months ago
philipturner / metal-flash-attention
FlashAttention (Metal Port)
☆497Updated 9 months ago
philipturner / metal-benchmarks
Apple GPU microarchitecture
☆527Updated 9 months ago
eiln / anecc
Run a CoreML MLModel on the Asahi Neural Engine
☆54Updated 2 years ago
danielchalef / openblas-benchmark-m1
Benchmarking OpenBLAS on the Apple M1
☆18Updated 4 years ago
geohot / ctypeslib
Generate python ctypes classes from C headers. Requires LLVM clang
☆14Updated 10 months ago
liuliu / s4nnc
Swift for NNC
☆75Updated 2 weeks ago
RobertRiachi / ANE-Optimized-Whisper-OpenAI
☆55Updated 2 years ago
scalable-analyses / sme
☆26Updated 2 months ago
google-research / structured-additive-IR
☆61Updated this week
nod-ai / SRT
Nod.ai 🦈 version of 👻 . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository …
☆106Updated 5 months ago