noppoMan / python-metal-benchmark
An experimental repo for accessing Metal API from Python (OSX Only)
☆23Updated 4 years ago
Alternatives and similar repositories for python-metal-benchmark
Users that are interested in python-metal-benchmark are comparing it to the libraries listed below
Sorting:
- Study and Implementations of Numerical Algorithms on Apple M1 and A* Devices☆139Updated 2 years ago
- ☆52Updated 3 years ago
- C API for MLX☆108Updated 3 weeks ago
- Running linear algebra as fast as possible on Apple silicon☆20Updated last year
- Print all known information about the GPU on Apple-designed chips☆78Updated 8 months ago
- Swift for NNC☆75Updated this week
- Python + Apple Metal Framework = ?☆34Updated 6 years ago
- Metal Shading Language on Apple M1's GPU for scientific C++.☆93Updated last year
- 3X speedup over Apple’s TensorFlow plugin by using Apache TVM on M1☆136Updated 3 years ago
- Convert StableHLO models into Apple Core ML format☆19Updated 3 weeks ago
- mlx image models for Apple Silicon machines☆78Updated last month
- ☆51Updated 9 months ago
- Exploring the scalable matrix extension of the Apple M4 processor☆175Updated 6 months ago
- Swift for TensorFlow☆31Updated 2 years ago
- Benchmarking OpenBLAS on the Apple M1☆18Updated 4 years ago
- ModernBERT model optimized for Apple Neural Engine.☆25Updated 4 months ago
- Nod.ai 🦈 version of 👻 . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository …☆106Updated 4 months ago
- 1D, 2D, and 3D variations of Fast Fourier Transforms☆32Updated 3 years ago
- Emulating double-precision arithmetic on Apple GPUs☆49Updated 2 years ago
- CLI to demonstrate running a large language model (LLM) on Apple Neural Engine.☆103Updated 4 months ago
- A Python interface to the MAGMA libraries☆10Updated 8 years ago
- Python bindings for ggml☆140Updated 8 months ago
- Convert your ONNX models into Swift for TensorFlow or Metal Performance Shaders (WIP)☆24Updated 3 years ago
- FP64 equivalent GEMM via Int8 Tensor Cores using the Ozaki scheme☆63Updated last month
- LLM inference in Fortran☆58Updated 11 months ago
- Apple GPU microarchitecture☆520Updated 7 months ago
- ☆56Updated 2 years ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆52Updated last month
- ☆21Updated 2 months ago
- A fork of tinygrad made to work with sparse tensors. Sparse neural networks are here!☆11Updated 3 years ago