noppoMan / python-metal-benchmarkLinks
An experimental repo for accessing Metal API from Python (OSX Only)
☆23Updated 5 years ago
Alternatives and similar repositories for python-metal-benchmark
Users that are interested in python-metal-benchmark are comparing it to the libraries listed below
Sorting:
- A python library to run metal compute kernels on macOS☆80Updated 6 months ago
- Study and Implementations of Numerical Algorithms on Apple M1 and A* Devices☆144Updated 2 years ago
- C API for MLX☆121Updated last month
- Python + Apple Metal Framework = ?☆34Updated 6 years ago
- FlashAttention (Metal Port)☆514Updated 10 months ago
- Renderer for molecular nanotechnology☆69Updated last month
- ☆52Updated last year
- Playing with the Metal Performance Shaders matrix multiplication kernel☆25Updated 8 years ago
- Convert StableHLO models into Apple Core ML format☆19Updated 2 weeks ago
- Nod.ai 🦈 version of 👻 . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository …☆106Updated 7 months ago
- 3X speedup over Apple’s TensorFlow plugin by using Apache TVM on M1☆136Updated 3 years ago
- Swift for NNC☆76Updated last week
- Print all known information about the GPU on Apple-designed chips☆86Updated 11 months ago
- ☆52Updated 3 years ago
- Running linear algebra as fast as possible on Apple silicon☆21Updated last year
- High-Performance SGEMM on CUDA devices☆98Updated 6 months ago
- Apple GPU microarchitecture☆540Updated 10 months ago
- Python bindings for ggml☆143Updated 11 months ago
- Emulating double-precision arithmetic on Apple GPUs☆55Updated 2 years ago
- Metal Shading Language on Apple M1's GPU for scientific C++.☆96Updated last year
- A stand-alone implementation of several NumPy dtype extensions used in machine learning.☆286Updated last week
- The Foundation for All Legate Libraries☆221Updated this week
- Exploring the scalable matrix extension of the Apple M4 processor☆194Updated 9 months ago
- Graph Neural Network library made for Apple Silicon☆193Updated this week
- Software library for FDTD of viscoelastic equation using a staggered grid arrangement with support for GPU and CPU backends☆55Updated 5 months ago
- LLM training in simple, raw C/CUDA☆103Updated last year
- Efficient framework-agnostic data loading☆430Updated 2 months ago
- A thin, highly portable toolkit for efficiently compiling dense loop-based computation.☆148Updated 2 years ago
- Composable Function Transformations in Python with Mojo/MAX acceleration☆270Updated this week
- Swift for TensorFlow☆31Updated 3 years ago