noppoMan / python-metal-benchmark
An experimental repo for accessing Metal API from Python (OSX Only)
☆23 Updated 5 years ago
Alternatives and similar repositories for python-metal-benchmark
Users who are interested in python-metal-benchmark are comparing it to the libraries listed below.
- Study and Implementations of Numerical Algorithms on Apple M1 and A* Devices ☆149 Updated 3 years ago
- Emulating double-precision arithmetic on Apple GPUs ☆58 Updated 2 years ago
- A python library to run metal compute kernels on macOS ☆87 Updated last year
- FlashAttention (Metal Port) ☆579 Updated last year
- Playing with the Metal Performance Shaders matrix multiplication kernel ☆27 Updated 8 years ago
- ☆55 Updated last year
- Renderer for molecular nanotechnology ☆90 Updated 3 weeks ago
- Python + Apple Metal Framework = ? ☆34 Updated 7 years ago
- ☆54 Updated 4 years ago
- Print all known information about the GPU on Apple-designed chips ☆95 Updated 3 months ago
- 3X speedup over Apple’s TensorFlow plugin by using Apache TVM on M1 ☆137 Updated 3 years ago
- Running linear algebra as fast as possible on Apple silicon ☆28 Updated 2 years ago
- C API for MLX ☆172 Updated this week
- Convert StableHLO models into Apple Core ML format ☆21 Updated last week
- Metal Shading Language on Apple M1's GPU for scientific C++. ☆106 Updated 2 years ago
- Nod.ai 🦈 version of 👻 . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository … ☆107 Updated last month
- ☆29 Updated last year
- Apple GPU microarchitecture ☆578 Updated last year
- Swift for NNC ☆78 Updated last week
- Exploring the scalable matrix extension of the Apple M4 processor ☆220 Updated last year
- Software library for FDTD of viscoelastic equation using a staggered grid arrangement with support for GPU and CPU backends ☆58 Updated last week
- Scientific computing with Metal in C++: Matrix multiplication example ☆47 Updated 3 years ago
- Implementation of Karpathy's micrograd in Mojo ☆77 Updated 2 years ago
- Python bindings for ggml ☆147 Updated last year
- Training MLP on MNIST in 1.5 seconds with pure CUDA ☆46 Updated last year
- NABLA - Distributed Deep Learning ☆318 Updated last week
- Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera accelerator ☆215 Updated 2 years ago
- AutoBound automatically computes upper and lower bounds on functions. ☆364 Updated 3 months ago
- MLIR tools and dialect for GraphBLAS ☆18 Updated 3 years ago
- Deep Learning Primitives and Mini-Framework for OpenCL ☆205 Updated last year