noppoMan / python-metal-benchmark
An experimental repo for accessing the Metal API from Python (OSX only)
☆23 · Updated 5 years ago
Alternatives and similar repositories for python-metal-benchmark
Users who are interested in python-metal-benchmark compare it to the libraries listed below.
- A python library to run metal compute kernels on macOS ☆86 · Updated 11 months ago
- Study and Implementations of Numerical Algorithms on Apple M1 and A* Devices ☆149 · Updated 3 years ago
- Python + Apple Metal Framework = ? ☆34 · Updated 7 years ago
- Print all known information about the GPU on Apple-designed chips ☆95 · Updated 2 months ago
- Apple GPU microarchitecture ☆569 · Updated last year
- FlashAttention (Metal Port) ☆572 · Updated last year
- Playing with the Metal Performance Shaders matrix multiplication kernel ☆27 · Updated 8 years ago
- Emulating double-precision arithmetic on Apple GPUs ☆58 · Updated 2 years ago
- Renderer for molecular nanotechnology ☆88 · Updated last week
- 3X speedup over Apple’s TensorFlow plugin by using Apache TVM on M1 ☆137 · Updated 3 years ago
- Metal Shading Language on Apple M1's GPU for scientific C++. ☆106 · Updated 2 years ago
- C API for MLX ☆158 · Updated this week
- ☆55 · Updated last year
- Running linear algebra as fast as possible on Apple silicon ☆27 · Updated 2 years ago
- Atomistic Spin Simulation Framework ☆66 · Updated 5 years ago
- Implementation of Karpathy's micrograd in Mojo ☆78 · Updated 2 years ago
- Software library for FDTD of viscoelastic equation using a staggered grid arrangement with support for GPU and CPU backends ☆56 · Updated this week
- ☆54 · Updated 4 years ago
- LLM inference in Fortran ☆65 · Updated last year
- The Foundation for All Legate Libraries ☆233 · Updated this week
- Benchmark of Apple MLX operations on all Apple Silicon chips (GPU, CPU) + MPS and CUDA. ☆209 · Updated last week
- Swift for NNC ☆78 · Updated this week
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings. ☆55 · Updated last week
- Nod.ai 🦈 version of 👻 . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository … ☆107 · Updated 3 weeks ago
- Convert StableHLO models into Apple Core ML format ☆21 · Updated 3 weeks ago
- High-Performance SGEMM on CUDA devices ☆115 · Updated 11 months ago
- Exploring the scalable matrix extension of the Apple M4 processor ☆213 · Updated last year
- Machine Learning library for the emerging Mojo/Python ecosystem ☆305 · Updated this week
- Scientific computing with Metal in C++: Matrix multiplication example ☆45 · Updated 3 years ago
- ☆58 · Updated 2 years ago