tlkh / m1-cpu-benchmarksLinks
☆52Updated 3 years ago
Alternatives and similar repositories for m1-cpu-benchmarks
Users that are interested in m1-cpu-benchmarks are comparing it to the libraries listed below
Sorting:
- TensorFlow Metal Backend on Apple Silicon Experiments (just for fun)☆278Updated 3 years ago
- 3X speedup over Apple’s TensorFlow plugin by using Apache TVM on M1☆136Updated 3 years ago
- Nod.ai 🦈 version of 👻 . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository …☆106Updated 6 months ago
- Print all known information about the GPU on Apple-designed chips☆85Updated 10 months ago
- Emulating double-precision arithmetic on Apple GPUs☆55Updated 2 years ago
- ☆94Updated 2 years ago
- An experimental repo for accessing Metal API from Python (OSX Only)☆23Updated 5 years ago
- Running linear algebra as fast as possible on Apple silicon☆21Updated last year
- Study and Implementations of Numerical Algorithms on Apple M1 and A* Devices☆144Updated 2 years ago
- benchmarking some transformer deployments☆26Updated 2 years ago
- Exploring the scalable matrix extension of the Apple M4 processor☆187Updated 8 months ago
- Research publication code for "Least Squares Binary Quantization of Neural Networks"☆83Updated 2 years ago
- ☆27Updated 3 months ago
- Spying on Apple’s new predictive text model☆136Updated last year
- Run transformers (incl. LLMs) on the Apple Neural Engine.☆61Updated last year
- ☆113Updated this week
- A python library to run metal compute kernels on macOS☆80Updated 6 months ago
- C API for MLX☆118Updated last week
- Apple GPU microarchitecture☆532Updated 9 months ago
- MLX support for the Open Neural Network Exchange (ONNX)☆53Updated last year
- ☆60Updated last year
- Deep Learning Primitives and Mini-Framework for OpenCL☆199Updated 10 months ago
- Bandwidth test for ROCm☆60Updated last week
- FlashAttention (Metal Port)☆511Updated 9 months ago
- Repository of model demos using TT-Buda☆62Updated 3 months ago
- Benchmark of Apple MLX operations on all Apple Silicon chips (GPU, CPU) + MPS and CUDA.☆189Updated last month
- Training material for IPU users: tutorials, feature examples, simple applications☆86Updated 2 years ago
- Home for OctoML PyTorch Profiler☆113Updated 2 years ago
- AMD's graph optimization engine.☆230Updated this week
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆43Updated 4 months ago