tlkh / m1-cpu-benchmarksLinks

☆52

Alternatives and similar repositories for m1-cpu-benchmarks

Users that are interested in m1-cpu-benchmarks are comparing it to the libraries listed below

Sorting:

tlkh / tf-metal-experiments
TensorFlow Metal Backend on Apple Silicon Experiments (just for fun)
☆278Updated 3 years ago
octoml / Apple-M1-BERT
3X speedup over Apple’s TensorFlow plugin by using Apache TVM on M1
☆136Updated 3 years ago
nod-ai / SRT
Nod.ai 🦈 version of 👻 . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository …
☆106Updated 6 months ago
philipturner / applegpuinfo
Print all known information about the GPU on Apple-designed chips
☆85Updated 10 months ago
philipturner / metal-float64
Emulating double-precision arithmetic on Apple GPUs
☆55Updated 2 years ago
woolfel / ml-macos-performance
☆94Updated 2 years ago
noppoMan / python-metal-benchmark
An experimental repo for accessing Metal API from Python (OSX Only)
☆23Updated 5 years ago
philipturner / amx-benchmarks
Running linear algebra as fast as possible on Apple silicon
☆21Updated last year
ShoYamanishi / AppleNumericalComputing
Study and Implementations of Numerical Algorithms on Apple M1 and A* Devices
☆144Updated 2 years ago
nod-ai / transformer-benchmarks
benchmarking some transformer deployments
☆26Updated 2 years ago
tzakharko / m4-sme-exploration
Exploring the scalable matrix extension of the Apple M4 processor
☆187Updated 8 months ago
apple / ml-quant
Research publication code for "Least Squares Binary Quantization of Neural Networks"
☆83Updated 2 years ago
scalable-analyses / sme
☆27Updated 3 months ago
jackcook / predictive-spy
Spying on Apple’s new predictive text model
☆136Updated last year
smpanaro / more-ane-transformers
Run transformers (incl. LLMs) on the Apple Neural Engine.
☆61Updated last year
amd / ZenDNN
☆113Updated this week
baldand / py-metal-compute
A python library to run metal compute kernels on macOS
☆80Updated 6 months ago
ml-explore / mlx-c
C API for MLX
☆118Updated last week
philipturner / metal-benchmarks
Apple GPU microarchitecture
☆532Updated 9 months ago
ml-explore / mlx-onnx
MLX support for the Open Neural Network Exchange (ONNX)
☆53Updated last year
amd / UIF
☆60Updated last year
artyom-beilis / dlprimitives
Deep Learning Primitives and Mini-Framework for OpenCL
☆199Updated 10 months ago
ROCm / rocm_bandwidth_test
Bandwidth test for ROCm
☆60Updated last week
philipturner / metal-flash-attention
FlashAttention (Metal Port)
☆511Updated 9 months ago
tenstorrent / tt-buda-demos
Repository of model demos using TT-Buda
☆62Updated 3 months ago
TristanBilot / mlx-benchmark
Benchmark of Apple MLX operations on all Apple Silicon chips (GPU, CPU) + MPS and CUDA.
☆189Updated last month
graphcore / tutorials
Training material for IPU users: tutorials, feature examples, simple applications
☆86Updated 2 years ago
octoml / octoml-profile
Home for OctoML PyTorch Profiler
☆113Updated 2 years ago
ROCm / AMDMIGraphX
AMD's graph optimization engine.
☆230Updated this week
pytorch-labs / triton-cpu
An experimental CPU backend for Triton (https//github.com/openai/triton)
☆43Updated 4 months ago