tlkh / m1-cpu-benchmarks
☆50 Updated 3 years ago
Alternatives and similar repositories for m1-cpu-benchmarks:
Users who are interested in m1-cpu-benchmarks are comparing it to the repositories listed below
- ☆93 Updated last year
- TensorFlow Metal Backend on Apple Silicon Experiments (just for fun) ☆277 Updated 3 years ago
- An experimental repo for accessing Metal API from Python (OSX Only) ☆22 Updated 4 years ago
- Horizon chart for CPU/GPU/Neural Engine utilization monitoring on Apple M1/M2 and nVidia GPUs on Linux ☆25 Updated 3 weeks ago
- 3X speedup over Apple’s TensorFlow plugin by using Apache TVM on M1 ☆135 Updated 2 years ago
- ☆32 Updated last year
- Run transformers (incl. LLMs) on the Apple Neural Engine. ☆58 Updated last year
- ☆106 Updated 3 weeks ago
- benchmarking some transformer deployments ☆26 Updated 2 years ago
- Exploring the scalable matrix extension of the Apple M4 processor ☆168 Updated 4 months ago
- Print all known information about the GPU on Apple-designed chips ☆74 Updated 7 months ago
- SynapseAI Core is a reference implementation of the SynapseAI API running on Habana Gaudi ☆38 Updated last month
- Run a CoreML MLModel on the Asahi Neural Engine ☆47 Updated last year
- Nod.ai 🦈 version of 👻 . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository … ☆106 Updated 2 months ago
- Notes and artifacts from the ONNX steering committee ☆25 Updated this week
- Running linear algebra as fast as possible on Apple silicon ☆19 Updated last year
- An experimental CPU backend for Triton (https://github.com/openai/triton) ☆39 Updated last week
- oneCCL Bindings for Pytorch* ☆90 Updated last week
- Data Parallel Extension for NumPy ☆104 Updated this week
- Benchmarks to capture important workloads. ☆30 Updated last month
- Sudoless Asitop ☆56 Updated 8 months ago
- Research publication code for "Least Squares Binary Quantization of Neural Networks" ☆83 Updated 2 years ago
- TensorFlow-nGraph bridge ☆136 Updated 4 years ago
- NVIDIA's launch, startup, and logging scripts used by our MLPerf Training and HPC submissions ☆25 Updated last week
- ☆21 Updated 5 months ago
- GPUOcelot: A dynamic compilation framework for PTX ☆181 Updated last month
- See the device (CPU/GPU/ANE) and estimated cost for every layer in your CoreML model. ☆22 Updated 9 months ago
- ☆45 Updated last month
- Bandwidth test for ROCm ☆54 Updated last week
- Emulating double-precision arithmetic on Apple GPUs ☆49 Updated last year