Running linear algebra as fast as possible on Apple silicon
☆29Aug 18, 2023Updated 2 years ago
Alternatives and similar repositories for amx-benchmarks
Users that are interested in amx-benchmarks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Swift package to retrieve realtime information on CPU energy consumption on Apple platforms using the CPU's Closed Loop Performance Contr…☆15Jun 15, 2024Updated last year
- Please note OpenFPM project structure change in version 5.0.0. For details refer to the main repo and website☆11Jan 19, 2026Updated 3 months ago
- Scientific Computing Benchmarks☆11Mar 13, 2020Updated 6 years ago
- BLIS fork with kernels for Apple M1. (Perhaps) The first open-source BLAS with Apple Matrix Coprocessor support.☆36Jan 7, 2023Updated 3 years ago
- OpenFPM: A scalable open framework for particle and particle-mesh codes on parallel computers☆23Apr 13, 2026Updated 2 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A little library for using SIMD instructions for x86 and ARM, wrapping Agner Fog's vectorclass for x86 and filling some of its functional…☆17Dec 10, 2021Updated 4 years ago
- GEMM☆10Aug 26, 2023Updated 2 years ago
- Apple AMX Instruction Set☆1,213Dec 26, 2024Updated last year
- Musings in GEMM (General Matrix Multiplication)☆14Dec 14, 2025Updated 4 months ago
- Benchmark resources☆15Sep 27, 2023Updated 2 years ago
- A very simple C# wrapper around a subset of the Embree ray tracing kernels.☆14Apr 10, 2025Updated last year
- Flat sorted array with very fast insert and erase operations☆18Sep 26, 2025Updated 7 months ago
- An "explicit control" Lisp interpreter written in assembly-like C☆12Dec 31, 2017Updated 8 years ago
- A Julia cluster manager for Kubernetes☆33Apr 13, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆18Jun 21, 2024Updated last year
- A code sample demonstrating how to share and rebuild a PyTorch GPU tensor via its pointer/reference between different processes.☆15Aug 27, 2024Updated last year
- Tensor Tiling Library☆40Sep 23, 2025Updated 7 months ago
- ☆13Apr 12, 2026Updated 2 weeks ago
- ☆12Jul 3, 2023Updated 2 years ago
- Robust Global Illumination in 99 lines of C++☆12Sep 19, 2019Updated 6 years ago
- Small autodiff lib and a simple working feedforward neural net in Haskell on top of it, from scratch, zero-deps.☆16Jun 21, 2024Updated last year
- PPC instruction tests☆11Jan 22, 2024Updated 2 years ago
- ☆12Jan 19, 2020Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆19Sep 5, 2019Updated 6 years ago
- easter egg is a flexible, high-performance e-graph library with support of multiple additional assumptions at once☆13Mar 27, 2025Updated last year
- XNU kernel symbol resolver(kernel extension)☆12Mar 1, 2019Updated 7 years ago
- A high-order accurate piecewise polynomial reconstruction library.☆12May 3, 2022Updated 3 years ago
- Distributed SDDMM Kernel☆12Jul 8, 2022Updated 3 years ago
- core WebGPU shaders☆16Aug 18, 2024Updated last year
- ☆15Jan 25, 2026Updated 3 months ago
- Description of Apple's LEAP ISA☆16Nov 21, 2022Updated 3 years ago
- BVH accelerated CPU and OpenGL ray-tracing☆19Mar 13, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Study and Implementations of Numerical Algorithms on Apple M1 and A* Devices☆152Dec 4, 2022Updated 3 years ago
- Tool for messing around with Apple GPU assembly☆27Jan 24, 2021Updated 5 years ago
- Range Optimized Adaptive Radix Tree☆25Feb 1, 2023Updated 3 years ago
- Distributed Communication-Optimal Shuffle and Transpose Algorithm☆14Apr 18, 2026Updated last week
- A tool designed to compare energy and emission costs between computer chips☆13Dec 9, 2023Updated 2 years ago
- Rust library for parsing a number of firmware images☆14Feb 22, 2023Updated 3 years ago
- A program that aims to measure the size of RAM an the characteristics of CPU Cache.☆13Nov 12, 2018Updated 7 years ago