Exploring the scalable matrix extension of the Apple M4 processor
☆231Nov 7, 2024Updated last year
Alternatives and similar repositories for m4-sme-exploration
Users that are interested in m4-sme-exploration are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆35Mar 31, 2025Updated last year
- Apple AMX Instruction Set☆1,237Dec 26, 2024Updated last year
- CPU micro benchmarks☆83Jun 19, 2026Updated last week
- Supplemental materials for The ASPLOS 2025 / EuroSys 2025 Contest on Intra-Operator Parallelism for Distributed Deep Learning☆25May 12, 2025Updated last year
- ☆10Apr 24, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Utility to sign DXIL code after compilation☆22Feb 18, 2019Updated 7 years ago
- Apple GPU microarchitecture☆617Sep 22, 2024Updated last year
- ☆54Oct 31, 2021Updated 4 years ago
- Stub for polymorphic code☆11Mar 18, 2023Updated 3 years ago
- Rosetta2 AVX Implementation Deep Dive☆16Apr 4, 2025Updated last year
- BLIS fork with kernels for Apple M1. (Perhaps) The first open-source BLAS with Apple Matrix Coprocessor support.☆36Jan 7, 2023Updated 3 years ago
- Everything we actually know about the Apple Neural Engine (ANE)☆2,486Mar 12, 2026Updated 3 months ago
- Investigation into replacing the MES compiler☆35Jun 15, 2026Updated 2 weeks ago
- Apple Firestorm/Icestorm CPU microarchitecture docs☆261Jul 13, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- FP64 equivalent GEMM by the Ozaki scheme with Int8 Tensor Cores☆122Dec 2, 2025Updated 6 months ago
- ☆82Oct 29, 2024Updated last year
- ☆14Aug 4, 2021Updated 4 years ago
- Note taken while toying with VisionFive 2☆12Feb 16, 2023Updated 3 years ago
- An experimental IPC interface definition language for Hubris.☆28May 5, 2026Updated last month
- An easy-to-use and fast library for task-based parallelism, utilizing coroutines.☆336Sep 13, 2024Updated last year
- x64 assembler library in C☆26Oct 5, 2020Updated 5 years ago
- Library for specialized dense and sparse matrix operations, and deep learning primitives.☆965Updated this week
- Running linear algebra as fast as possible on Apple silicon☆30Aug 18, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Interactive GUI Snowfall Simulation Created in C & Raylib☆23Dec 24, 2025Updated 6 months ago
- ☆11Sep 11, 2023Updated 2 years ago
- Nvidia Instruction Set Specification Generator☆340Jul 9, 2024Updated last year
- experimental cooperative threading library for embedded ARM in pure C☆20Aug 18, 2021Updated 4 years ago
- The University of Bristol HPC Simulation Engine☆109Jun 17, 2026Updated last week
- x86-64, ARM, and RVV intrinsics viewer☆81Jun 18, 2026Updated last week
- GEMMul8 (GEMMulate): GEMM emulation and its extension to BLAS-like matrix operations using INT8/FP8 matrix engines based on the Ozaki Sch…☆81Updated this week
- ☆12Dec 1, 2023Updated 2 years ago
- Pass Rust strings to C with potentially not needing heap allocation☆13Jan 25, 2026Updated 5 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- An LLVM IR dataset for data-driven compiler optimization research☆80Mar 17, 2026Updated 3 months ago
- ☆15Dec 4, 2024Updated last year
- Zero allocation macros for retrieving multiple mutable indices from a mutable slice safely.☆16Jul 21, 2024Updated last year
- Syntax highlight code embedded in HTML with a splash of color. Also includes the auto-updated Chroma style gallery.☆38May 10, 2026Updated last month
- A Python based programming system for heterogeneous computing☆25Apr 29, 2025Updated last year
- An introduction to ARM64 assembly on Apple Silicon Macs