Exploring the scalable matrix extension of the Apple M4 processor
☆222 · Nov 7, 2024 · Updated last year
Alternatives and similar repositories for m4-sme-exploration
Users who are interested in m4-sme-exploration are comparing it to the libraries listed below.
- ☆33 · Mar 31, 2025 · Updated 11 months ago
- Apple AMX Instruction Set · ☆1,198 · Dec 26, 2024 · Updated last year
- Interactive GUI Snowfall Simulation Created in C & Raylib · ☆23 · Dec 24, 2025 · Updated 2 months ago
- CPU micro benchmarks · ☆76 · Feb 11, 2026 · Updated 3 weeks ago
- Apple GPU microarchitecture · ☆579 · Sep 22, 2024 · Updated last year
- Everything we actually know about the Apple Neural Engine (ANE) · ☆2,408 · Oct 21, 2025 · Updated 4 months ago
- Supplemental materials for The ASPLOS 2025 / EuroSys 2025 Contest on Intra-Operator Parallelism for Distributed Deep Learning · ☆25 · May 12, 2025 · Updated 9 months ago
- HypergraphZ - A Hypergraph Implementation in Zig · ☆113 · Jan 10, 2026 · Updated 2 months ago
- Distributed SDDMM Kernel · ☆12 · Jul 8, 2022 · Updated 3 years ago
- Nim reimplementation of gron tool · ☆11 · Oct 9, 2024 · Updated last year
- LLM tokenizer in Zig · ☆15 · Dec 7, 2025 · Updated 3 months ago
- Binary Ninja Plugin for RISC-V · ☆14 · Nov 29, 2023 · Updated 2 years ago
- A simple to use C++ file library · ☆11 · Mar 2, 2022 · Updated 4 years ago
- ☆11 · Dec 1, 2023 · Updated 2 years ago
- ☆312 · Feb 6, 2026 · Updated last month
- Nvidia Instruction Set Specification Generator · ☆314 · Jul 9, 2024 · Updated last year
- Coding in colors · ☆13 · May 11, 2022 · Updated 3 years ago
- (only somewhat realistic) Simulation of a black hole. Uses OpenCL and raytracing, paired with some gravitational simulation, to simulate … · ☆12 · Jan 16, 2023 · Updated 3 years ago
- Reproducibility package for "Robust Join Processing with Diamond Hardened Joins" · ☆12 · Jul 10, 2024 · Updated last year
- Pass Rust strings to C with potentially not needing heap allocation · ☆13 · Jan 25, 2026 · Updated last month
- Zero allocation macros for retrieving multiple mutable indices from a mutable slice safely. · ☆15 · Jul 21, 2024 · Updated last year
- Audio Spectrum with SFML and FFTW · ☆13 · Dec 14, 2024 · Updated last year
- Stub for polymorphic code · ☆11 · Mar 18, 2023 · Updated 2 years ago
- ☆11 · Sep 11, 2023 · Updated 2 years ago
- Note taken while toying with VisionFive 2 · ☆12 · Feb 16, 2023 · Updated 3 years ago
- ☆16 · Dec 11, 2024 · Updated last year
- Reference implementation of the draft C++ GraphBLAS specification. · ☆32 · Feb 19, 2025 · Updated last year
- Experimental fork of zlib with performance improvements · ☆35 · Apr 1, 2023 · Updated 2 years ago
- Example for running IREE in a bare-metal Arm environment. · ☆40 · Feb 24, 2026 · Updated 2 weeks ago
- BLIS fork with kernels for Apple M1. (Perhaps) The first open-source BLAS with Apple Matrix Coprocessor support. · ☆36 · Jan 7, 2023 · Updated 3 years ago
- Apple Firestorm/Icestorm CPU microarchitecture docs · ☆252 · Jul 13, 2023 · Updated 2 years ago
- x86-64, ARM, and RVV intrinsics viewer · ☆76 · Feb 15, 2026 · Updated 3 weeks ago
- The University of Bristol HPC Simulation Engine · ☆104 · Aug 30, 2025 · Updated 6 months ago
- Bomb Jack arcade resources · ☆67 · Updated this week
- Multi-branch model for concurrent execution · ☆18 · Jun 27, 2023 · Updated 2 years ago
- A simple yet high performance web server written with epoll and pure c. · ☆18 · Jun 7, 2019 · Updated 6 years ago
- Control your Roku with hand gestures using Mediapipe and Python · ☆17 · Dec 5, 2024 · Updated last year
- A game where you need to protect the Shehzadi from an onslaught of ants! · ☆16 · Sep 13, 2024 · Updated last year
- ☆1,078 · May 18, 2025 · Updated 9 months ago