Apple AMX Instruction Set
☆1,235Dec 26, 2024Updated last year
Alternatives and similar repositories for amx
Users that are interested in amx are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Apple G13 GPU architecture docs and tools☆667May 16, 2025Updated last year
- Rust wrapper for Apple Matrix Coprocessor (AMX) instructions☆55Nov 14, 2023Updated 2 years ago
- Apple GPU microarchitecture☆613Sep 22, 2024Updated last year
- Exploring the scalable matrix extension of the Apple M4 processor☆231Nov 7, 2024Updated last year
- Apple Firestorm/Icestorm CPU microarchitecture docs☆260Jul 13, 2023Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Running linear algebra as fast as possible on Apple silicon☆29Aug 18, 2023Updated 2 years ago
- Everything we actually know about the Apple Neural Engine (ANE)☆2,477Mar 12, 2026Updated 3 months ago
- BLIS fork with kernels for Apple M1. (Perhaps) The first open-source BLAS with Apple Matrix Coprocessor support.☆36Jan 7, 2023Updated 3 years ago
- ☆34Mar 31, 2025Updated last year
- An introduction to ARM64 assembly on Apple Silicon Macs☆4,975May 15, 2026Updated 3 weeks ago
- ☆369Sep 25, 2025Updated 8 months ago
- Performance-portable, length-agnostic SIMD with runtime dispatch☆5,616Updated this week
- FlashAttention (Metal Port)☆605Sep 22, 2024Updated last year
- mold: A Modern Linker 🦠☆16,568Jun 7, 2026Updated last week
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Implementations of SIMD instruction sets for systems which don't natively support them.☆3,039Jun 4, 2026Updated last week
- Reverse engineered Linux driver for the Apple Neural Engine (ANE).☆489Mar 12, 2024Updated 2 years ago
- Kernel extension that enables TSO for Apple silicon processes☆271Jun 18, 2023Updated 2 years ago
- tiniest x86-64-linux emulator☆7,510Dec 10, 2025Updated 6 months ago
- Tool for messing around with Apple GPU assembly☆27Jan 24, 2021Updated 5 years ago
- A bootloader and experimentation playground for Apple Silicon☆4,101Jun 7, 2026Updated last week
- Reference implementation of the Transformer architecture optimized for Apple Neural Engine (ANE)☆2,719Apr 25, 2023Updated 3 years ago
- Extract Metal functions from .metallib files.☆185May 24, 2023Updated 3 years ago
- A collection of reverse engineered Apple things, as well as a machine-readable database of Apple hardware☆1,334Jan 10, 2026Updated 5 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Decompiling macOS Hypervisor.framework by hand☆135Sep 13, 2022Updated 3 years ago
- LZBITMAP compression library☆56Jan 18, 2023Updated 3 years ago
- Dissecting the M1's GPU for 3D acceleration☆1,022Apr 4, 2022Updated 4 years ago
- You like pytorch? You like micrograd? You love tinygrad! ❤️☆33,035Updated this week
- Reverse engineering Rosetta 2 on M1 Mac☆432Aug 3, 2021Updated 4 years ago
- MLX: An array framework for Apple silicon☆26,868Updated this week
- RSD: RISC-V Out-of-Order Superscalar Processor☆1,181Feb 21, 2026Updated 3 months ago
- Measures the latency between CPU cores☆1,351Mar 25, 2026Updated 2 months ago
- Apple Silicon NOR dumper☆50Nov 8, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆12Dec 31, 2020Updated 5 years ago
- ☆1,506Jul 22, 2022Updated 3 years ago
- A translator from Intel SSE intrinsics to Arm/Aarch64 NEON implementation☆1,512Apr 12, 2026Updated 2 months ago
- Dump Apple PMU counter definitions from `/usr/share/kpep` in macOS☆16Mar 25, 2026Updated 2 months ago
- Optimized implementations of various library functions for ARM architecture processors☆698May 18, 2026Updated 3 weeks ago
- iPhone 11 emulated on QEMU☆2,203Oct 22, 2022Updated 3 years ago
- CLI Tools For ANE☆127Mar 25, 2021Updated 5 years ago