corsix / amxLinks
Apple AMX Instruction Set
☆1,093Updated 5 months ago
Alternatives and similar repositories for amx
Users that are interested in amx are comparing it to the libraries listed below
Sorting:
- Exploring the scalable matrix extension of the Apple M4 processor☆178Updated 7 months ago
- Apple G13 GPU architecture docs and tools☆593Updated last month
- Apple GPU microarchitecture☆526Updated 9 months ago
- Apple Firestorm/Icestorm CPU microarchitecture docs☆241Updated last year
- Reverse engineered Linux driver for the Apple Neural Engine (ANE).☆416Updated last year
- Nvidia Instruction Set Specification Generator☆278Updated 11 months ago
- Dissecting the M1's GPU for 3D acceleration☆1,009Updated 3 years ago
- ☆283Updated 5 months ago
- Reverse engineering Rosetta 2 on M1 Mac☆406Updated 3 years ago
- ☆1,041Updated last month
- GPUOcelot: A dynamic compilation framework for PTX☆194Updated 4 months ago
- ☆296Updated last year
- A new (MLIR based) high-level IR for clang.☆506Updated this week
- MLIR For Beginners tutorial☆998Updated 4 months ago
- An unofficial cuda assembler, for all generations of SASS, hopefully :)☆505Updated 2 years ago
- Infrastructure for Machine Learning Guided Optimization (MLGO) in LLVM.☆703Updated 3 weeks ago
- The RISC-V Virtual Machine☆1,060Updated this week
- Trying to figure various CPU things out☆136Updated this week
- The fastest RISC-V sandbox☆884Updated this week
- BLIS fork with kernels for Apple M1. (Perhaps) The first open-source BLAS with Apple Matrix Coprocessor support.☆35Updated 2 years ago
- Measures the latency between CPU cores☆1,224Updated 10 months ago
- C/C++ frontend for MLIR. Also features polyhedral optimizations, parallel optimizations, and more!☆553Updated this week
- Multi-Threaded FP32 Matrix Multiplication on x86 CPUs☆351Updated 2 months ago
- Unofficial description of the CUDA assembly (SASS) instruction sets.☆103Updated 3 months ago
- Sniff CUDA ioctls☆195Updated 2 years ago
- This repository contains high-performance implementations of memset and memcpy in assembly.☆331Updated 3 years ago
- An introduction to ARM64 assembly on Apple Silicon Macs☆4,697Updated 2 months ago
- ☆1,542Updated this week
- Circuit IR Compilers and Tools☆1,845Updated this week
- Exocompilation for productive programming of hardware accelerators☆607Updated this week