corsix / amxLinks
Apple AMX Instruction Set
☆1,193Updated last year
Alternatives and similar repositories for amx
Users that are interested in amx are comparing it to the libraries listed below
Sorting:
- Apple G13 GPU architecture docs and tools☆642Updated 8 months ago
- Apple GPU microarchitecture☆578Updated last year
- Exploring the scalable matrix extension of the Apple M4 processor☆221Updated last year
- Reverse engineered Linux driver for the Apple Neural Engine (ANE).☆454Updated last year
- Apple Firestorm/Icestorm CPU microarchitecture docs☆251Updated 2 years ago
- Dissecting the M1's GPU for 3D acceleration☆1,018Updated 3 years ago
- ☆312Updated 4 months ago
- ☆451Updated 10 months ago
- Everything we actually know about the Apple Neural Engine (ANE)☆2,357Updated 3 months ago
- Nvidia Instruction Set Specification Generator☆311Updated last year
- Reverse engineering Rosetta 2 on M1 Mac☆426Updated 4 years ago
- ☆1,074Updated 8 months ago
- Kernel extension that enables TSO for Apple silicon processes☆265Updated 2 years ago
- Sniff CUDA ioctls☆224Updated 2 years ago
- BLIS fork with kernels for Apple M1. (Perhaps) The first open-source BLAS with Apple Matrix Coprocessor support.☆36Updated 3 years ago
- Exocompilation for productive programming of hardware accelerators☆708Updated this week
- ☆296Updated last year
- GPUOcelot: A dynamic compilation framework for PTX☆219Updated last year
- ☆88Updated 2 weeks ago
- Multi-Threaded FP32 Matrix Multiplication on x86 CPUs☆377Updated 9 months ago
- The fastest RISC-V sandbox☆1,023Updated 2 weeks ago
- Infrastructure for Machine Learning Guided Optimization (MLGO) in LLVM.☆754Updated last week
- MLIR For Beginners tutorial☆1,220Updated 6 months ago
- Measures the latency between CPU cores☆1,323Updated last year
- A new (MLIR based) high-level IR for clang.☆588Updated this week
- TT-NN operator library, and TT-Metalium low level kernel programming model.☆1,343Updated this week
- Running linear algebra as fast as possible on Apple silicon☆28Updated 2 years ago
- An unofficial cuda assembler, for all generations of SASS, hopefully :)☆567Updated 2 years ago
- advanced compilers☆890Updated last month
- Circuit IR Compilers and Tools☆2,025Updated this week