corsix / amx
Apple AMX Instruction Set
☆1,098 · Updated 6 months ago
Alternatives and similar repositories for amx
Users who are interested in amx are comparing it to the repositories listed below.
- Apple G13 GPU architecture docs and tools ☆596 · Updated last month
- Apple GPU microarchitecture ☆530 · Updated 9 months ago
- Exploring the scalable matrix extension of the Apple M4 processor ☆184 · Updated 8 months ago
- Apple Firestorm/Icestorm CPU microarchitecture docs ☆242 · Updated 2 years ago
- ☆286 · Updated 6 months ago
- Nvidia Instruction Set Specification Generator ☆280 · Updated last year
- ☆448 · Updated 3 months ago
- Dissecting the M1's GPU for 3D acceleration ☆1,009 · Updated 3 years ago
- Sniff CUDA ioctls ☆196 · Updated 2 years ago
- ☆1,043 · Updated last month
- GPUOcelot: A dynamic compilation framework for PTX ☆201 · Updated 5 months ago
- Infrastructure for Machine Learning Guided Optimization (MLGO) in LLVM. ☆706 · Updated last month
- Exocompilation for productive programming of hardware accelerators ☆640 · Updated this week
- Circuit IR Compilers and Tools ☆1,855 · Updated this week
- MLIR For Beginners tutorial ☆1,009 · Updated 5 months ago
- The fastest RISC-V sandbox ☆895 · Updated 3 weeks ago
- A new (MLIR based) high-level IR for clang. ☆510 · Updated this week
- TT-NN operator library, and TT-Metalium low level kernel programming model. ☆991 · Updated this week
- C++ template library for high performance SIMD based sorting algorithms ☆956 · Updated 3 weeks ago
- nsync is a C library that exports various synchronization primitives, such as mutexes ☆1,183 · Updated 3 months ago
- Multi-Threaded FP32 Matrix Multiplication on x86 CPUs ☆350 · Updated 2 months ago
- Measures the latency between CPU cores ☆1,233 · Updated 11 months ago
- ☆47 · Updated this week
- An unofficial cuda assembler, for all generations of SASS, hopefully :) ☆515 · Updated 2 years ago
- Unofficial description of the CUDA assembly (SASS) instruction sets. ☆105 · Updated 4 months ago
- Implementations of SIMD instruction sets for systems which don't natively support them. ☆2,732 · Updated this week
- RDNA3 emulator ☆54 · Updated 2 months ago
- advanced compilers ☆841 · Updated this week
- The RISC-V Virtual Machine ☆1,068 · Updated 2 weeks ago
- C/C++ frontend for MLIR. Also features polyhedral optimizations, parallel optimizations, and more! ☆564 · Updated 3 weeks ago