corsix / amx
Apple AMX Instruction Set
☆1,005Updated 6 months ago
Alternatives and similar repositories for amx:
Users that are interested in amx are comparing it to the libraries listed below
- Apple G13 GPU architecture docs and tools☆552Updated 7 months ago
- Apple GPU microarchitecture☆486Updated 2 months ago
- Exploring the scalable matrix extension of the Apple M4 processor☆149Updated last month
- Apple Firestorm/Icestorm CPU microarchitecture docs☆224Updated last year
- Reverse engineered Linux driver for the Apple Neural Engine (ANE).☆377Updated 9 months ago
- Dissecting the M1's GPU for 3D acceleration☆990Updated 2 years ago
- Reverse engineering Rosetta 2 on M1 Mac☆375Updated 3 years ago
- ☆252Updated 5 months ago
- Everything we actually know about the Apple Neural Engine (ANE)☆2,099Updated 2 months ago
- Nvidia Instruction Set Specification Generator☆230Updated 5 months ago
- ☆1,010Updated 2 weeks ago
- A tiny C header-only risc-v emulator.☆1,707Updated last week
- The fastest RISC-V sandbox☆638Updated this week
- ☆386Updated last week
- nsync is a C library that exports various synchronization primitives, such as mutexes☆1,088Updated 4 months ago
- MLIR For Beginners tutorial☆845Updated 2 months ago
- C++ template library for high performance SIMD based sorting algorithms☆899Updated 2 weeks ago
- Kernel extension that enables TSO for Apple silicon processes☆254Updated last year
- The RISC-V Virtual Machine☆953Updated 2 weeks ago
- Vulkan/CUDA/HIP/OpenCL/Level Zero/Metal Fast Fourier Transform library☆1,562Updated 2 months ago
- advanced compilers☆767Updated 3 months ago
- ☆290Updated 7 months ago
- GPUOcelot: A dynamic compilation framework for PTX☆153Updated this week
- An unofficial cuda assembler, for all generations of SASS, hopefully :)☆409Updated last year
- Infrastructure for Machine Learning Guided Optimization (MLGO) in LLVM.☆638Updated last week
- Contains the source code examples described in the "Intel® 64 and IA-32 Architectures Optimization Reference Manual"☆785Updated 7 months ago
- A self-hosting and educational C optimizing compiler☆1,146Updated this week
- GPU-accelerated compiler☆339Updated 8 months ago
- Implementations of SIMD instruction sets for systems which don't natively support them.☆2,462Updated this week
- Assembler for NVIDIA Maxwell architecture☆956Updated last year