Advanced Matrix Extensions (AMX) Guide
☆109Jan 11, 2022Updated 4 years ago
Alternatives and similar repositories for AMX-Guide
Users that are interested in AMX-Guide are comparing it to the libraries listed below
- Code samples related to Intel(R) AMX☆39Apr 8, 2024Updated last year
- RISC-V Integrated Matrix Development Repository☆21Updated this week
- OpenCCA: An Open Framework to Enable Arm CCA Research☆20Sep 10, 2025Updated 5 months ago
- ☆12Sep 18, 2024Updated last year
- VNEC: A Vectorized Non-Empty Column Format for SpMV on cross-platform multicore CPUs☆10Feb 6, 2024Updated 2 years ago
- RISC-V by VectorBlox☆11Jul 19, 2017Updated 8 years ago
- Roofline prototype for Arm☆14Mar 25, 2024Updated last year
- TYPCN Internal Software Communication Protocol☆12Jul 21, 2017Updated 8 years ago
- ☆10Oct 19, 2017Updated 8 years ago
- Handwritten GEMM using Intel AMX (Advanced Matrix Extension)☆17Jan 11, 2025Updated last year
- Parametric floating-point unit with support for standard RISC-V formats and operations as well as transprecision formats.☆18Nov 13, 2025Updated 3 months ago
- GoPTX: Fine-grained GPU Kernel Fusion by PTX-level Instruction Flow Weaving☆20Jul 30, 2025Updated 7 months ago
- A matrix extension proposal for AI applications under RISC-V architecture☆162Feb 11, 2025Updated last year
- An awesome curated list of languages and tools to program FPGAs☆73Jun 22, 2022Updated 3 years ago
- A collection of cryptographic algorithms implemented in SystemVerilog☆20Jun 7, 2018Updated 7 years ago
- matmul using AMX instructions☆23May 7, 2024Updated last year
- ☆19Feb 18, 2021Updated 5 years ago
- ☆26Jul 19, 2022Updated 3 years ago
- ☆58Feb 18, 2019Updated 7 years ago
- A WIP Float32 soft FPU implementation☆22Jun 25, 2021Updated 4 years ago
- Build an open source, extremely simple DMA.☆23Feb 17, 2019Updated 7 years ago
- Benchmark code for the "Online normalizer calculation for softmax" paper☆106Jul 27, 2018Updated 7 years ago
- Simulator code of the paper "Dissecting and Modeling the Architecture of Modern GPU Cores"☆64Oct 15, 2025Updated 4 months ago
- Added detailed comments to the HSNW source code☆20Oct 26, 2022Updated 3 years ago
- ☆28Jun 17, 2025Updated 8 months ago
- ☆30Dec 11, 2025Updated 2 months ago
- x86-64, ARM, and RVV intrinsics viewer☆76Feb 15, 2026Updated 2 weeks ago
- A multi-platform file-configurable folder comparison tool with html-reporting written in rust☆12Feb 13, 2026Updated 2 weeks ago
- ☆24Jan 6, 2023Updated 3 years ago
- Code to evaluate XLATE attacks as well existing cache attacks.☆31Aug 17, 2018Updated 7 years ago
- Implementation of IceFormer: Accelerated Inference with Long-Sequence Transformers on CPUs (ICLR 2024).☆25Feb 22, 2026Updated last week
- ☆64Dec 4, 2022Updated 3 years ago
- ☆75Apr 18, 2025Updated 10 months ago
- ☆19Jan 14, 2026Updated last month
- Data Structures and Algorithms. Contribute and Learn together.☆11Oct 11, 2022Updated 3 years ago
- Verilog/SystemVerilog Guide☆80Jan 4, 2024Updated 2 years ago
- General Purpose Timing Library☆34Aug 17, 2025Updated 6 months ago
- ☆80Sep 4, 2024Updated last year
- A floating-point matrix multiplication implemented in hardware☆32Jan 5, 2021Updated 5 years ago