Triple-Z / AVX-AVX2-Example-Code
Example code for Intel AVX / AVX2 intrinsics.
☆138 Updated last year
Alternatives and similar repositories for AVX-AVX2-Example-Code
Users who are interested in AVX-AVX2-Example-Code are comparing it to the libraries listed below
- Implementation of TSM2L and TSM2R -- High-Performance Tall-and-Skinny Matrix-Matrix Multiplication Algorithms for CUDA ☆32 Updated 4 years ago
- Short examples illustrating AVX2 intrinsics for simple tasks. ☆95 Updated last year
- Introduction to Intel AVX-512 ☆49 Updated last year
- Chinese translation of the CUDA PTX-ISA document ☆42 Updated 3 weeks ago
- Third-party assembler and GEMM library for NVIDIA Kepler GPU ☆81 Updated 5 years ago
- ☆44 Updated 4 years ago
- Parallelized and vectorized SpMV on Intel Xeon Phi (Knights Landing, AVX512, KNL) ☆24 Updated last year
- Encapsulate the frequently used AVX instructions as independent modules to reduce repeated development workload. ☆122 Updated last year
- This is an implementation of sgemm_kernel on L1d cache. ☆228 Updated last year
- ☆113 Updated last year
- ☆247 Updated 2 weeks ago
- Dissecting NVIDIA GPU Architecture ☆97 Updated 2 years ago
- ☆98 Updated last year
- An extension library of WMMA API (Tensor Core API) ☆99 Updated 11 months ago
- A highly efficient library for GEMM operations on Sunway TaihuLight ☆17 Updated 4 years ago
- Stepwise optimizations of DGEMM on CPU, eventually outperforming Intel MKL, even under multithreading. ☆148 Updated 3 years ago
- Some source code about matrix multiplication implementation on CUDA ☆34 Updated 6 years ago
- A 128 bit unsigned integer class for CUDA ☆46 Updated 5 months ago
- How to design a CPU GEMM on x86 with 256-bit AVX that can beat OpenBLAS. ☆70 Updated 6 years ago
- An unofficial CUDA assembler, for all generations of SASS, hopefully :) ☆83 Updated 2 years ago
- ☆91 Updated 8 years ago
- ☆62 Updated 6 months ago
- ☆44 Updated 4 years ago
- NUMA-aware multi-CPU multi-GPU data transfer benchmarks ☆23 Updated last year
- ☆96 Updated 3 years ago
- Assembler for NVIDIA Fermi. Imported from Google Code ☆72 Updated 10 years ago
- Assembler for NVIDIA Volta and Turing GPUs ☆222 Updated 3 years ago
- An Open Source Kepler GPU Assembler ☆19 Updated 8 years ago
- Development repository for the Open Earth Compiler ☆80 Updated 4 years ago
- IMPACT GPU Algorithms Teaching Labs ☆57 Updated 2 years ago