Example code for Intel AVX / AVX2 intrinsics.
☆145 Sep 18, 2023 Updated 2 years ago
Alternatives and similar repositories for AVX-AVX2-Example-Code
Users that are interested in AVX-AVX2-Example-Code are comparing it to the libraries listed below.
- Short examples illustrating AVX2 intrinsics for simple tasks. ☆99 Mar 13, 2024 Updated 2 years ago
- Fast 4-way vectorized ladder for the complete set of Montgomery curves ☆11 Feb 13, 2019 Updated 7 years ago
- A test library for computing modular exponentiation in parallel using AVX-512 vector arithmetic ☆12 Dec 18, 2023 Updated 2 years ago
- Advanced Vector Extensions (AVX) basic tutorial ☆37 Jun 10, 2021 Updated 4 years ago
- A method for efficiently processing SpMV using SIMD and load balancing ☆17 Apr 4, 2022 Updated 4 years ago
- Sun Yat-sen University Operating System Principles lab (Spring 2019): a real-mode operating system built with GCC + NASM, comprising 7 lab projects ☆37 Mar 21, 2021 Updated 5 years ago
- Parallelized and vectorized SpMV on Intel Xeon Phi (Knights Landing, AVX512, KNL) ☆24 Feb 12, 2024 Updated 2 years ago
- A simple demonstration of how PyTorch autograd works ☆16 Sep 23, 2021 Updated 4 years ago
- Artifact for PPoPP 2018 paper "Making Pull-Based Graph Processing Performant" ☆23 Apr 23, 2020 Updated 5 years ago
- ASM methods to test small loop performance on x86 ☆13 Jun 11, 2019 Updated 6 years ago
- Software optimized implementations of GIFT and GIFT-COFB ☆18 Mar 29, 2022 Updated 4 years ago
- AVX-optimized sin(), cos(), exp() and log() functions ☆131 Jan 15, 2022 Updated 4 years ago
- ☆23 Aug 14, 2024 Updated last year
- Documentation for YatCPU ☆54 Nov 15, 2023 Updated 2 years ago
- This is a tuned sparse matrix dense vector multiplication (SpMV) library ☆23 Mar 21, 2016 Updated 10 years ago
- A game in which the player is held captive and trained by ボイチェビ (VOICEROID/CeVIO) characters ☆15 Sep 13, 2020 Updated 5 years ago
- Example code for Intel AVX / AVX2 intrinsics. ☆21 Oct 6, 2018 Updated 7 years ago
- ☆10 May 21, 2020 Updated 5 years ago
- rv1126,rk1808,hisi3516dv300,hisi3559av100,mnn,nnie,npu,vfnet,retinaface,arcface,centernet,ttf,yolox,detect,alg. This is a cross-platform … ☆33 May 28, 2024 Updated last year
- SIMD Vectorized implementation of X25519, Ed25519, X448 and Ed448 ☆32 Mar 10, 2025 Updated last year
- Hardware implementation of Saber ☆10 Jul 14, 2020 Updated 5 years ago
- Updated! (Dec2-2019) This is a C-language software library that provides optimized implementations of the Diffie-Hellman functions known … ☆44 Nov 10, 2023 Updated 2 years ago
- A PyTorch implementation of Google GEDLoss ☆32 Dec 9, 2020 Updated 5 years ago
- ☆98 Feb 10, 2017 Updated 9 years ago
- ☆2,006 Jul 29, 2023 Updated 2 years ago
- Performance tests of CP-ABE encryption, decryption, and key generation operations ☆11 Jun 24, 2020 Updated 5 years ago
- MLKEM implementation optimized for embedded microcontrollers ☆28 Dec 1, 2025 Updated 4 months ago
- SpV8 is a SpMV kernel written in AVX-512. Artifact for our SpV8 paper @ DAC '21. ☆29 Mar 16, 2021 Updated 5 years ago
- FELICS Framework ☆11 Dec 5, 2019 Updated 6 years ago
- Exploration of NIST post-quantum signatures on-ramp candidates ☆39 Jun 1, 2025 Updated 10 months ago
- Source code of the IPDPS '21 paper: "TileSpMV: A Tiled Algorithm for Sparse Matrix-Vector Multiplication on GPUs" by Yuyao Niu, Zhengyang… ☆13 Aug 12, 2022 Updated 3 years ago
- Optimized assembly implementations of crypto for the RV32I (RISC-V) architecture ☆32 Oct 14, 2020 Updated 5 years ago
- Artifact of paper "Exploiting Recent SIMD Architectural Advances for Irregular Applications" ☆11 Jun 23, 2016 Updated 9 years ago
- A CPU tool for benchmarking peak floating-point performance ☆579 Feb 7, 2026 Updated 2 months ago
- ICASSP 2021 accepted papers on voice conversion (VC) ☆18 Apr 11, 2021 Updated 5 years ago
- I would like to share my collection and my homework in SYSU Computer Science courses and elective courses. ☆69 Jul 6, 2023 Updated 2 years ago
- A GPU FP32 computation method with Tensor Cores. ☆26 Dec 8, 2025 Updated 4 months ago
- Using C++ magic to capture CUDA kernels and tune them with Kernel Tuner ☆21 Sep 12, 2025 Updated 7 months ago
- Stepwise optimizations of DGEMM on CPU, eventually outperforming Intel MKL, even under multithreading. ☆162 Feb 3, 2022 Updated 4 years ago