Example code for Intel AVX / AVX2 intrinsics.
☆146Sep 18, 2023Updated 2 years ago
Alternatives and similar repositories for AVX-AVX2-Example-Code
Users that are interested in AVX-AVX2-Example-Code are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Short examples illustrating AVX2 intrinsics for simple tasks.☆101Mar 13, 2024Updated 2 years ago
- A test library for computing modular exponentiation in parallel using AVX-512 vector arithmetic☆12Dec 18, 2023Updated 2 years ago
- Rebuild YatSenOS On RISC-V 64.☆23Jan 6, 2022Updated 4 years ago
- Advanced Vector Extensions (AVX) basic tutorial☆38Jun 11, 2026Updated last week
- A Method for efficiently processing SpMV using SIMD and load balancing☆17Apr 4, 2022Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A simple demonstration of how PyTorch autograd works☆16Sep 23, 2021Updated 4 years ago
- Artifact for PPoPP 2018 paper "Making Pull-Based Graph Processing Performant"☆23Apr 23, 2020Updated 6 years ago
- Software optimized implementations of GIFT and GIFT-COFB☆18Mar 29, 2022Updated 4 years ago
- Portable wrapper for SIMD and vector instructions written in C++11. Compatible with NEON, SSE, AVX, AVX-512 and SVE (length specific).☆528Apr 15, 2026Updated 2 months ago
- ☆22Aug 14, 2024Updated last year
- Documentation for YatCPU☆55Nov 15, 2023Updated 2 years ago
- Assignments for the cryptography engineering course☆12Dec 17, 2013Updated 12 years ago
- 基于muduo+protobuf+zookeeper的一个rpc分布式网络通信框架☆15May 31, 2023Updated 3 years ago
- Example code for Intel AVX / AVX2 intrinsics.☆21Oct 6, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆10May 21, 2020Updated 6 years ago
- SIMD Vectorized implementation of X25519, Ed25519, X448 and Ed448☆33Mar 10, 2025Updated last year
- QCD for Intel Xeon Phi and Xeon processors☆14Mar 20, 2024Updated 2 years ago
- Hardware implementation of Saber☆10Jul 14, 2020Updated 5 years ago
- follow NVIDIA, simplify it and support data parallel.☆13Sep 26, 2019Updated 6 years ago
- Vector class library, latest version☆1,456Apr 14, 2026Updated 2 months ago
- a pytorch implementation of Google GEDLoss☆32Dec 9, 2020Updated 5 years ago
- Saber and NTRU on M4 and AVX2☆19Jan 15, 2022Updated 4 years ago
- ☆99Feb 10, 2017Updated 9 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆2,015Jul 29, 2023Updated 2 years ago
- MLKEM implementation optimized for embedded microcontrollers☆30Dec 1, 2025Updated 6 months ago
- Codes for paper <InteL-VAEs: Adding Inductive Biases to VariationalAuto-Encoders via Intermediary Latents>.☆18Jun 25, 2021Updated 4 years ago
- SpMV using CUDA☆20Mar 5, 2018Updated 8 years ago
- 中山大学计算机网络实验 (2019 春):配置实验、编程实验、“小溪网”理论练习题☆51Dec 17, 2020Updated 5 years ago
- We use Loyalty smart contract to keep track of points earned and redeem by a member of the Synergy Loyalty program. Besides, we hold for …☆11Feb 3, 2023Updated 3 years ago
- SpV8 is a SpMV kernel written in AVX-512. Artifact for our SpV8 paper @ DAC '21.☆29Mar 16, 2021Updated 5 years ago
- Source code of the IPDPS '21 paper: "TileSpMV: A Tiled Algorithm for Sparse Matrix-Vector Multiplication on GPUs" by Yuyao Niu, Zhengyang…☆13Aug 12, 2022Updated 3 years ago
- Testing AVX capabilities with GCC☆11Jan 24, 2016Updated 10 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A CPU tool for benchmarking the peak of floating points☆584May 4, 2026Updated last month
- Algorand's reference implementation of bls signature scheme☆14Sep 7, 2020Updated 5 years ago
- I would like to share my collection and my homework in SYSU Computer Science courses and elected courses.☆70Jul 6, 2023Updated 2 years ago
- A GPU FP32 computation method with Tensor Cores.☆27Dec 8, 2025Updated 6 months ago
- FAROS: A Framework for Benchmarking and Analysis of Compiler Optimization☆12Dec 23, 2022Updated 3 years ago
- Using C++ magic to capture CUDA kernels and tune them with Kernel Tuner☆21Sep 12, 2025Updated 9 months ago
- Stepwise optimizations of DGEMM on CPU, reaching performance faster than Intel MKL eventually, even under multithreading.☆163Feb 3, 2022Updated 4 years ago