Short examples illustrating AVX2 intrinsics for simple tasks.
☆98Mar 13, 2024Updated 2 years ago
Alternatives and similar repositories for avx2-examples
Users that are interested in avx2-examples are comparing it to the libraries listed below.
- Example code for Intel AVX / AVX2 intrinsics.☆145Sep 18, 2023Updated 2 years ago
- Dictionary compressor with nibbled ANS and optimal parsing. Other compression experiments.☆26Apr 13, 2025Updated 11 months ago
- Compiler plugin for performance analysis of HIP applications☆13Apr 7, 2025Updated 11 months ago
- This is no longer maintained. Please visit StreamHPC's fork: https://github.com/StreamHPC/FinanceBench ☆43Apr 20, 2018Updated 7 years ago
- A low-overhead tool to periodically collect system-wide hardware performance counters on Intel64 systems.☆32Aug 2, 2022Updated 3 years ago
- An implementation of SGEMV with performance comparable to cuBLAS.☆12May 21, 2021Updated 4 years ago
- The ultimate bandwidth benchmark☆62Dec 16, 2025Updated 3 months ago
- cuDTW++: Ultra-Fast Dynamic Time Warping on CUDA-enabled GPUs☆33May 11, 2020Updated 5 years ago
- Collection of tools used for CTFs☆15Oct 24, 2021Updated 4 years ago
- Performance tests of CP-ABE encryption, decryption, and key generation operations☆11Jun 24, 2020Updated 5 years ago
- The vOW4SIKE project provides C code that implements the parallel collision search algorithm by van Oorschot and Wiener (vOW). The algori…☆12May 25, 2021Updated 4 years ago
- Source code of the IPDPS '21 paper: "TileSpMV: A Tiled Algorithm for Sparse Matrix-Vector Multiplication on GPUs" by Yuyao Niu, Zhengyang…☆12Aug 12, 2022Updated 3 years ago
- Fast 4 way vectorized ladder for the complete set of Montgomery curves☆11Feb 13, 2019Updated 7 years ago
- Measure instruction latency and throughput☆31Sep 2, 2025Updated 6 months ago
- Elastic and fault tolerant parallel map and parallel map reduce methods. Part of the COFII framework.☆16Aug 6, 2025Updated 7 months ago
- Anonymous Credit Tokens implementation in Rust☆26Mar 2, 2026Updated 3 weeks ago
- Slurm Examples☆10Aug 30, 2024Updated last year
- A Benchmark Toolkit for Assembly Instructions Using the LLVM JIT☆17Oct 26, 2020Updated 5 years ago
- TVM learning and research☆13Jan 8, 2021Updated 5 years ago
- ☆11Mar 15, 2023Updated 3 years ago
- A collaborative machine learning framework that operates through Tor.☆13Jun 1, 2020Updated 5 years ago
- 🍒 A massif (Valgrind) extension to analyze partial memory consumptions☆22Dec 6, 2016Updated 9 years ago
- Picorv32 SoC on the TinyFPGA BX, for games etc.☆12Sep 22, 2018Updated 7 years ago
- Mining CryptoNight Haven on the Varium C1100☆10Apr 1, 2022Updated 3 years ago
- Variadic recursive expression templates with lazy evaluation which look like ordinary (possibly nested) containers.☆17Feb 5, 2023Updated 3 years ago
- A simple cycle accurate template model for ASIC/FPGA hardware design. Including a cycle accurate FIFO design example. More designs are co…☆17Sep 5, 2019Updated 6 years ago
- HCC Sample Applications☆13Jan 3, 2017Updated 9 years ago
- Baseline solution for ADC2023 ECML competition☆11Apr 14, 2023Updated 2 years ago
- Implement asm gemm on vega64 for 4096x4096 fp32 matrix☆22Oct 12, 2019Updated 6 years ago
- Presentation materials for the 2016 Berkeley C++ Summit☆14Oct 20, 2016Updated 9 years ago
- ☆12Feb 9, 2026Updated last month
- Fast streams for block gzip files.☆14Nov 11, 2025Updated 4 months ago
- Blockchain-based donation traceability platform☆13Jul 29, 2020Updated 5 years ago
- ☆13Sep 19, 2024Updated last year
- Implementation of Brakerski's leveled homomorphic encryption system☆44Feb 12, 2017Updated 9 years ago
- ☆13Jun 12, 2021Updated 4 years ago
- ☆24Oct 17, 2016Updated 9 years ago
- C++ RandomForest☆12Jan 31, 2015Updated 11 years ago
- An intelligent matrix format designer for SpMV☆10Oct 10, 2023Updated 2 years ago