VictorRodriguez / AVX-SGLinks
Advanced Vector Extensions (AVX) basic tutorial
☆37Updated 4 years ago
Alternatives and similar repositories for AVX-SG
Users that are interested in AVX-SG are comparing it to the libraries listed below
Sorting:
- Example code for Intel AVX / AVX2 intrinsics.☆144Updated 2 years ago
- Short examples illustrating AVX2 intrinsics for simple tasks.☆98Updated last year
- ☆97Updated 8 years ago
- Parallel Memory Bandwidth Measurement / Benchmark Tool☆115Updated 3 years ago
- TLB Benchmarks☆35Updated 8 years ago
- Chai☆47Updated last month
- The SHOC Benchmark Suite☆259Updated 3 months ago
- Flexible GPGPU instrumentation☆89Updated 6 years ago
- tools to create performance and roofline plots from measured data☆60Updated 11 years ago
- A low-overhead tool to periodically collect system-wide hardware performance counters on Intel64 systems.☆32Updated 3 years ago
- Pointer-chasing memory benchmark (forked from Doug Pase's code).☆59Updated 12 years ago
- Test the non-AVX, AVX2 and AVX-512 speeds across various active core counts☆230Updated last year
- ☆48Updated 5 years ago
- Parallelized and vectorized SpMV on Intel Xeon Phi (Knights Landing, AVX512, KNL)☆24Updated last year
- Tools and extensions for CUDA profiling☆65Updated 5 years ago
- GPUOCelot: A dynamic compilation framework for PTX☆289Updated 2 years ago
- The SparseX sparse kernel optimization library☆43Updated 6 years ago
- A collection of performance analysis tools, recipes, handy scripts, microbenchmarks & more☆143Updated 6 months ago
- 🎃 GPU load-balancing library for regular and irregular computations.☆64Updated 4 months ago
- Graph500 reference implementations☆181Updated 3 years ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆109Updated 8 years ago
- Provides a set of benchmarks that can be used to measure the memory bandwidth performance of CPU's☆91Updated last year
- Instruction THroughput Estimator using MAchine Learning (ITHEMAL)☆152Updated 4 years ago
- This is a mirror of the official libpfm4 git repository, https://sourceforge.net/p/perfmon2/libpfm4/ci/master/tree/ with some local branc…☆69Updated last year
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.☆138Updated 2 years ago
- ☆197Updated this week
- ☆320Updated 3 weeks ago
- Scalable GPU Kernel Fission/Fusion Transformation for Memory-Bound Kernels☆14Updated 10 years ago
- ☆34Updated 3 years ago
- A Benchmark Suite for Heterogeneous System Computation☆55Updated 10 months ago