chen0031 / AVX-AVX2-Example-Code
Example code for Intel AVX / AVX2 intrinsics.
☆20Updated 6 years ago
Alternatives and similar repositories for AVX-AVX2-Example-Code:
Users that are interested in AVX-AVX2-Example-Code are comparing it to the libraries listed below
- Chinese version for Agner Fog's optimizing series☆80Updated 6 years ago
- ☆32Updated 3 years ago
- Spack package repository maintained by Student Cluster Competition Team @ Sun Yat-sen University.☆16Updated 5 months ago
- ☆109Updated last year
- This is an implementation of sgemm_kernel on L1d cache.☆226Updated last year
- Example code for Intel AVX / AVX2 intrinsics.☆137Updated last year
- Rebuild YatSenOS On RISC-V 64.☆19Updated 3 years ago
- CUDA PTX-ISA Document 中文翻译版☆38Updated last month
- Stepwise optimizations of DGEMM on CPU, reaching performance faster than Intel MKL eventually, even under multithreading.☆144Updated 3 years ago
- Domain-specific framework for performance analysis of parallel programs☆16Updated 2 months ago
- Documentation for HPC course☆148Updated last week
- ☆41Updated 3 years ago
- ngAP's artifact for ASPLOS'24☆23Updated 3 months ago
- Wiki fo HPC☆112Updated 3 months ago
- ☆70Updated 2 years ago
- A highly efficient library for GEMM operations on Sunway TaihuLight☆17Updated 4 years ago
- x86-64 SIMD矢量优化系列教程☆119Updated 3 weeks ago
- ☆21Updated 2 years ago
- 个人翻译《Data Parallel C++》☆74Updated 3 years ago
- 使用 C++ 模板元编程模拟 Lisp☆111Updated 4 years ago
- Documentation for YatCPU☆50Updated last year
- Personal Notes for Learning HPC & Parallel Computation [Active Adding New Content]☆65Updated 2 years ago
- ☆26Updated last year
- ☆36Updated 3 months ago
- Xiao's CUDA Optimization Guide [Active Adding New Contents]☆288Updated 2 years ago
- ☆10Updated 2 years ago
- PerFlow-AI is a programmable performance analysis, modeling, prediction tool for AI system.☆19Updated this week
- ☆138Updated 4 months ago
- HPC-roadmap for 2021 recruitment☆41Updated last month
- ☆88Updated 3 weeks ago