chen0031 / AVX-AVX2-Example-Code
Example code for Intel AVX / AVX2 intrinsics.
☆19Updated 5 years ago
Related projects:
- ☆32Updated 2 years ago
- Rebuild YatSenOS On RISC-V 64.☆19Updated 2 years ago
- ☆22Updated this week
- This is an implementation of sgemm_kernel on L1d cache.☆212Updated 6 months ago
- A personal translation of the book "Data Parallel C++"☆68Updated 3 years ago
- ☆100Updated 5 months ago
- ☆71Updated last year
- CS149 xmake version☆35Updated 9 months ago
- ☆20Updated this week
- ☆38Updated 3 years ago
- ☆13Updated 5 years ago
- Notes of computer science courses☆24Updated 4 years ago
- Spack package repository maintained by Student Cluster Competition Team @ Sun Yat-sen University.☆14Updated 2 months ago
- Documentation for YatCPU☆47Updated 10 months ago
- A tutorial series on x86-64 SIMD vector optimization☆101Updated 2 months ago
- HUST Heptagon (华科七边形). Guidance and discussion from everyone are welcome.☆27Updated 3 years ago
- Documentation for HPC course☆127Updated 3 months ago
- A toy compiler written in C++17 that translates SysY (a C-like toy language) into ARM-v7a assembly.☆136Updated 3 years ago
- Personal Notes for Learning HPC & Parallel Computation [Active Adding New Content]☆56Updated 2 years ago
- ☆22Updated 6 months ago
- Stepwise optimizations of DGEMM on CPU, eventually outperforming Intel MKL, even under multithreading.☆103Updated 2 years ago
- DGEMM on KNL, achieving 75% of MKL performance☆15Updated 2 years ago
- Emulating Lisp with C++ template metaprogramming☆108Updated 4 years ago
- How to optimize sgemm on single-threaded ARM CPUs, multi-threaded ARM CPUs, and NVIDIA GPUs☆15Updated 3 years ago
- Operating systems lab☆9Updated 5 years ago
- ☆19Updated 10 months ago
- This repository records the experiment of Parallel Computing Class in 2018 SE SCUT.☆24Updated 5 years ago
- An implementation of HPL-AI Mixed-Precision Benchmark based on hpl-2.3☆27Updated 3 years ago
- HPC-roadmap for 2021 recruitment☆36Updated last year
- ☆12Updated last year