chen0031 / AVX-AVX2-Example-CodeLinks
Example code for Intel AVX / AVX2 intrinsics.
☆20Updated 6 years ago
Alternatives and similar repositories for AVX-AVX2-Example-Code
Users that are interested in AVX-AVX2-Example-Code are comparing it to the libraries listed below
Sorting:
- ☆32Updated 3 years ago
- Chinese version for Agner Fog's optimizing series☆80Updated 6 years ago
- 个人翻译《Data Parallel C++》☆75Updated 3 years ago
- This is an implementation of sgemm_kernel on L1d cache.☆228Updated last year
- x86-64 SIMD矢量优化系列教程☆121Updated 2 months ago
- CS149 xmake version☆41Updated last year
- Spack package repository maintained by Student Cluster Competition Team @ Sun Yat-sen University.☆16Updated 7 months ago
- A lightweight and easy to use async IO library implemented with io_uring and C++20 coroutine.☆13Updated 4 months ago
- ☆113Updated last year
- Example code for Intel AVX / AVX2 intrinsics.☆138Updated last year
- 2022 ECS CloudBuild Distributed Cache Contest - Final Round https://tianchi.aliyun.com/competition/entrance/531982/introduction☆17Updated 2 years ago
- A C++ High Performance Web Server using io_uring and cpp20 coroutine☆123Updated 3 years ago
- ☆70Updated 2 years ago
- Stepwise optimizations of DGEMM on CPU, reaching performance faster than Intel MKL eventually, even under multithreading.☆148Updated 3 years ago
- 《C++模板元编程实战:一个深度学习框架的初步实现》☆183Updated 6 years ago
- ☆14Updated 6 years ago
- Rebuild YatSenOS On RISC-V 64.☆20Updated 3 years ago
- a course to help you learn C++20 coroutine and liburing☆93Updated last week
- allocation visualization in svg graph☆151Updated 11 months ago
- ☆40Updated 4 years ago
- implementation of floating-point radix sorting based on CUDA☆28Updated 5 years ago
- ☆78Updated last month
- C++ interfaces for RDMA access☆77Updated last week
- some demos for cpc☆13Updated 7 years ago
- parallelProgramingProject ========================= 《高级并行程序设计》课程报告代码附录 ----------------------------------- 目录结构:<br> cannon/ cannon算法…☆36Updated 11 years ago
- A toy compiler written in C++17 that translates SysY (a C-like toy language) into ARM-v7a assembly.☆138Updated 3 years ago
- CUDA PTX-ISA Document 中文翻译版☆42Updated last month
- ☆276Updated 4 years ago
- 华科七边形,欢迎各位朋友的指导与交流。☆29Updated 7 months ago
- ☆13Updated 2 years ago