satishphd / Teaching-Intel-Intrinsics-for-SIMD-ParallelismLinks
Teaching Vectorization and SIMD using Intel Intrinsics in a Computer Organization and Architecture class
☆15Updated 4 months ago
Alternatives and similar repositories for Teaching-Intel-Intrinsics-for-SIMD-Parallelism
Users that are interested in Teaching-Intel-Intrinsics-for-SIMD-Parallelism are comparing it to the libraries listed below
Sorting:
- ☆58Updated 3 weeks ago
- SYCL Reference Manual☆28Updated last year
- Little OpenMP Library☆162Updated 2 years ago
- A header only library implementing common mathematical functions using SIMD intrinsics☆108Updated 4 months ago
- InstLatX64_Demo☆43Updated last month
- ☆30Updated 2 years ago
- SYCL Benchmark Suite☆65Updated this week
- SYCL Conformance Tests☆70Updated this week
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆55Updated 3 months ago
- The Berkeley Container Library☆124Updated last year
- TPP experimentation on MLIR for linear algebra☆131Updated this week
- Task graph-based asynchronous programming system using C++ coroutine☆91Updated last year
- pika is a C++ tasking library built on std::execution with fibers, CUDA, HIP, and MPI support.☆73Updated last week
- immintrin_dbg.h is an include file, a wrapper around immintrin.h. It implements most of AVX, AVX2, AVX-512 vector intrinsics to enable so…☆56Updated 2 years ago
- ☆150Updated last week
- A fast implementation of log() and exp()☆53Updated 2 years ago
- Generate SQL from TableGen code - This is part of the tutorial "How to write a TableGen backend" in 2021 LLVM Developers' Meeting.☆29Updated 2 years ago
- SYCL Open Source Specification☆136Updated last week
- Library for lock-free locks☆82Updated 2 years ago
- Official BOLT Repository☆29Updated 10 months ago
- performance experiments for C++ exception handling☆30Updated 3 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆119Updated last week
- ☆63Updated 6 years ago
- "Hardware, Software, and Compilers! Oh My!" tutorial files☆16Updated 5 years ago
- A minimal (really) out-of-tree MLIR example☆44Updated 2 weeks ago
- ☆144Updated 3 weeks ago
- Collaborating on papers for the ISO C++ committee - public repo☆26Updated 10 months ago
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.☆40Updated 3 years ago
- Short examples illustrating AVX2 intrinsics for simple tasks.☆95Updated last year
- This is a mirror of the official libpfm4 git repository, https://sourceforge.net/p/perfmon2/libpfm4/ci/master/tree/ with some local branc…☆65Updated 8 months ago