satishphd / Teaching-Intel-Intrinsics-for-SIMD-Parallelism
Teaching Vectorization and SIMD using Intel Intrinsics in a Computer Organization and Architecture class
☆12Updated 2 weeks ago
Alternatives and similar repositories for Teaching-Intel-Intrinsics-for-SIMD-Parallelism:
Users who are interested in Teaching-Intel-Intrinsics-for-SIMD-Parallelism are comparing it to the libraries listed below:
- A fast implementation of log() and exp()☆53Updated 2 years ago
- Little OpenMP Library☆157Updated 2 years ago
- This is a mirror of the official libpfm4 git repository, https://sourceforge.net/p/perfmon2/libpfm4/ci/master/tree/ with some local branc…☆56Updated 4 months ago
- Code examples for tutoring modern C++☆92Updated last month
- SYCL Reference Manual☆27Updated 10 months ago
- ☆56Updated 2 weeks ago
- The Farm-SVE package provides a header that implements the ARM C language extensions (ACLE) for the ARM Scalable Vector Extension (SVE) i…☆14Updated last year
- pika is a C++ tasking library built on std::execution with fibers, CUDA, HIP, and MPI support.☆68Updated this week
- Library with JIT (Just-in-time) compilation support to optimize performance of small and medium matrix multiplication☆14Updated 3 years ago
- The Fancy Named Parameters Library☆30Updated 3 months ago
- A comparative, extendable benchmarking suite for C and C++ hash-table libraries.☆32Updated 9 months ago
- ☆136Updated last month
- Task graph-based asynchronous programming system using C++ coroutine☆87Updated last year
- Companion Repository for the Lecture Slides for the Clang Libraries☆87Updated last year
- ☆20Updated 2 years ago
- Programmatically obtain information about the pages backing a given memory region☆74Updated 3 years ago
- C++20 Coroutines and io_uring☆48Updated 2 years ago
- A header-only library implementing common mathematical functions using SIMD intrinsics☆98Updated 2 weeks ago
- x86-64, ARM, and RVV intrinsics viewer☆42Updated last week
- AVX-512 documentation beyond what Intel provides☆47Updated last year
- Header-only C++ library for low precision floating point type emulation.☆168Updated 5 years ago
- Boost.org proto module☆21Updated 2 months ago
- Library for lock-free locks☆77Updated last year
- SYCL Benchmark Suite☆61Updated last week
- Compiler-agnostic metaprogramming library providing concepts, type operations and tuples for C++ and CUDA☆83Updated last week
- ☆28Updated 2 years ago
- C++20 Static Branch library☆52Updated 6 months ago
- A minimal (really) out-of-tree MLIR example☆40Updated this week
- ☆17Updated 8 years ago
- Utilities to measure read access times of caches, memory, and hardware prefetches for simple and fused operations☆81Updated last year